re:
4.2.1 Regular Expression Syntax http://docs.python.org/lib/re-syntax.html
*?, +?, ??
Adding "?" after the qualifier makes it perform the match in non-greedy or
minimal fashion; as few characters as possible will be matched.
the regular expression module fails to perform non-greedy matches as
described in the documentation: more than "as few characters as possible"
are matched.
this is a bug and it needs to be fixed.
examples follow.
lothar@erda /ntd/vl
$ cat vwre.py
#! /usr/bin/env python
import re
vwre = re.compile("V.* ?W")
vwlre = re.compile("V.* ?WL")
if __name__ == "__main__":
newdoc = "V1WVVV2WWW "
vwli = re.findall(vwre , newdoc)
print "vwli[], expect", ['V1W', 'V2W']
print "vwli[], return", vwli
newdoc = "V1WLV2WV3WV4WL V5WV6WL"
vwlli = re.findall(vwlr e, newdoc)
print "vwlli[], expect", ['V1WL', 'V4WL', 'V6WL']
print "vwlli[], return", vwlli
lothar@erda /ntd/vl
$ python vwre.py
vwli[], expect ['V1W', 'V2W']
vwli[], return ['V1W', 'VVV2W']
vwlli[], expect ['V1WL', 'V4WL', 'V6WL']
vwlli[], return ['V1WL', 'V2WV3WV4WL', 'V5WV6WL']
lothar@erda /ntd/vl
$ python -V
Python 2.3.3
Jul 18 '05
12 4426
"lothar" wrote: with respect to the documentation, the module is broken.
nope.
the module does not necessarily deliver a "minimal length" match for a non-greedy pattern.
it isn't supposed to: a regular expression describes a *set* of matching
strings, and the engine is free to return any string from that set. Python's
engine returns the *first* string it finds that belongs to the set. if you use
a non-greedy operator, the engine will return the first non-greedy match
it finds, not the overall shortest non-greedy match.
if you don't want to understand how regular expressions work, don't use
them.
</F>
a non-greedy match is implicitly defined in the documentation to be one such
that there is no proper substring in the return which could also match the
regex.
the documentation implies the module will return a non-greedy match.
the module does not return a non-greedy match.
"Fredrik Lundh" <fr*****@python ware.com> wrote in message
news:ma******** *************** *************** @python.org... "lothar" wrote:
with respect to the documentation, the module is broken. nope.
the module does not necessarily deliver a "minimal length" match for a non-greedy pattern.
it isn't supposed to: a regular expression describes a *set* of matching strings, and the engine is free to return any string from that set.
Python's engine returns the *first* string it finds that belongs to the set. if
you use a non-greedy operator, the engine will return the first non-greedy match it finds, not the overall shortest non-greedy match.
if you don't want to understand how regular expressions work, don't use them.
</F>
"lothar" wrote: a non-greedy match is implicitly defined in the documentation to be one such that there is no proper substring in the return which could also match the regex.
no, that's not what it says. this is what is says:
Adding "?" after the qualifier makes it perform the match in non-greedy
or minimal fashion; as few characters as possible will be matched.
note that it says "qualifier" (that is, the preceeding *, +, or ? operator). it
doesn't say that the *entire* regex should be non-greedy. it does not say
that search, findall, sub etc. should look for the shortest possible overall
match. all it says is that the preceeding operator, and that operator only,
should look for the shortest possible match, rather than the longest.
the module does not return a non-greedy match.
it does. the problem is all in your head.
</F> This thread has been closed and replies have been disabled. Please start a new discussion. Similar topics |
by: Josiah Carlson |
last post by:
Good day everyone,
I have produced a patch against the latest CVS to add support for two
new formatting characters in the struct module. It is currently an RFE,
which I include a link to at the end of this post. Please read the
email before you respond to it.
Generally, the struct module is for packing and unpacking of binary
data. It includes support to pack and unpack the c types:
byte, char, short, long, long long, char, *, and...
|
by: dody suria wijaya |
last post by:
I found this problem when trying to split a module into two.
Here's an example:
==============
#Module a (a.py):
from b import *
class Main: pass
==============
==============
|
by: Steven Bethard |
last post by:
In the "empty classes as c structs?" thread, we've been talking in some
detail about my proposed "generic objects" PEP. Based on a number of
suggestions, I'm thinking more and more that instead of a single
collections type, I should be proposing a new "namespaces" module
instead. Some of my reasons:
(1) Namespace is feeling less and less like a collection to me. Even
though it's still intended as a data-only structure, the use cases...
|
by: Xah Lee |
last post by:
Python Doc Problem Example: gzip
Xah Lee, 20050831
Today i need to use Python to compress/decompress gzip files. Since
i've read the official Python tutorial 8 months ago, have spent 30
minutes with Python 3 times a week since, have 14 years of computing
experience, 8 years in mathematical computing and 4 years in unix admin
and perl, i have quickly found the official doc:
http://python.org/doc/2.4.1/lib/module-gzip.html
|
by: Agnes |
last post by:
I want to write a program with many sub-method.
for example,
1)method :company_search(code) which return name,addresss...etc
2)method:currency(code) which return the current exchange rate....etc
..... manys
Should I write it use module ??? or in code file ??
What is the difference about it ?
Thanks
From Agnes
| |
by: sophie_newbie |
last post by:
OK this might seem like a retarded question, but what is the difference
between a library and a module?
If I do:
import string
am I importing a module or a library?
And if i do string.replace() am I using a module or a function or a
|
by: André |
last post by:
Hi,
i'm developping asp.net applications and therefore i use VB.net. I have some
questions about best practises.
According what i read about class and module and if i understand it right, a
module does the same as a class but cannot herite or be herited. 1)Is that
right?
2) So i guess this module does exactly the same as the class?
|
by: Frank Aune |
last post by:
Hello,
I just recently found out that wx.lib.pubsub has finally moved away from wx,
and now lives at:
http://pubsub.wiki.sourceforge.net
I'm trying to use pubsub3, which is the third version and now the default one,
but I'm having a hard time creating topics and messages for sending:
|
by: Harishankar |
last post by:
Hi,
Sorry to start off on a negative note in the list, but I feel that the Python
subprocess module is sorely deficient because it lacks a mechanism to:
1. Create non-blocking pipes which can be read in a separate thread (I am
currently writing a mencoder GUI in Tkinter and need a full fledged process
handler to control the command line and to display the progress in a
text-box)
|
by: emallove |
last post by:
I'm running into the below "No modules named _sha256" issue, with a
python installed in a non-standard location.
$ python
Python 2.5.2 (r252:60911, May 20 2008, 09:46:50)
on linux2
Type "help", "copyright", "credits" or "license" for more information.
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/ws/ompi-tools/lib/python2.5/md5.py", line 6, in <module>
|
by: Hystou |
last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can effortlessly switch the default language on Windows 10 without reinstalling. I'll walk you through it.
First, let's disable language synchronization. With a Microsoft account, language settings sync across devices. To prevent any complications,...
| |
by: Oralloy |
last post by:
Hello folks,
I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>".
The problem is that using the GNU compilers, it seems that the internal comparison operator "<=>" tries to promote arguments from unsigned to signed.
This is as boiled down as I can make it.
Here is my compilation command:
g++-12 -std=c++20 -Wnarrowing bit_field.cpp
Here is the code in...
|
by: tracyyun |
last post by:
Dear forum friends,
With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each protocol has its own unique characteristics and advantages, but as a user who is planning to build a smart home system, I am a bit confused by the choice of these technologies. I'm particularly interested in Zigbee because I've heard it does some...
|
by: isladogs |
last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM).
In this session, we are pleased to welcome a new presenter, Adolph Dupré who will be discussing some powerful techniques for using class modules.
He will explain when you may want to use classes instead of User Defined Types (UDT). For example, to manage the data in unbound forms.
Adolph will...
|
by: conductexam |
last post by:
I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and then checking html paragraph one by one.
At the time of converting from word file to html my equations which are in the word document file was convert into image.
Globals.ThisAddIn.Application.ActiveDocument.Select();...
|
by: TSSRALBI |
last post by:
Hello
I'm a network technician in training and I need your help.
I am currently learning how to create and manage the different types of VPNs and I have a question about LAN-to-LAN VPNs.
The last exercise I practiced was to create a LAN-to-LAN VPN between two Pfsense firewalls, by using IPSEC protocols.
I succeeded, with both firewalls in the same network. But I'm wondering if it's possible to do the same thing, with 2 Pfsense firewalls...
|
by: adsilva |
last post by:
A Windows Forms form does not have the event Unload, like VB6. What one acts like?
| |
by: 6302768590 |
last post by:
Hai team
i want code for transfer the data from one system to another through IP address by using C# our system has to for every 5mins then we have to update the data what the data is updated we have to send another system
|
by: bsmnconsultancy |
last post by:
In today's digital era, a well-designed website is crucial for businesses looking to succeed. Whether you're a small business owner or a large corporation in Toronto, having a strong online presence can significantly impact your brand's success. BSMN Consultancy, a leader in Website Development in Toronto offers valuable insights into creating effective websites that not only look great but also perform exceptionally well. In this comprehensive...
| |