On 11/07/2006 12:32 PM, yichao.zhang wrote:
I'm trying to match the characters from u'\uff00' to u'\uff0f'.
the code below and get a TypeError.
p = re.compile(u'\uff00'-u'\uff0f')
That is not a valid regex. It is an attempt to subtract one unicode char
from another, but this is (correctly) not supported, as the error
message says. re.compile expects a string (8-bit or Unicode).
If you wanted to match ASCII characters from 'A' to 'Z', you wouldn't
put re.compile('A'-'Z'), would you? Well I hope not, I hope you would
use re.compile('[A-Z]') -- does that give you a clue?
Traceback (most recent call last):
File "<interactive input>", line 1, in ?
TypeError: unsupported operand type(s) for -: 'unicode' and 'unicode'
so re module does NOT support this operation
Incorrect conclusion. The argument that you attempted to supply is not a
valid expression and thus not a valid argument for *any* function. It
was intercepted before it got to re.compile. re.compile is innocent.
however, is there any alternative way to solve my problem?
Any comments/suggestions much appreciated!
1. Read the fantastic manual.
2. Learn to understand error messages.
3. Assume the most plausible cause (you stuffed up, not the people who
worked on the re module).
HTH,
John