473,406 Members | 2,633 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,406 software developers and data experts.

Regular expression - misbehaviour

Hi,

why oh why doesn't this work as per any other language I
have used regexes with;
--------
Regex r = new Regex("[A-Z]{2}");
bool test = r.IsMatch("ACD");
--------

The boolean 'test' is true, despite the {2} quantifier
specifies 'exactly two'. Why is this?

Perl/ Java performs this match correctly - e.g. false. Is
this a bad implementation of regexes? Or am I missing
something?

Mark

Nov 16 '05 #1
6 1524
I disagree. I don't know Perl, but I would be horrified if Java didn't
match this as well.

Yes, your expression states that you want to match exactly two
uppercase letters, but remember that the pattern matcher returns true
if the pattern occurs _anywhere within_ the input string. So, your test
matches two places in the string: the "AC" of "ACD", and the "CD" of
"ACD", so of course it returns true.

To state that you want the pattern to have to match the whole string,
you need to say,

Regex r = new Regex("^[A-Z]{2}$");

do you not?

Nov 16 '05 #2
Mark,

Try this out

^([A-Z]{2,2})$

HTH

Ollie Riches

"Mark" <an*******@discussions.microsoft.com> wrote in message
news:2a****************************@phx.gbl...
Hi,

why oh why doesn't this work as per any other language I
have used regexes with;
--------
Regex r = new Regex("[A-Z]{2}");
bool test = r.IsMatch("ACD");
--------

The boolean 'test' is true, despite the {2} quantifier
specifies 'exactly two'. Why is this?

Perl/ Java performs this match correctly - e.g. false. Is
this a bad implementation of regexes? Or am I missing
something?

Mark

Nov 16 '05 #3
I disagree. I don't know Perl, but I would be horrified if Java didn't
match this as well.

Yes, your expression states that you want to match exactly two
uppercase letters, but remember that the pattern matcher returns true
if the pattern occurs _anywhere within_ the input string. So, your test
matches two places in the string: the "AC" of "ACD", and the "CD" of
"ACD", so of course it returns true.

To state that you want the pattern to have to match the whole string,
you need to say,

Regex r = new Regex("^[A-Z]{2}$");

do you not?

Nov 16 '05 #4
Thanks for this. Java 1.4's Pattern object compiles the
expression, and treats it as a complete pattern match e.g.
it seems to imply the ^ and $ is there and tests for the
whole pattern regardless using the Matcher object. Not
sure why this is though??

I agree inclusion of ^ and $ is standard Regex stuff, and
C# works OK.
-----Original Message-----
I disagree. I don't know Perl, but I would be horrified if Java didn'tmatch this as well.

Yes, your expression states that you want to match exactly twouppercase letters, but remember that the pattern matcher returns trueif the pattern occurs _anywhere within_ the input string. So, your testmatches two places in the string: the "AC" of "ACD", and the "CD" of"ACD", so of course it returns true.

To state that you want the pattern to have to match the whole string,you need to say,

Regex r = new Regex("^[A-Z]{2}$");

do you not?

.

Nov 16 '05 #5
RCS
After a long weekend of working with regular expressions and wanting to take
a bottle of pills by the end, I ran across "The Regulator" - you put in your
critieria and it spits out the regular expression

http://regex.osherove.com/

Seems to me, it's very easy to have human-error with regular expressions
because they are so non-intuitive..
"Mark" <an*******@discussions.microsoft.com> wrote in message
news:2a****************************@phx.gbl...
Hi,

why oh why doesn't this work as per any other language I
have used regexes with;
--------
Regex r = new Regex("[A-Z]{2}");
bool test = r.IsMatch("ACD");
--------

The boolean 'test' is true, despite the {2} quantifier
specifies 'exactly two'. Why is this?

Perl/ Java performs this match correctly - e.g. false. Is
this a bad implementation of regexes? Or am I missing
something?

Mark

Nov 16 '05 #6
That's odd. How would you find out if a pattern matches any part of a
string using Java's pattern matching routines, then? Do you have to put
..* before and after the pattern?

Nov 16 '05 #7

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

1
by: Kenneth McDonald | last post by:
I'm working on the 0.8 release of my 'rex' module, and would appreciate feedback, suggestions, and criticism as I work towards finalizing the API and feature sets. rex is a module intended to make...
4
by: Buddy | last post by:
Can someone please show me how to create a regular expression to do the following My text is set to MyColumn{1, 100} Test I want a regular expression that sets the text to the following...
4
by: Neri | last post by:
Some document processing program I write has to deal with documents that have headers and footers that are unnecessary for the main processing part. Therefore, I'm using a regular expression to go...
11
by: Dimitris Georgakopuolos | last post by:
Hello, I have a text file that I load up to a string. The text includes certain expression like {firstName} or {userName} that I want to match and then replace with a new expression. However,...
3
by: James D. Marshall | last post by:
The issue at hand, I believe is my comprehension of using regular expression, specially to assist in replacing the expression with other text. using regular expression (\s*) my understanding is...
7
by: Billa | last post by:
Hi, I am replaceing a big string using different regular expressions (see some example at the end of the message). The problem is whenever I apply a "replace" it makes a new copy of string and I...
25
by: Mike | last post by:
I have a regular expression (^(.+)(?=\s*).*\1 ) that results in matches. I would like to get what the actual regular expression is. In other words, when I apply ^(.+)(?=\s*).*\1 to " HEART...
1
by: Allan Ebdrup | last post by:
I have a dynamic list of regular expressions, the expressions don't change very often but they can change. And I have a single string that I want to match the regular expressions against and find...
1
by: NvrBst | last post by:
I want to use the .replace() method with the regular expression /^ %VAR % =,($|&)/. The following DOESN'T replace the "^default.aspx=,($|&)" regular expression with "":...
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
by: Hystou | last post by:
There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...
0
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
0
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...
0
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...
0
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...
0
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.