473,804 Members | 3,424 Online
Bytes | Software Development & Data Engineering Community
+ Post

Home Posts Topics Members FAQ

Regular expressions

Folks,

I've started using regular expressions for parsing some data string
that come along - works quite nicely, however, as a newbie to RE, I'm
still struggling with some special cases:

1) I get strings representing decimal values, and they have the
following format:
* first, a plus or minus sign
* then, a number of numeric chars
* if the precision is > 0, then a dot and "precision" number of
numerical characters will follow

This last thing is what's killing me - how can I put the "conditial"
thing ("if precision > 0, then a dot and digits) into a single regular
expression?

What I have now is this:

[+-]\d{x,x}.\d{p,p}

which works fine, as long as I have a DECIMAL number with x digits
before and p digits after the comma (e.g. +120.45 if x=3 and p=2).

However, if p=0, this string won't be matched +120 since it doesn't
have the dot at the end.........

Any ideas??

2) For a date, which comes along as DD.MM.YYYY, I'd like to be able to
match both the cases where DD is either "01" or " 1" (leading 0 or
space). I tried various things, but nothing seems to work - it seems
if I allow it to have a leading whitespace, it'll also match "
01.12.2003" (leading whitespace and then two digits for the day) -
this is *not* what I want!

I've tried \d{2,2}.\d{2,2} .\d{4,4}, which works fine if the day is
specified with a leading zero - but will fail if I pass it "
1.12.2003". I also tried (\s\d|\d{2,2}). \d{2,2}.\d{4,4} , but as I
mentioned - in this case, all these dates are being matched:
"01.01.2003 " - okay
" 1.01.2003" - okay
" 01.01.2003" - NOT okay (leading zero + 2 digits)

Any takers?

Thanks!
Marc

=============== =============== =============== =============== ====
Marc Scheuner May The Source Be With You!
Bern, Switzerland m.scheuner(at)i nova.ch
Nov 15 '05 #1
6 9019
Got another riddle:

how can I make a regular expression that will match all of the
following:

DECIMAL
DECIMAL(5)
DECIMAL(15, 5) (fifteen-comma-space-five)

Anyone? I've tried numerous expressions - either I get too much
(everything), or I get the two last ones (with the parenthesis) - but
I can't seem to make it work for all three cases in just one
expression..... ..

Thanks!
Marc

=============== =============== =============== =============== ====
Marc Scheuner May The Source Be With You!
Bern, Switzerland m.scheuner(at)i nova.ch
Nov 15 '05 #2
For #1, I don't know of a way to match based on
arithmetical comparisons (like p>0), so you would probably
have to use two different expressions:

if p>0 -> [+-]\d{x}.\d{p}
if p=0 -> [+-]\d{x}

This expression will make sure that a result does not end
with only a decimal point, but it will accept -123 even if
p>0:

[+-]\d{x}(?(\.\d).\ d{p})$
Additionally, if you just wanted to make sure that no more
than p digits followed the decimal point, you could use
this:

[+-]\d{x}(?(\.\d).\ d{0,p})$
----------------------------------------
For #2, The last expression you listed:

(\s\d|\d{2,2}). \d{2,2}.\d{4,4}

matches corredtly. If you check the Match object's value,
it will not include the leading space if a leading zero
exists. If you want to make sure that the input string
contains only what you are trying to match, place ^
(beginning of string) at the beginning and $(end of
string) at the end:

^(\s\d|\d{2}).\ d{2}.\d{4}$
Hope this helps,

Brian Davis
www.knowdotnet.com

-----Original Message-----
Folks,

I've started using regular expressions for parsing some data stringthat come along - works quite nicely, however, as a newbie to RE, I'mstill struggling with some special cases:

1) I get strings representing decimal values, and they have thefollowing format:
* first, a plus or minus sign
* then, a number of numeric chars
* if the precision is > 0, then a dot and "precision" number ofnumerical characters will follow

This last thing is what's killing me - how can I put the "conditial"thing ("if precision > 0, then a dot and digits) into a single regularexpression?

What I have now is this:

[+-]\d{x,x}.\d{p,p}

which works fine, as long as I have a DECIMAL number with x digitsbefore and p digits after the comma (e.g. +120.45 if x=3 and p=2).
However, if p=0, this string won't be matched +120 since it doesn'thave the dot at the end.........

Any ideas??

2) For a date, which comes along as DD.MM.YYYY, I'd like to be able tomatch both the cases where DD is either "01" or " 1" (leading 0 orspace). I tried various things, but nothing seems to work - it seemsif I allow it to have a leading whitespace, it'll also match "01.12.2003" (leading whitespace and then two digits for the day) -this is *not* what I want!

I've tried \d{2,2}.\d{2,2} .\d{4,4}, which works fine if the day isspecified with a leading zero - but will fail if I pass it "1.12.2003". I also tried (\s\d|\d{2,2}). \d{2,2}.\d{4,4} , but as Imentioned - in this case, all these dates are being matched: "01.01.2003 " - okay
" 1.01.2003" - okay
" 01.01.2003" - NOT okay (leading zero + 2 digits)

Any takers?

Thanks!
Marc

============== =============== =============== ============== ======Marc Scheuner May The Source Be With You!Bern, Switzerland m.scheuner(at) inova.ch.

Nov 15 '05 #3

Try this one:

^DECIMAL(\(\d+( , \d+)?\))?$
Brian Davis
www.knowdotnet.com
-----Original Message-----
Got another riddle:

how can I make a regular expression that will match all of thefollowing:

DECIMAL
DECIMAL(5)
DECIMAL(15, 5) (fifteen-comma-space-five)

Anyone? I've tried numerous expressions - either I get too much(everything) , or I get the two last ones (with the parenthesis) - butI can't seem to make it work for all three cases in just oneexpression.... ...

Thanks!
Marc

============== =============== =============== ============== ======Marc Scheuner May The Source Be With You!Bern, Switzerland m.scheuner(at) inova.ch.

Nov 15 '05 #4
Marc,

The usual way to do what you want is to use the '?' quantifier, which means
"match 0 or 1 time". So, to add the plus or minus, you get:

(+|-)?\d+

plus or minus one or zero times followed by one or more digits.

Add in the decimal part:

(+|-)?\d+(\.\d+)?

adds in "." followed by one or more digits, match the whole thing zero or
one time.
Lastly, you might want to download my regular expression workbench at the
csharp site below. It will make playing with regex much easier.

--
Eric Gunnerson

Visit the C# product team at http://www.csharp.net
Eric's blog is at http://blogs.gotdotnet.com/ericgu/

This posting is provided "AS IS" with no warranties, and confers no rights.
"Marc Scheuner [MVP ADSI]" <m.********@ino va.SPAMBEGONE.c h> wrote in message
news:hd******** *************** *********@4ax.c om...
Folks,

I've started using regular expressions for parsing some data string
that come along - works quite nicely, however, as a newbie to RE, I'm
still struggling with some special cases:

1) I get strings representing decimal values, and they have the
following format:
* first, a plus or minus sign
* then, a number of numeric chars
* if the precision is > 0, then a dot and "precision" number of
numerical characters will follow

This last thing is what's killing me - how can I put the "conditial"
thing ("if precision > 0, then a dot and digits) into a single regular
expression?

What I have now is this:

[+-]\d{x,x}.\d{p,p}

which works fine, as long as I have a DECIMAL number with x digits
before and p digits after the comma (e.g. +120.45 if x=3 and p=2).

However, if p=0, this string won't be matched +120 since it doesn't
have the dot at the end.........

Any ideas??

2) For a date, which comes along as DD.MM.YYYY, I'd like to be able to
match both the cases where DD is either "01" or " 1" (leading 0 or
space). I tried various things, but nothing seems to work - it seems
if I allow it to have a leading whitespace, it'll also match "
01.12.2003" (leading whitespace and then two digits for the day) -
this is *not* what I want!

I've tried \d{2,2}.\d{2,2} .\d{4,4}, which works fine if the day is
specified with a leading zero - but will fail if I pass it "
1.12.2003". I also tried (\s\d|\d{2,2}). \d{2,2}.\d{4,4} , but as I
mentioned - in this case, all these dates are being matched:
"01.01.2003 " - okay
" 1.01.2003" - okay
" 01.01.2003" - NOT okay (leading zero + 2 digits)

Any takers?

Thanks!
Marc

=============== =============== =============== =============== ====
Marc Scheuner May The Source Be With You!
Bern, Switzerland m.scheuner(at)i nova.ch

Nov 15 '05 #5
>The usual way to do what you want is to use the '?' quantifier, which means
"match 0 or 1 time". So, to add the plus or minus, you get:
Add in the decimal part:
(+|-)?\d+(\.\d+)?
Yeah, I stumbled across that after a while of *not* looking at my
regex :-) Thanks.
Lastly, you might want to download my regular expression workbench at the
csharp site below. It will make playing with regex much easier.


Excellent, thanks so much!

Marc

=============== =============== =============== =============== ====
Marc Scheuner May The Source Be With You!
Bern, Switzerland m.scheuner(at)i nova.ch
Nov 15 '05 #6
>Try this one:
^DECIMAL(\(\d+ (, \d+)?\))?$


Thanks - it would work, trouble is, I also need to recognize other
types such as DATE, INTEGER, CHAR(x), VARCHAR(y) and so forth, and
they're not on a line of their own (so I can't use the ^ and $
delimiters).

I think I got it figured out by now - thanks for your input! Highly
appreciated.

Marc

=============== =============== =============== =============== ====
Marc Scheuner May The Source Be With You!
Bern, Switzerland m.scheuner(at)i nova.ch
Nov 15 '05 #7

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

8
2433
by: Michael McGarry | last post by:
Hi, I am horrible with Regular Expressions, can anyone recommend a book on it? Also I am trying to parse the following string to extract the number after load average. ".... load average: 0.04, 0.02, 0.01" how can I extract this number with RE or otherwise?
1
4187
by: Kenneth McDonald | last post by:
I'm working on the 0.8 release of my 'rex' module, and would appreciate feedback, suggestions, and criticism as I work towards finalizing the API and feature sets. rex is a module intended to make regular expressions easier to create and use (and in my experience as a regular expression user, it makes them MUCH easier to create and use.) I'm still working on formal documentation, and in any case, such documentation isn't necessarily the...
2
5105
by: Sehboo | last post by:
Hi, I have several regular expressions that I need to run against documents. Is it possible to combine several expressions in one expression in Regex object. So that it is faster, or will I have to use all the expressions seperately? Here are my regular expressions that check for valid email address and link Dim Expression As String =
4
5187
by: Együd Csaba | last post by:
Hi All, I'd like to "compress" the following two filter expressions into one - assuming that it makes sense regarding query execution performance. .... where (adate LIKE "2004.01.10 __:30" or adate LIKE "2004.01.10 __:15") .... into something like this: .... where adate LIKE "2004.01.10 __:(30/15)" ...
7
3833
by: Billa | last post by:
Hi, I am replaceing a big string using different regular expressions (see some example at the end of the message). The problem is whenever I apply a "replace" it makes a new copy of string and I want to avoid that. My question here is if there is a way to pass either a memory stream or array of "find", "replace" expressions or any other way to avoid multiple copies of a string. Any help will be highly appreciated
3
3028
by: a | last post by:
I'm a newbie needing to use some Regular Expressions in PHP. Can I safely use the results of my tests using 'The Regex Coach' (http://www.weitz.de/regex-coach/index.html) Are the Regular Expressions used in Perl identical to the Regular Expressions in PHP?
25
5177
by: Mike | last post by:
I have a regular expression (^(.+)(?=\s*).*\1 ) that results in matches. I would like to get what the actual regular expression is. In other words, when I apply ^(.+)(?=\s*).*\1 to " HEART (CONDUCTION DEFECT) 37.33/2 HEART (CONDUCTION DEFECT) WITH CATHETER 37.34/2 " the expression is "HEART (CONDUCTION DEFECT)". How do I gain access to the expression (not the matches) at runtime? Thanks, Mike
1
4388
by: Allan Ebdrup | last post by:
I have a dynamic list of regular expressions, the expressions don't change very often but they can change. And I have a single string that I want to match the regular expressions against and find the first regular expression that matches the string. I've gor the regular expressions ordered so that the highest priority is first (if two or more regular expressions match the string I want the first one returned) The code that does this has...
13
7497
by: Wiseman | last post by:
I'm kind of disappointed with the re regular expressions module. In particular, the lack of support for recursion ( (?R) or (?n) ) is a major drawback to me. There are so many great things that can be accomplished with regular expressions this way, such as validating a mathematical expression or parsing a language with nested parens, quoting or expressions. Another feature I'm missing is once-only subpatterns and possessive quantifiers...
12
2494
by: FAQEditor | last post by:
Anybody have any URL's to tutorials and/or references for Regular Expressions? The four I have so far are: http://docs.sun.com/source/816-6408-10/regexp.htm http://en.wikipedia.org/wiki/Regular_expression http://www.regular-expressions.info/javascript.html http://www.webreference.com/js/column5/
0
10593
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers, it seems that the internal comparison operator "<=>" tries to promote arguments from unsigned to signed. This is as boiled down as I can make it. Here is my compilation command: g++-12 -std=c++20 -Wnarrowing bit_field.cpp Here is the code in...
1
10329
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows Update option using the Control Panel or Settings app; it automatically checks for updates and installs any it finds, whether you like it or not. For most users, this new feature is actually very convenient. If you want to control the update process,...
0
10085
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each protocol has its own unique characteristics and advantages, but as a user who is planning to build a smart home system, I am a bit confused by the choice of these technologies. I'm particularly interested in Zigbee because I've heard it does some...
0
9163
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing, and deployment—without human intervention. Imagine an AI that can take a project description, break it down, write the code, debug it, and then launch it, all on its own.... Now, this would greatly impact the work of software developers. The idea...
1
7626
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new presenter, Adolph Dupré who will be discussing some powerful techniques for using class modules. He will explain when you may want to use classes instead of User Defined Types (UDT). For example, to manage the data in unbound forms. Adolph will...
0
6858
by: conductexam | last post by:
I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and then checking html paragraph one by one. At the time of converting from word file to html my equations which are in the word document file was convert into image. Globals.ThisAddIn.Application.ActiveDocument.Select();...
0
5663
by: adsilva | last post by:
A Windows Forms form does not have the event Unload, like VB6. What one acts like?
1
4304
by: 6302768590 | last post by:
Hai team i want code for transfer the data from one system to another through IP address by using C# our system has to for every 5mins then we have to update the data what the data is updated we have to send another system
3
3000
bsmnconsultancy
by: bsmnconsultancy | last post by:
In today's digital era, a well-designed website is crucial for businesses looking to succeed. Whether you're a small business owner or a large corporation in Toronto, having a strong online presence can significantly impact your brand's success. BSMN Consultancy, a leader in Website Development in Toronto offers valuable insights into creating effective websites that not only look great but also perform exceptionally well. In this comprehensive...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.