problem with pyparsing - suppress

Hi,
I want learn pyparsing and got stuck with this:

Expand|Select|Wrap|Line Numbers

 
from pyparsing import *
 
body = ZeroOrMore(word)

begin = Keyword('begin story').suppress()

end = Keyword('end story').suppress()
 
sentence = begin + body + end 

print sentence.parseString("begin story once upon a time end story")

I'm getting error :
ParseException: Expected "end story" (at char 36), (line:1, col:37)

I don't get it why? end story is there isn't it?
thanks

Mar 2 '10 #1

Subscribe Post Reply

✓ answered by ptmcg

This is a very common issue with learning pyparsing. Pyparsing does not do any right-to-left backtracking like regex'es do. It is purely left-to-right. So make sure your repetition does not accidentally include the terminating sentinel value.

See embedded comments below:

Expand|Select|Wrap|Line Numbers

 
from pyparsing import * 
 
# define these up front

begin = Keyword('begin story').suppress() 

end = Keyword('end story').suppress() 

word=Word(alphas)  
 
# what you *really* mean by 'body' - you want

# ZeroOrMore words, as long as they aren't 'end story' -

# so just say that

body = ZeroOrMore(~end + word) 
 
# the rest is just like you had it

sentence = begin + body + end  

print sentence.parseString("begin story once upon a time end story")

prints:

Expand|Select|Wrap|Line Numbers

['once', 'upon', 'a', 'time']

-- Paul

2497

Glenton

391

Expert 256MB

Hi

Your line 3 should be "body = ZeroOrMore(Word(alphas))", right? Or did you already define word=Word(alphas).

Anyway, this is a classic "gotcha" in regular expressions. It always takes the longest string it can that matches the characteristics.

For example if you run it with sentence defined as begin+body you get
['once','upon','a','time','end','story']. In other words the "end story" is matched by the body. Then it comes to the end of the string and goes, "but where's the "end story" that was prophesied." *

It needs something to help it differentiate. D*mmit, man, it's a string-parser, not a mind-reader!!** But if you gave it something to work with:

Expand|Select|Wrap|Line Numbers

 from pyparsing import *
 
word=Word(alphas) 

body = ZeroOrMore(word)

begin = Keyword('begin story').suppress()

end = Keyword('$end story').suppress()
 
sentence = begin + body + end 

print sentence.parseString("begin story once upon a time $end story")

Good luck!

*might be over anthropomorphising the string parser
**might not be an exact Star Trek quote

Mar 3 '10 #2

kc2ine

LOL, it's not mind reader? shoot :)

but what if I want to have 'end story' ending tag without the dollar sign... :(

thanks Glenton anyway.

Mar 4 '10 #3

Glenton

391

Expert 256MB

Well, if you know it ends with ' end story', you could just use string slicing.

Expand|Select|Wrap|Line Numbers

 from pyparsing import *
 
word=Word(alphas) 

body = ZeroOrMore(word)

begin = Keyword('begin story').suppress()
 
sentence = begin + body
 
myString="begin story once upon a time end story"

print sentence.parseString(myString[:-10])

Mar 5 '10 #4

ptmcg

Expand|Select|Wrap|Line Numbers

 
from pyparsing import * 
 
# define these up front

begin = Keyword('begin story').suppress() 

end = Keyword('end story').suppress() 

word=Word(alphas)  
 
# what you *really* mean by 'body' - you want

# ZeroOrMore words, as long as they aren't 'end story' -

# so just say that

body = ZeroOrMore(~end + word) 
 
# the rest is just like you had it

sentence = begin + body + end  

print sentence.parseString("begin story once upon a time end story")

prints:

Expand|Select|Wrap|Line Numbers

['once', 'upon', 'a', 'time']

-- Paul

Mar 9 '10 #5

Similar topics

pyparsing

by: Bo¹tjan Jerko | last post by:

Hello ! I am trying to understand pyparsing. Here is a little test program to check Optional subclass: from pyparsing import Word,nums,Literal,Optional lbrack=Literal("").suppress()...

Python

Pyparsing: Non-greedy matching?

by: Peter Fein | last post by:

I'm trying to use pyparsing write a screenscraper. I've got some arbitrary HTML text I define as opener & closer. In between is the HTML data I want to extract. However, the data may contain the...

Python

pyparsing with nested table

by: astarocean | last post by:

using pyparsing to deal with nested tables , wanna keep table's structure and propertys . but program was chunked with the </td> tag of inner table. have any ideas? here's the program ...

Python

Parsing files -- pyparsing to the rescue?

by: rh0dium | last post by:

Hi all, I have a file which I need to parse and I need to be able to break it down by sections. I know it's possible but I can't seem to figure this out. The sections are broken by <> with...

Python

pyparsing: crash on empty element

by: gry | last post by:

I have: def unpack_sql_array(s): # unpack a postgres "array", e.g. "{'w1','w2','w3'}" into a list(str) import pyparsing as pp withquotes = pp.dblQuotedString.setParseAction(pp.removeQuotes)...

Python

problem with meteo datas

by: napolpie | last post by:

----Messaggio originale---- Da: napolpie@tin.it Data: 3-mag-2007 10.02 A: <python-list@python.org> Ogg: problem with meteo datas Hello, I'm Peter and I'm new in python codying and I'm using...

Python

help with pyparsing

by: Prabhu Gurumurthy | last post by:

-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 All, I have the following lines that I would like to parse in python using pyparsing, but have some problems forming the grammar. Line in...

Python

Problem with processing XML

by: John Carlyle-Clarke | last post by:

Hi. I'm new to Python and trying to use it to solve a specific problem. I have an XML file in which I need to locate a specific text node and replace the contents with some other text. The...

Python

More fun with PyParsing - almost did it on my own..

by: rh0dium | last post by:

Hi all, I almost did my first pyparsing without help but here we go again. Let's start with my code. The sample data is listed below. # This will gather the following ( "NamedPin"...

Python

pyparsing problem

by: name | last post by:

Hi, I try to parse a file with pyparsing and get this output: - alias: host alias xyz - host_name: - ip_address: - use:

Python

Migrating Website to Cloud - Emmanuel Katto

by: emmanuelkatto | last post by:

Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel

General

Looking to do Android software development, any suggestions? Is flutter better?

by: nemocccc | last post by:

hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?

General

How to build RAID in BIOS?

by: Hystou | last post by:

There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...

Computer Hardware

What is ONU?

by: marktang | last post by:

ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...

General

Maximizing Business Potential: The Nexus of Website Design and Digital Marketing

by: jinu1996 | last post by:

In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...

Online Marketing

The easy way to turn off automatic updates for Windows 10/11

by: Hystou | last post by:

Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...

Windows Server

Discussion: How does Zigbee compare with other wireless protocols in smart home applications?

by: tracyyun | last post by:

Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...

General

AI Job Threat for Devs

by: agi2029 | last post by:

Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing,...

Career Advice

Couldn’t get equations in html when convert word .docx file to html file in C#.

by: conductexam | last post by:

I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and...

C# / C Sharp