473,387 Members | 1,575 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,387 software developers and data experts.

effbot TidyHTMLTreeBuilder problem

Hi all,

I'm using TidyHTMLTreeBuilder to model syntax structure of HTML
documents. I've been trying to feed in Yahoo and CNN, but the parser
seems to crash:

" File
"C:\Python23\Lib\site-packages\elementtidy\TidyHTMLTreeBuilder.py",
line 89, in parse
return ElementTree.parse(source, TreeBuilder())
File "C:\Python23\lib\site-packages\elementtree\ElementTree.py", line
865, in parse
tree.parse(source, parser)
File "C:\Python23\lib\site-packages\elementtree\ElementTree.py", line
590, in parse
self._root = parser.close()
File
"C:\Python23\Lib\site-packages\elementtidy\TidyHTMLTreeBuilder.py",
line 75, in close
return ElementTree.XML(stdout)
File "C:\Python23\lib\site-packages\elementtree\ElementTree.py", line
879, in XML
return parser.close()
File "C:\Python23\lib\site-packages\elementtree\ElementTree.py", line
1169, in close
self._parser.Parse("", 1) # end of data
ExpatError: no element found: line 1, column 0"

Could someone else please try it on their system and see if they also
have the same problem? I suspect this problem relates to <form> inside
<table>.

Thank you very much for any help.

Cheers,
Michael

Jul 18 '05 #1
0 1035

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

1
by: dayzman | last post by:
Hi, Is anyone here familiar with ElementTree by effbot? With <html><body>hello</body></html> how is "hello" stored in the element tree? Which node is it under? Similarly, with: foo <a href =...
3
by: Erik Bethke | last post by:
Hello All, So I have been using Effbot's XML stuff a lot! And I have been using py2exe, pygame and wxPython all mixed together... I am getting this strange error log when writing XML files...
0
by: Michael Spencer | last post by:
What is the recommended way to change the icon of the exe ExeMaker* produces? (I tried replacing the exemaker.ico file, and indeed removing it; but that had no effect.) Thanks Michael ...
0
by: David Murmann | last post by:
http://effbot.org/F ;)
1
by: Méta-MCI | last post by:
Good evening! I installed the Console of EFFBOT (http://effbot.org/downloads/#console). It functions well. It's a very fun/friendly tool. Except a detail: when I send (by console.write()) more...
0
by: skip | last post by:
The PSU warns you to keep on the lookout for posts from the anti-effbot. It's rumored that he recently turned up in this quadrant of the galaxy. His posts tend to be somewhat illogical, suggest a...
14
by: Méta-MCI | last post by:
Hi! (***sorry for my approximative english***) A few months ago, I needed a console, under Windows. After several research, I selected the console of EffBot. Thank you very much,...
4
by: fraleysinger | last post by:
Downloaded to Knoppix 5.1: : aggdraw-1.2a3-20060212.tar.gz Followed README. Wouldn't compile. Couldn't find way of contacting Effbot directly. PIL stuff comes with Knoppix, but specs on...
1
by: spdegabrielle | last post by:
Sorry, I'm new to python and was trying to get imageTK; this led me to try find PIL, but pythonware and effbot both seem to be offline. I can't find any mention of an outage on python.org, this...
0
by: taylorcarr | last post by:
A Canon printer is a smart device known for being advanced, efficient, and reliable. It is designed for home, office, and hybrid workspace use and can also be used for a variety of purposes. However,...
0
by: ryjfgjl | last post by:
If we have dozens or hundreds of excel to import into the database, if we use the excel import function provided by database editors such as navicat, it will be extremely tedious and time-consuming...
0
by: ryjfgjl | last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
by: Hystou | last post by:
There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...
0
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
0
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.