By using this site, you agree to our updated Privacy Policy and our Terms of Use. Manage your Cookies Settings.
459,252 Members | 1,684 Online
Bytes IT Community
+ Ask a Question
Need help? Post your question and get tips & solutions from a community of 459,252 IT Pros & Developers. It's quick & easy.

htmldata 1.0.4 - Manipulate HTML documents via data structure

P: n/a

htmldata 1.0.4 is available.

http://oregonstate.edu/~barnesc/htmldata/

The htmldata module allows one to translate HTML
documents back and forth to list data structures.
This allows for programmatic reading and writing
of HTML documents, with much flexibility.

Functions are also available for extracting
and/or modifying all URLs present in the HTML
or stylesheets of a document.

Version 1.0.4 is a bugfix release offering:
* Python 2.0-2.4 support (thanks to Paul Clinch
for the Python 2.2 patch)
* Properly working XHTML parsing.
* Miscellaneous other fixes (see the changelog
for details).

I have found this library useful for writing
robots, for "wrapping" all of the URLs on
websites inside my own proxy CGI script, for
filtering HTML, and for doing flexible wget-like
mirroring.

It keeps things as simple as possible, so it
should be easy to learn.

- Connelly Barnes


__________________________________
Do you Yahoo!?
Send holiday email and support a worthy cause. Do good.
http://celebrity.mail.yahoo.com
Jul 18 '05 #1
Share this question for a faster answer!
Share on Google+

This discussion thread is closed

Replies have been disabled for this discussion.