By using this site, you agree to our updated Privacy Policy and our Terms of Use. Manage your Cookies Settings.
440,592 Members | 1,958 Online
Bytes IT Community
+ Ask a Question
Need help? Post your question and get tips & solutions from a community of 440,592 IT Pros & Developers. It's quick & easy.

Best tool to convert html into XHTML for XML parsing?

P: n/a
I'm looking for the best tool to convert 'every day' html into proper XHTML
so that I can parse it as an XML document.

So far I've been using Tidylib to do this, but it doesn't handle things as
gracefully as browsers do. For example, take the page at
http://mail.yahoo.com - all browsers display it properly, but tidying it up
with Tidy (using the tool at http://cgi.w3.org/cgi-bin/tidy) will give a
result that renders quite differently than the original.

So are there any tools that would allow me to properly convert html into
proper xhtml, but without it producing output that would render differently
when viewed in a browser (ie. parse it as a browser would, and create proper
xhtml from that)?

I'm programming in C, if you need to know.

Thx,
Seb

Jul 20 '05 #1
Share this Question
Share on Google+
1 Reply


P: n/a
in Java, JTidy. it's at sourceforge.

Thufir

Jul 20 '05 #2

This discussion thread is closed

Replies have been disabled for this discussion.