Help | Site Map
Connecting Tech Pros Worldwide
 
 
LinkBack Thread Tools
  #1  
Old July 24th, 2005, 12:51 AM
Sebastien B.
Guest
 
Posts: n/a
Default Best tool to convert html into XHTML for XML parsing?

I'm looking for the best tool to convert 'every day' html into proper XHTML
so that I can parse it as an XML document.

So far I've been using Tidylib to do this, but it doesn't handle things as
gracefully as browsers do. For example, take the page at
http://mail.yahoo.com - all browsers display it properly, but tidying it up
with Tidy (using the tool at http://cgi.w3.org/cgi-bin/tidy) will give a
result that renders quite differently than the original.

So are there any tools that would allow me to properly convert html into
proper xhtml, but without it producing output that would render differently
when viewed in a browser (ie. parse it as a browser would, and create proper
xhtml from that)?

I'm programming in C, if you need to know.

Thx,
Seb



  #2  
Old July 24th, 2005, 12:51 AM
hawat.thufir@gmail.com
Guest
 
Posts: n/a
Default Re: Best tool to convert html into XHTML for XML parsing?

in Java, JTidy. it's at sourceforge.

Thufir

 

Bookmarks

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are Off
[IMG] code is Off
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On

What is Bytes?

We are a network of experts and professionals in IT and software development that help one another with answers to tough questions and share insights. Get the best answers to your questions from over network members.
Post your question now . . .
It's fast and it's free

Popular Articles