Connecting Tech Pros Worldwide Forums | Help | Site Map

C++ HTML library

Atul
Guest
 
Posts: n/a
#1: Jun 2 '06
I have tried without much success to locate a non-commercial C++
library
than can parse html documents. Can anyone recommend a good one?
I would prefer an events based (SAX like) library though a DOM one is
fine too.

Thanks,

Atul


Roland Pibinger
Guest
 
Posts: n/a
#2: Jun 2 '06

re: C++ HTML library


On 1 Jun 2006 23:06:18 -0700, "Atul" <atul.bhouraskar@gmail.com>
wrote:[color=blue]
>I have tried without much success to locate a non-commercial C++
>library
>than can parse html documents. Can anyone recommend a good one?
>I would prefer an events based (SAX like) library though a DOM one is
>fine too.[/color]

Maybe the HTML Tidy Library ( http://tidy.sourceforge.net/ ) combined
with an XML parser? Alternatively, some free HTML parsers written in
Java are available.

Best wishes,
Roland Pibinger
Phlip
Guest
 
Posts: n/a
#3: Jun 2 '06

re: C++ HTML library


Atul wrote:
[color=blue]
> I have tried without much success to locate a non-commercial C++
> library
> than can parse html documents. Can anyone recommend a good one?
> I would prefer an events based (SAX like) library though a DOM one is
> fine too.[/color]

Wouldn't "browser forgiveness" make a DOM and especially a SAX parser worse
than useless?

(Please answer on some other forum;)

Pick one:

- bond with either Internet Explorer or Firefox
- pull a page
- walk the DOM to your goal

- get wxWidgets and its HTML control

- use Tidy to convert to XHTML
- and bang that with TinyXML

--
Phlip
http://c2.com/cgi/wiki?ZeekLand <-- NOT a blog!!!


Closed Thread