Philip,
If all you are looking for is the title, then I would recommend using
Regular Expressions. It will just be more performant. If you need more
information from the object model, then I would use COM interop and create
an instance of MSHTML.HTMLDocument. This will allow you to load a document
into the object, and access the DOM.
Hope this helps.
--
- Nicholas Paldino [.NET/C# MVP]
-
mv*@spam.guard.caspershouse.com
"Philip Townsend" <pt*******@v1tech.com> wrote in message
news:ez**************@TK2MSFTNGP10.phx.gbl...
Does anybody know of a way to parse HTML files when it is unknown what
the file will look like? I need to extract the <title> element from a
group of pages, where some pages may not be titled. There is no .net
object available that I can see. Does anybody know of any controls
available for purchase? Thaks...
*** Sent via Developersdex http://www.developersdex.com ***
Don't just participate in USENET...get rewarded for it!