"wilk" <wi******@poczta.onet.pl> wrote in
news:e0**************@tk2msftngp13.phx.gbl:
Does anybody here know of any class in .NET that would help me parse
HTML in C#?
Or maybe you can even tell me how to do it?
Do you mean encoding it so that it is not executable, i.e. filtering user
input before adding it to a message board post? If so, follow this link:
http://msdn.microsoft.com/library/de...l=/library/en-us/dv_vbcode/html/vbtskCodeEncodingHTMLTextVisualBasic.asp
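For the encoding case, here is a minimal sketch in C#. It assumes the
project references System.Web, and the class name EncodeDemo and the sample
input are made up for illustration. HttpUtility.HtmlEncode replaces the
characters that have special meaning in HTML with entities, so the browser
displays the text instead of interpreting it as markup.

```csharp
using System;
using System.Web;

public static class EncodeDemo
{
    public static void Main()
    {
        // Hypothetical user input containing markup characters.
        string userInput = "<b>hi</b> & bye";

        // Replace <, > and & with HTML entities so the text is shown
        // literally instead of being interpreted as markup.
        string safe = HttpUtility.HtmlEncode(userInput);

        Console.WriteLine(safe);
        // prints: &lt;b&gt;hi&lt;/b&gt; &amp; bye
    }
}
```

Inside an ASP.NET page, Server.HtmlEncode does the same job without the
explicit HttpUtility call.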
To parse / tokenize HTML tags from an HTML document, you may have to write
your own components. To get started, I found:
System.Web.UI.BaseParser
"Provides a base set of functionality for classes involved in parsing
ASP.NET page requests and server controls."
http://msdn.microsoft.com/library/de...l=/library/en-us/cpref/html/frlrfsystemwebuibaseparserclasstopic.asp
Try looking on SourceForge.net and searching for "Html parser". All the
ones I saw are in Java, but looking at the code for these may help you
write your own in C#.
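If you do end up writing your own, a rough starting point could look like
the sketch below. This is not taken from any of the SourceForge projects;
the class TagTokenizer, its Tokenize method and the OPEN:/CLOSE:/TEXT:
token labels are all invented for illustration, and it assumes a C#
version with generics. It only splits the input into tag and text tokens
with a regular expression. It does not handle comments, script blocks,
or attribute values that contain ">", so treat it as a tokenizer sketch
rather than a real HTML parser.

```csharp
using System;
using System.Collections.Generic;
using System.Text.RegularExpressions;

public static class TagTokenizer
{
    // Matches an opening or closing tag: group 1 is "/" for a closing
    // tag, group 2 is the tag name. Attributes are skipped, not parsed.
    private static readonly Regex TagPattern =
        new Regex(@"<(/?)([A-Za-z][A-Za-z0-9]*)[^>]*>");

    public static List<string> Tokenize(string html)
    {
        List<string> tokens = new List<string>();
        int pos = 0;

        foreach (Match m in TagPattern.Matches(html))
        {
            // Anything between the previous tag and this one is text.
            if (m.Index > pos)
                tokens.Add("TEXT:" + html.Substring(pos, m.Index - pos));

            string kind = m.Groups[1].Value == "/" ? "CLOSE:" : "OPEN:";
            tokens.Add(kind + m.Groups[2].Value.ToLowerInvariant());

            pos = m.Index + m.Length;
        }

        // Trailing text after the last tag, if any.
        if (pos < html.Length)
            tokens.Add("TEXT:" + html.Substring(pos));

        return tokens;
    }

    public static void Main()
    {
        foreach (string token in Tokenize("<p>Hello <B>world</B></p>"))
            Console.WriteLine(token);
    }
}
```

Tag names are lower-cased so that <B> and <b> produce the same token. A
real parser would build a tree from these tokens and would need to cope
with unclosed and badly nested tags.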
Also, you may want to keep checking the feature lists for ASP.NET 2.0
(Whidbey). An HTML parser may turn out to be one of the new features. Post
your request on the forums of this site, and maybe MS will publicly expose
its own parser in 2.0:
http://www.asp.net/whidbey/
Michael Lang, MCSD