Hi,
I previously worked on a program written in C# that used a library to parse HTML. If I remember correctly, the library did two things: 1) it removed all HTML tags from the document, and 2) it converted the input stream to a string. I would like to do the same in Java, but the few libraries I found online don't seem to remove HTML tags; they only seem to clean up the HTML so that the tags all match up with each other.
Can anyone suggest a Java HTML parser library that would help me read an HTML document, remove its tags, and convert the result to a string?
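To make the goal concrete, here is a minimal sketch of the behaviour described above. It uses the HTML parser that ships with the JDK's Swing package rather than a third-party library. The class name HtmlToText and the method stripTags are made up for illustration, and joining the text fragments with a single space is only an assumption about the desired output. A dedicated library may well handle whitespace, scripts, and malformed markup better.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import javax.swing.text.html.HTMLEditorKit;
import javax.swing.text.html.parser.ParserDelegator;

public class HtmlToText {

    // Hypothetical helper: reads HTML from a Reader and returns only its text content.
    public static String stripTags(Reader html) throws IOException {
        final StringBuilder text = new StringBuilder();

        // The parser reports tags and text through callbacks; only text is kept here.
        HTMLEditorKit.ParserCallback callback = new HTMLEditorKit.ParserCallback() {
            @Override
            public void handleText(char[] data, int pos) {
                // Assumption: separate adjacent text fragments with one space.
                if (text.length() > 0) {
                    text.append(' ');
                }
                text.append(data);
            }
        };

        // 'true' tells the parser to ignore any charset declared inside the document.
        new ParserDelegator().parse(html, callback, true);
        return text.toString();
    }

    public static void main(String[] args) throws IOException {
        String html = "<html><body><h1>Title</h1><p>Some <b>bold</b> text.</p></body></html>";
        System.out.println(stripTags(new StringReader(html)));
    }
}
```

For a file or network stream, the StringReader in main could be replaced by an InputStreamReader wrapped around the input stream.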
Thank you.