Alex wrote:
Quote:
The parser that we are currently using is not able to:
- parse unary XML elements
- parse the text block between XML elements
- handle specific characters, like '
Then it isn't an XML parser. (I'm willing to let folks cheat on the
"only ASCII" issue, though that's a Bad Idea, but an XML parser has to
support XML syntax.)
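
As a rough illustration of what "support XML syntax" means for the three items quoted above, here is a minimal sketch. It uses Python's standard-library xml.etree.ElementTree purely as a stand-in for any conforming parser (it is not Xerces), and the sample document and element names are made up for the example.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample covering the three cases from the quoted list:
# an empty ("unary") element, text between elements, and apostrophes.
SAMPLE = (
    "<doc><empty/>text between elements"
    "<name>Joe's &apos;quoted&apos; text</name></doc>"
)

root = ET.fromstring(SAMPLE)

empty = root.find("empty")
name = root.find("name")

# An empty element parses fine and simply has no text content.
print(empty.text)

# Text that follows <empty/> and precedes <name> is kept as the "tail"
# of the preceding element in this particular API.
print(empty.tail)

# A literal apostrophe and the predefined &apos; entity both come back
# as the same character.
print(name.text)
```

Any parser that rejects or mangles input like this is not handling ordinary XML, whichever API it happens to expose.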
Good parsers certainly exist. I'm biased since I've contributed to it,
but my first suggestion for a general-purpose parser in either Java or
C++ would be Xerces, available from Apache; IBM's main current product
parser is a somewhat enhanced version of the Xerces code.
But there are certainly lots of other parser packages. Even the W3C has
given up trying to track them all, and just suggests you do a web search
for "XML parser" combined with the language(s) you're interested in.
--
() ASCII Ribbon Campaign | Joe Kesselman
/\ Stamp out HTML e-mail! | System architexture and kinetic poetry