Hi there,
I am in some need of help. I am trying to parse using the apache sax
parser
a file that has vaid UTF-8 characters - I keep end up getting a
sun.io.MalformedInputException error.
This is my code:
infile = "<?xml version=\"1.0\"
encoding=\"UTF-8\"?><display_values><currency_display>\u00A5 Japanese
Yen</currency_display></display_values>";
// the above is perfectly valid UNICODE symbol for Yen
XMLReader xr = new org.apache.xerces.parsers.SAXParser();
xr.setContentHandler(this);
xr.setErrorHandler(this);
ByteArrayInputStream bi = new
ByteArrayInputStream(infile.getBytes());
Reader reader = new InputStreamReader(bi,"UTF-8");
InputSource is = new InputSource(reader);
is.setEncoding("UTF-8");
xr.parse(is); // CRASHES RIGHT HERE...
this is the complete trace...
[8/29/04 22:38:40:756 GMT-05:00] 692c692c SystemErr R
sun.io.MalformedInputException
[8/29/04 22:38:40:776 GMT-05:00] 692c692c SystemErr R at
java.lang.Throwable.<init>(Throwable.java)
[8/29/04 22:38:40:776 GMT-05:00] 692c692c SystemErr R at
sun.io.ByteToCharUTF8.convert(ByteToCharUTF8.java)
[8/29/04 22:38:40:776 GMT-05:00] 692c692c SystemErr R at
sun.nio.cs.StreamDecoder$ConverterSD.convertInto(S treamDecoder.java)
[8/29/04 22:38:40:776 GMT-05:00] 692c692c SystemErr R at
sun.nio.cs.StreamDecoder$ConverterSD.implRead(Stre amDecoder.java)
[8/29/04 22:38:40:776 GMT-05:00] 692c692c SystemErr R at
sun.nio.cs.StreamDecoder.read(StreamDecoder.java)
[8/29/04 22:38:40:776 GMT-05:00] 692c692c SystemErr R at
java.io.InputStreamReader.read(InputStreamReader.j ava)
[8/29/04 22:38:40:776 GMT-05:00] 692c692c SystemErr R at
org.apache.xerces.impl.XMLEntityScanner.load(Unkno wn Source)
[8/29/04 22:38:40:776 GMT-05:00] 692c692c SystemErr R at
org.apache.xerces.impl.XMLEntityScanner.scanQName( Unknown Source)
[8/29/04 22:38:40:776 GMT-05:00] 692c692c SystemErr R at
org.apache.xerces.impl.XMLNSDocumentScannerImpl.sc anStartElement(Unknown
Source)
[8/29/04 22:38:40:776 GMT-05:00] 692c692c SystemErr R at
org.apache.xerces.impl.XMLDocumentFragmentScannerI mpl$FragmentContentDispatcher.dispatch(Unknown
Source)
[8/29/04 22:38:40:776 GMT-05:00] 692c692c SystemErr R at
org.apache.xerces.impl.XMLDocumentFragmentScannerI mpl.scanDocument(Unknown
Source)
[8/29/04 22:38:40:776 GMT-05:00] 692c692c SystemErr R at
org.apache.xerces.parsers.XML11Configuration.parse (Unknown Source)
[8/29/04 22:38:40:776 GMT-05:00] 692c692c SystemErr R at
org.apache.xerces.parsers.DTDConfiguration.parse(U nknown Source)
[8/29/04 22:38:40:776 GMT-05:00] 692c692c SystemErr R at
org.apache.xerces.parsers.XMLParser.parse(Unknown Source)
[8/29/04 22:38:40:776 GMT-05:00] 692c692c SystemErr R at
org.apache.xerces.parsers.AbstractSAXParser.parse( Unknown Source)
[8/29/04 22:38:40:776 GMT-05:00] 692c692c SystemErr R at
com.polyorb.tipranavir.pdf.ConvertXML.cparse(Conve rtXML.java)
What am I doing wrong here???
Thank you for any guideance...
Regards, Alex.