David Thielen wrote:
I learn something new everyday - I was not aware of this. How long has
this been part of the standard?
Since the very beginning. The WD-xml-961114 draft says (4.2.3):
"Entities encoded in UCS-2 must begin with the Byte Order Mark
described by ISO 10646 Annex E and Unicode Appendix B (the ZERO
WIDTH NO-BREAK SPACE character, U+FEFF). This is an encoding
signature, not part of either the markup or character data of
the XML document. XML processors must be able to use this
character to differentiate between UTF-8 and UCS-2 encoded
documents." [p.20]
///Peter
--
XML FAQ:
http://xml.silmaril.ie/