This is my understanding so far, and please correct any errors:
1. US-ASCII is a subset of ISO-8859-1
2. US-ASCII is a subset of UTF-8
3. ISO-8859-1 is not a subset of UTF-8
But ... are the numeric entities (in hex or decimal) for ISO-8859-1
the same in UTF-8?
Can an HTML document that uses only Latin-1 numeric entities have
its content-type changed to UTF-8 and still be valid?
Do Latin-1 numeric entities have to be written either as x## or ###,
or can they have trailing zeros, like x00## or 0###, which is what
you would have with UTF-8?
TIA
Ian
--
http://www.aspipes.org/
http://www.bookstacks.org/
http://www.learnsomethingnew.us/