mary wrote:
[color=blue]
> When I read an HTML file starting with
>
> <META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=UTF-8">
>
> and then I write it into another file, say OUTPUT.txt, I get an
> inverted question mark, "¿",
> at the beginning of the OUTPUT.txt file. Why is that?[/color]
Are you saving the file with Notepad.exe?
That program prefixes files that it perceives as Unicode (even UTF-8) with a
Byte Order Mark. If you use an editor to open your file in hex (or "binary")
mode, you might see the BOM, FEFF or FFEF, at the beginning.
Your output system does not interpret the codes as UTF-8, so it probably
uses ISO Latin-1. That has no glyph for FF or EF, so you get a "missing
glyph" symbol as ¿.
This could all be wrong, but the details are off-topic, so nobody is allowed
to contradict me.
--
Phlip
http://industrialxp.org/community/bi...UserInterfaces