Hello,
I just needed some help on how the DOM is decoded by the IE
parser.
As per the MSDN page,
http://msdn.microsoft.com/workshop/a...ce/charsets/ch...
,server encodings are considered first,then the <metatag specified
encodings and then finally the user's preferred settings(which is
usually Western-European aka windows-1252).
I used Ethereal to packet sniff the traffic to www.baidu.com .There is
no encoding specified in the content-type header on this site .Also,
the preferred encoding is specified in the <metatags as gb2312
..However ie.Document.charset returns the value windows-1252.
My question is this...which encoding is being used by the parser to
actually decode the page to Unicode?Is it the default user encoding or
the <meta>
tag encoding?And if the default user encoding is being used to decode
the content ,isn't that contradictory to what is specified on the page
above?
Thanks
Provost Zak