UTF-8, LWP and http-equiv meta tags


I'm trying to retrieve a UTF-8 encoded HTML document using LWP, but I've
hit a snag: the document overrides the Content-Type header from
"text/html" to "text/html; charset=UTF-8" using a <meta
http-equiv="Content-type"... /> tag. LWP doesn't pick this up, so I
seem to end up with a string containing raw UTF-8 bytes that Perl
treats as if it had already been decoded.

Is there any way to tell Perl to turn a string of bytes that look like
UTF-8 into a string of real wide characters? Or a way to get LWP to
make the problem go away?

Thanks in advance.
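
For illustration, here is a minimal sketch of the byte-to-character conversion asked about above, using the core Encode module. The helper name utf8_bytes_to_chars and the sample string are hypothetical, and the sketch assumes the body really is valid UTF-8 that has not yet been decoded.

```perl
use strict;
use warnings;
use Encode qw(decode);

# Hypothetical helper: interpret a string of raw UTF-8 bytes as text
# and return the corresponding Perl character string.
sub utf8_bytes_to_chars {
    my ($bytes) = @_;
    return decode('UTF-8', $bytes);
}

# Stand-in for the undecoded body from LWP: the UTF-8 bytes for "café".
my $raw  = "caf\xC3\xA9";
my $text = utf8_bytes_to_chars($raw);

printf "bytes: %d, characters: %d\n", length($raw), length($text);
# prints "bytes: 5, characters: 4"
```

On the LWP side, HTTP::Message also offers $response->decoded_content, which may make the manual step unnecessary; whether it honours a charset given only in a <meta> tag likely depends on the installed libwww-perl version, so treat that as something to check rather than a guarantee.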

Jul 19 '05 #1