467,080 Members | 909 Online
Bytes | Developer Community
Ask Question

Home New Posts Topics Members FAQ

Post your question to a community of 467,080 developers. It's quick & easy.

[2.5.1] "UnicodeDecodeError: 'ascii' codec can't decode byte"?

Hello

I'm getting this error while downloading and parsing web pages:

=====
title = m.group(1)
UnicodeDecodeError: 'ascii' codec can't decode byte 0xe9 in position
48: ordinal not in range(128)
=====

From what I understand, it's because some strings are Unicode, and
hence contain characters that are illegal in ASCII.

Does someone know how to solve this error?

Thank you.
Oct 29 '08 #1
  • viewed: 14247
Share:
3 Replies
Gilles Ganault wrote:
I'm getting this error while downloading and parsing web pages:

=====
title = m.group(1)
UnicodeDecodeError: 'ascii' codec can't decode byte 0xe9 in position
48: ordinal not in range(128)
=====

From what I understand, it's because some strings are Unicode, and
hence contain characters that are illegal in ASCII.
You just need to use a codec according to the encoding of the webpage. Take
a look at
http://wiki.python.org/moin/Python3UnicodeDecodeError
It is about Python 3, but the principles apply nonetheless. In any case,
throwing the error at a websearch will turn up lots of solutions.

Uli

--
Sator Laser GmbH
Geschäftsführer: Thorsten Föcking, Amtsgericht Hamburg HR B62 932

Oct 29 '08 #2
Ulrich Eckhardt wrote:
Gilles Ganault wrote:
>I'm getting this error while downloading and parsing web pages:

=====
title = m.group(1)
UnicodeDecodeError: 'ascii' codec can't decode byte 0xe9 in position
48: ordinal not in range(128)
=====

From what I understand, it's because some strings are Unicode, and
hence contain characters that are illegal in ASCII.

You just need to use a codec according to the encoding of the webpage. Take
a look at
http://wiki.python.org/moin/Python3UnicodeDecodeError
It is about Python 3, but the principles apply nonetheless. In any case,
throwing the error at a websearch will turn up lots of solutions.
I won't believe that statement is producing the error until I see a
traceback. As far as I'm aware the re module can handle Unicode. Getting
a UnicodeDecodeError in an assignment would be unusual to say the least.
Though it's not, I suppose, impossible that calling the .group() method
of a match object might, it seems unlikely.

regards
Steve
--
Steve Holden +1 571 484 6266 +1 800 494 3119
Holden Web LLC http://www.holdenweb.com/

Oct 29 '08 #3
Ulrich Eckhardt wrote:
Gilles Ganault wrote:
>I'm getting this error while downloading and parsing web pages:

=====
title = m.group(1)
UnicodeDecodeError: 'ascii' codec can't decode byte 0xe9 in position
48: ordinal not in range(128)
=====

From what I understand, it's because some strings are Unicode, and
hence contain characters that are illegal in ASCII.

You just need to use a codec according to the encoding of the webpage. Take
a look at
http://wiki.python.org/moin/Python3UnicodeDecodeError
It is about Python 3, but the principles apply nonetheless. In any case,
throwing the error at a websearch will turn up lots of solutions.
I won't believe that statement is producing the error until I see a
traceback. As far as I'm aware the re module can handle Unicode. Getting
a UnicodeDecodeError in an assignment would be unusual to say the least.
Though it's not, I suppose, impossible that calling the .group() method
of a match object might, it seems unlikely.

regards
Steve
--
Steve Holden +1 571 484 6266 +1 800 494 3119
Holden Web LLC http://www.holdenweb.com/

Oct 29 '08 #4

This discussion thread is closed

Replies have been disabled for this discussion.

Similar topics

13 posts views Thread by - Steve - | last post: by
13 posts views Thread by baumann.Pan@gmail.com | last post: by
6 posts views Thread by Kai Bhli | last post: by
3 posts views Thread by mr | last post: by
8 posts views Thread by jeffpierce12@hotmail.com | last post: by
By using this site, you agree to our Privacy Policy and Terms of Use.