472,106 Members | 1,359 Online
Bytes | Software Development & Data Engineering Community
Post +

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 472,106 software developers and data experts.

Error when trying to write unicode xml to zipfile

I get below error when trying to write unicode xml to a zipfile.

zip.writestr('content.xml', content.toxml())
File "/usr/lib/python2.4/zipfile.py", line 460, in writestr
zinfo.CRC = binascii.crc32(bytes) # CRC-32 checksum
UnicodeEncodeError: 'ascii' codec can't encode character u'\u25cf' in
position 2848: ordinal not in range(128)

Any ideas?

Martin

Jul 9 '07 #1
5 6460
En Mon, 09 Jul 2007 20:11:24 -0300, Martin <ma************@gmail.com>
escribió:
I get below error when trying to write unicode xml to a zipfile.

zip.writestr('content.xml', content.toxml())
File "/usr/lib/python2.4/zipfile.py", line 460, in writestr
zinfo.CRC = binascii.crc32(bytes) # CRC-32 checksum
UnicodeEncodeError: 'ascii' codec can't encode character u'\u25cf' in
position 2848: ordinal not in range(128)

Any ideas?
Encode before writing. Assuming you want to use utf-8:
zip.writestr('content.xml', content.toxml().encode('utf-8'))

In general, when working with unicode, it's best to decode bytes into
unicode as early as possible (when reading input), process only unicode
inside the program, and encode into bytes at the last step (when writing
output).
Some non-unicode-aware libraries may interfere with this flow,
unfortunately.

--
Gabriel Genellina

Jul 10 '07 #2
En Mon, 09 Jul 2007 20:11:24 -0300, Martin <ma************@gmail.com>
escribió:
I get below error when trying to write unicode xml to a zipfile.

zip.writestr('content.xml', content.toxml())
File "/usr/lib/python2.4/zipfile.py", line 460, in writestr
zinfo.CRC = binascii.crc32(bytes) # CRC-32 checksum
UnicodeEncodeError: 'ascii' codec can't encode character u'\u25cf' in
position 2848: ordinal not in range(128)

Any ideas?
Encode before writing. Assuming you want to use utf-8:
zip.writestr('content.xml', content.toxml().encode('utf-8'))

In general, when working with unicode, it's best to decode bytes into
unicode as early as possible (when reading input), process only unicode
inside the program, and encode into bytes at the last step (when writing
output).
Some non-unicode-aware libraries may interfere with this flow,
unfortunately.

--
Gabriel Genellina

Jul 10 '07 #3
Gabriel Genellina wrote:
En Mon, 09 Jul 2007 20:11:24 -0300, Martin <ma************@gmail.com>
escribió:
>I get below error when trying to write unicode xml to a zipfile.

zip.writestr('content.xml', content.toxml())
File "/usr/lib/python2.4/zipfile.py", line 460, in writestr
zinfo.CRC = binascii.crc32(bytes) # CRC-32 checksum
UnicodeEncodeError: 'ascii' codec can't encode character u'\u25cf' in
position 2848: ordinal not in range(128)

Any ideas?

Encode before writing. Assuming you want to use utf-8:
zip.writestr('content.xml', content.toxml().encode('utf-8'))
Unless, obviously, you were serialising to a non-utf8 encoding. But since the
"toxml()" method seems to return unicode here (which sounds surprising), I
expect it a) to provide no XML declaration at all or b) to be broken anyway.

Stefan
Jul 10 '07 #4
Unless, obviously, you were serialising to a non-utf8 encoding. But since the
"toxml()" method seems to return unicode here (which sounds surprising), I
expect it a) to provide no XML declaration at all or b) to be broken anyway.
Or c) the user forgot to specify the encoding= parameter in toxml().

Regards,
Martin
Jul 10 '07 #5
Martin v. Löwis wrote:
>Unless, obviously, you were serialising to a non-utf8 encoding. But since the
"toxml()" method seems to return unicode here (which sounds surprising), I
expect it a) to provide no XML declaration at all or b) to be broken anyway.

Or c) the user forgot to specify the encoding= parameter in toxml().
Then I would expect it a) to serialise to a UTF-8 compatible encoding that
does not require a declaration (which excludes Python unicode) or b) to be
broken. :)

Stefan
Jul 10 '07 #6

This discussion thread is closed

Replies have been disabled for this discussion.

Similar topics

1 post views Thread by Jeff Thur | last post: by
2 posts views Thread by ticars | last post: by
1 post views Thread by hbsnam | last post: by
reply views Thread by leo001 | last post: by

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.