By using this site, you agree to our updated Privacy Policy and our Terms of Use. Manage your Cookies Settings.
440,235 Members | 1,022 Online
Bytes IT Community
+ Ask a Question
Need help? Post your question and get tips & solutions from a community of 440,235 IT Pros & Developers. It's quick & easy.

Unicode lists and join (python 2.2.3)

P: n/a
Hello,
I have a list of strings, some of the strings might be unicode. I am
trying to a .join operation on the list and the .join raises a unicode
exception. I am looking for ways to get around this.
I would like to get a unicode string out of the list with all string
elements seperated by '\n'

#!/usr/bin/env python
import sys
import string

try:
x = [u"\xeeabc2:xyz", u"abc3:123"]
u = "\xe7abc"
x.append("%s:%s" % ("xfasfs", u))
x.append(u"Hello:afddfdsfa")

y = u'\n'.join(x)
print("Unicode Call worked!")
except Exception, err:
print("Exception raised %s" % err)

on a related note
Why does this work with no exceptions

x=[]
u = "\xe7abc"
x.append("%s:%s" % ("xfasfs", u))

and this doesnt
x=[]
u = "\xe7abc"
x.append("%s:%s" % (u"xfasfs", u))
Thanks,
Nitin.
Jun 27 '08 #1
Share this Question
Share on Google+
1 Reply


P: n/a
x = [u"\xeeabc2:xyz", u"abc3:123"]
u = "\xe7abc"
u is not a Unicode string.
x.append("%s:%s" % ("xfasfs", u))
so what you append is not a Unicode string, either.
x.append(u"Hello:afddfdsfa")

y = u'\n'.join(x)
As a consequence, .join tries to convert the byte string to
a Unicode string, and fails, because it contains non-ASCII
bytes.
Why does this work with no exceptions

x=[]
u = "\xe7abc"
x.append("%s:%s" % ("xfasfs", u))
% here is applied to a byte string, with all arguments also byte
strings. The result is a byte string.
>
and this doesnt
x=[]
u = "\xe7abc"
x.append("%s:%s" % (u"xfasfs", u))
% is applied to a byte string, with one argument being a Unicode
string. The result is a Unicode string, where all byte strings
get converted to Unicode. Converting u fails, as it has non-ASCII
bytes in it.

Regards,
Martin
Jun 27 '08 #2

This discussion thread is closed

Replies have been disabled for this discussion.