hello all,
I am new in Python. And I have got a problem about unicode.
I have got a unicode string, when I was going to send it out throuth a
socket by send(), I got an exception. How can I send the unicode string
to the remote end of the socket as it is without any conversion of
encode, so the remote end of the socket will receive unicode string?
Thanks 9 5028
You could not. Unicode is an abstract data type. It must be encoded into
octets in order to send via socket. And the other end must decode the
octets to retrieve the unicode string. Needless to say the encoding scheme
must be consistent and understood by both ends.
On 18 Feb 2005 11:03:46 -0800, <zy*****@163.net> wrote: hello all, I am new in Python. And I have got a problem about unicode. I have got a unicode string, when I was going to send it out throuth a socket by send(), I got an exception. How can I send the unicode string to the remote end of the socket as it is without any conversion of encode, so the remote end of the socket will receive unicode string?
Thanks
aurora wrote: You could not. Unicode is an abstract data type. It must be encoded into octets in order to send via socket. And the other end must decode the octets to retrieve the unicode string. Needless to say the encoding scheme must be consistent and understood by both ends.
So use pickle.
--Irmen
Irmen de Jong wrote: aurora wrote:
You could not. Unicode is an abstract data type. It must be encoded into octets in order to send via socket. And the other end must decode the octets to retrieve the unicode string. Needless to say the encoding scheme must be consistent and understood by both ends.
So use pickle.
--Irmen
Well, on second thought: don't use pickle.
If all you want to transfer is unicode strings (or normal strings)
it's safer to just encode them to, say, UTF-8, transfer
that octet stream across, and on the other side, decode the
UTF-8 octets back into a unicode string.
--Irmen
You probably want to use UTF-16 or UTF-8 on both sides of the socket.
See http://www.python.org/moin/Unicode for more information.
So, we have a Unicode string... mystring=u'eggs and ham' mystring
u'eggs and ham'
Now, we want to send it over: to_send=mystring.encode('utf-8') to_send
'eggs and ham'
It's encoded in UTF-8 now.
On the other side, (result=to_send,) we decode:
result=received.decode('utf-8') result
u'eggs and ham'
You have transfered a unicode string. {:)}=
It's really funny, I cannot send a unicode stream throuth socket with
python while all the other languages as perl,c and java can do it.
then, how about converting the unicode string to a binary stream? It is
possible to send a binary through socket with python? zy*****@163.net wrote: It's really funny, I cannot send a unicode stream throuth socket with python while all the other languages as perl,c and java can do it.
You may really start laughing loudly <wink> after you find out that you
can send arbitrary python objects over sockets. If you want language
specific way of sending objects, see Irmen's first answer: use pickle.
then, how about converting the unicode string to a binary stream?
Sure, there are already three answers in this thread that suggest you
to do that. Use encode method of unicode strings.
It is possible to send a binary through socket with python?
Sure. If it wouldn't be possible to send bytes through sockets with Python
what else do you think could be sent? Perhaps you're confused that
bytes are stored in byte strings in Python, which are often called strings in
documentation and conversations? It will be fixed in Python 3.0, but
these days you have to store bytes in str type.
Serge.
On 18 Feb 2005 19:10:36 -0800, <zy*****@163.net> wrote: It's really funny, I cannot send a unicode stream throuth socket with python while all the other languages as perl,c and java can do it. then, how about converting the unicode string to a binary stream? It is possible to send a binary through socket with python?
I was answering your specific question:
"How can I send the unicode string to the remote end of the socket as it
is without any conversion of encode"
The answer is you could not. Not that you cannot sent unicode but you have
to encode it. The same applies to perl, c or Java. The only difference is
the detail of how strings get encoded.
There are a few posts suggest various means. Or you can check out
codecs.getwriter() which closer resembles Java's way.
anonymous coward <zy*****@163.net> wrote: It's really funny, I cannot send a unicode stream throuth socket with python while all the other languages as perl,c and java can do it.
Are you sure you understand what Unicode is, and how sockets work?
Sockets are used to transfer byte streams. If you want to transfer
a python-level object, you have to decide how to encode it as a
byte stream. For integers, you have to decide whether to use a single
byte, a string of decimal ascii characters, netstring syntax, etc. For
text, you have to decide what character encoding to use. For arbitrary
objects, you have to decide what serialisation protocol to use. etc.
(and yes, the same applies to all other languages. Java sockets and C
sockets are no different from Python sockets...)
</F>
On 18 Feb 2005 19:10:36 -0800, rumours say that zy*****@163.net might have
written: It's really funny, I cannot send a unicode stream throuth socket with python while all the other languages as perl,c and java can do it.
I don't know about perl. What I think you mean by unicode in C most probably is
the wchar_t, which is Unicode encoded as 'ucs-2' or 'utf-16' (little or big
endian, depending on your platform) or maybe a 4-byte int, for which I don't
know a Python equivalent. And I /assume/ in Java that Unicode is equivalent to
'utf-16' encoded strings when input/output.
Perhaps Unicode encoded as 'utf-16' is what you're after. However, Unicode
encoded as 'utf-8' (like others also suggested) might be what you /should/ be
using, given that this encoding has some attractive properties (no null bytes,
no spurious control characters etc).
Don't interpret as weakness the explicitness requested from Python.
--
TZOTZIOY, I speak England very best.
"Be strict when sending and tolerant when receiving." (from RFC1958)
I really should keep that in mind when talking with people, actually... This thread has been closed and replies have been disabled. Please start a new discussion. Similar topics
by: Pettersen, Bjorn S |
last post by:
I've been trying to stay blissfully unaware of Unicode, however now it seems like it's my turn. From the outside it seems like a rather massive subject, so any pointers as to where I should _start_...
|
by: Laurent Therond |
last post by:
Maybe you have a minute to clarify the following matter...
Consider:
---
from cStringIO import StringIO
def bencode_rec(x, b):
t = type(x)
|
by: John Roth |
last post by:
I've got an interesting little problem that I can't find an
answer to after hunting through the doc (2.3.3). I've
got a string that contains something that kind of
resembles an HTML document. On...
|
by: Ivan Voras |
last post by:
I have a string fetched from database, in iso8859-2, with 8bit
characters, and I'm trying to send it over the network, via a socket:
File "E:\Python24\lib\socket.py", line 249, in write
data =...
|
by: wael |
last post by:
hello all,
i want convert w_char to UCS2 encoded (0041) this is a char encoded UCS2
please look at this
http://www.unicode.org/charts/
http://www.unicode.org/
every language has a chart
bye...
|
by: Ben |
last post by:
I'm left with some legacy code using plain old str, and I need to make
sure it works with unicode input/output. I have a simple plan to do
this:
- Run the code with "python -U" so all the string...
|
by: John Nagle |
last post by:
Here's a strange little bug. "socket.getaddrinfo" blows up
if given a bad domain name containing ".." in Unicode. The
same string in ASCII produces the correct "gaierror" exception.
Actually,...
|
by: raghupise |
last post by:
Hi everybody,
I have one already existing project(Socket programmin).
I have to make it comfortable for unicode as well as common strings.
Shall i take whcar_t datatye instead of char datatype....
|
by: Russell E. Owen |
last post by:
I have code like this:
except Exception, e:
self.setState(self.Failed, str(e))
which fails if the exception contains a unicode argument.
I did, of course, try unicode(e) but that fails.
The...
|
by: lllomh |
last post by:
Define the method first
this.state = {
buttonBackgroundColor: 'green',
isBlinking: false, // A new status is added to identify whether the button is blinking or not
}
autoStart=()=>{
|
by: isladogs |
last post by:
The next Access Europe meeting will be on Wednesday 4 Oct 2023 starting at 18:00 UK time (6PM UTC+1) and finishing at about 19:15 (7.15PM)
The start time is equivalent to 19:00 (7PM) in Central...
|
by: Aliciasmith |
last post by:
In an age dominated by smartphones, having a mobile app for your business is no longer an option; it's a necessity. Whether you're a startup or an established enterprise, finding the right mobile app...
|
by: giovanniandrean |
last post by:
The energy model is structured as follows and uses excel sheets to give input data:
1-Utility.py contains all the functions needed to calculate the variables and other minor things (mentions...
|
by: NeoPa |
last post by:
Introduction
For this article I'll be using a very simple database which has Form (clsForm) & Report (clsReport) classes that simply handle making the calling Form invisible until the Form, or all...
|
by: Teri B |
last post by:
Hi, I have created a sub-form Roles. In my course form the user selects the roles assigned to the course.
0ne-to-many. One course many roles.
Then I created a report based on the Course form and...
|
by: isladogs |
last post by:
The next Access Europe meeting will be on Wednesday 1 Nov 2023 starting at 18:00 UK time (6PM UTC) and finishing at about 19:15 (7.15PM)
Please note that the UK and Europe revert to winter time on...
|
by: NeoPa |
last post by:
Introduction
For this article I'll be focusing on the Report (clsReport) class. This simply handles making the calling Form invisible until all of the Reports opened by it have been closed, when it...
|
by: GKJR |
last post by:
Does anyone have a recommendation to build a standalone application to replace an Access database? I have my bookkeeping software I developed in Access that I would like to make available to other...
| |