How do I converted a null (0) terminated string to a Python string?

Michael

Hi All,

I've received (via UDP) a null terminated string and need to convert it
into a Python string. Can anyone tell me how this is done? If it helps,
I know the number of characters in the string.

Thanks,
M. McDonnell

Sep 14 '06 #1

Subscribe Post Reply

25362

Steve Holden

Michael wrote:

Hi All,

I've received (via UDP) a null terminated string and need to convert it
into a Python string. Can anyone tell me how this is done? If it helps,
I know the number of characters in the string.

Thanks,
M. McDonnell

Have you received this string in Python or in C? If the former, then
just throw away the last character of the string you've received and
you're done!

s = s[:-1]

regards
Steve
--
Steve Holden +44 150 684 7255 +1 800 494 3119
Holden Web LLC/Ltd http://www.holdenweb.com
Skype: holdenweb http://holdenweb.blogspot.com
Recent Ramblings http://del.icio.us/steve.holden

Sep 14 '06 #2

John Machin

Michael wrote:

Hi All,

I've received (via UDP) a null terminated string and need to convert it
into a Python string. Can anyone tell me how this is done? If it helps,
I know the number of characters in the string.

I think you mean NUL, not null.

What have you received it into, if it's not a Python string?

You probably need/want this:

if strg[-1] == "\0":
strg = strg[:-1]
alternatively:
strg = strg.rstrip("\0") # requires Python 2.2.2 or later

It's possible you may be talking about a fixed length string which
contains useful_stuff + "\0" + padding -- in that case you need

strg = strg.split("\0")[0] # grab upto (but not including) the first
NUL (if any)

If you're not sure what you've got, print repr(the_input_string)

HTH,
John

Sep 14 '06 #3

Michael

Thank you very much for your responses. To answer some of the
questions... Yes, I am in Python receiving a C language 0 terminated
string that was sent to my Python program in a UDP packet (which is how
I know the count). Are your responses still correct given this
clarification?

Thanks much,
MDM

John Machin wrote:

Michael wrote:
Hi All,

I've received (via UDP) a null terminated string and need to convert it
into a Python string. Can anyone tell me how this is done? If it helps,
I know the number of characters in the string.

I think you mean NUL, not null.

What have you received it into, if it's not a Python string?

You probably need/want this:

if strg[-1] == "\0":
strg = strg[:-1]
alternatively:
strg = strg.rstrip("\0") # requires Python 2.2.2 or later

It's possible you may be talking about a fixed length string which
contains useful_stuff + "\0" + padding -- in that case you need

strg = strg.split("\0")[0] # grab upto (but not including) the first
NUL (if any)

If you're not sure what you've got, print repr(the_input_string)

HTH,
John

Sep 14 '06 #4

John Machin

Michael top-posted [corrected]:

John Machin wrote:
Michael wrote:
Hi All,
>
I've received (via UDP) a null terminated string and need to convert it
into a Python string. Can anyone tell me how this is done? If it helps,
I know the number of characters in the string.
>
I think you mean NUL, not null.

What have you received it into, if it's not a Python string?

You probably need/want this:

if strg[-1] == "\0":
strg = strg[:-1]
alternatively:
strg = strg.rstrip("\0") # requires Python 2.2.2 or later

It's possible you may be talking about a fixed length string which
contains useful_stuff + "\0" + padding -- in that case you need

strg = strg.split("\0")[0] # grab upto (but not including) the first
NUL (if any)

If you're not sure what you've got, print repr(the_input_string)

HTH,
John
Thank you very much for your responses. To answer some of the
questions... Yes, I am in Python receiving a C language 0 terminated
string that was sent to my Python program in a UDP packet (which is how
I know the count). Are your responses still correct given this
clarification?

My responses are correct. Your "clarification" indicates to me that you
are going by what you are told, not by inspection of (several instances
of) the packet contents, using repr(). It's up to you whether you want
to be skeptical about the packet contents or not. I certainly wouldn't
be throwing the last byte away without checking that it was in fact a
NUL.

Cheers,
John

Sep 14 '06 #5

Michael

John,

Thanks for your reply. Just wondering... how are Python strings
formatted? Evidently they're not 0 terminated.

Thanks again,
MDM

John Machin wrote:

Michael top-posted [corrected]:
John Machin wrote:
Michael wrote:
Hi All,

I've received (via UDP) a null terminated string and need to convert it
into a Python string. Can anyone tell me how this is done? If it helps,
I know the number of characters in the string.

>
I think you mean NUL, not null.
>
What have you received it into, if it's not a Python string?
>
You probably need/want this:
>
if strg[-1] == "\0":
strg = strg[:-1]
alternatively:
strg = strg.rstrip("\0") # requires Python 2.2.2 or later
>
It's possible you may be talking about a fixed length string which
contains useful_stuff + "\0" + padding -- in that case you need
>
strg = strg.split("\0")[0] # grab upto (but not including) the first
NUL (if any)
>
If you're not sure what you've got, print repr(the_input_string)
>
HTH,
John
Thank you very much for your responses. To answer some of the
questions... Yes, I am in Python receiving a C language 0 terminated
string that was sent to my Python program in a UDP packet (which is how
I know the count). Are your responses still correct given this
clarification?

My responses are correct. Your "clarification" indicates to me that you
are going by what you are told, not by inspection of (several instances
of) the packet contents, using repr(). It's up to you whether you want
to be skeptical about the packet contents or not. I certainly wouldn't
be throwing the last byte away without checking that it was in fact a
NUL.

Cheers,
John

Sep 14 '06 #6

John Machin

Michael top-posted [again]:

>
John Machin wrote:
Michael top-posted [corrected]:
John Machin wrote:
Michael wrote:
Hi All,
>
I've received (via UDP) a null terminated string and need to convert it
into a Python string. Can anyone tell me how this is done? If it helps,
I know the number of characters in the string.
>

I think you mean NUL, not null.

What have you received it into, if it's not a Python string?

You probably need/want this:

if strg[-1] == "\0":
strg = strg[:-1]
alternatively:
strg = strg.rstrip("\0") # requires Python 2.2.2 or later

It's possible you may be talking about a fixed length string which
contains useful_stuff + "\0" + padding -- in that case you need

strg = strg.split("\0")[0] # grab upto (but not including) the first
NUL (if any)

If you're not sure what you've got, print repr(the_input_string)

HTH,
John
Thank you very much for your responses. To answer some of the
questions... Yes, I am in Python receiving a C language 0 terminated
string that was sent to my Python program in a UDP packet (which is how
I know the count). Are your responses still correct given this
clarification?
My responses are correct. Your "clarification" indicates to me that you
are going by what you are told, not by inspection of (several instances
of) the packet contents, using repr(). It's up to you whether you want
to be skeptical about the packet contents or not. I certainly wouldn't
be throwing the last byte away without checking that it was in fact a
NUL.

Cheers,
John
John,

Thanks for your reply. Just wondering... how are Python strings
formatted? Evidently they're not 0 terminated.

A Python string is an object. The details of the internal storage may
vary between implementations. CPython has 8-bit str objects and 16-bit
or 32-bit Unicode objects. In IronPython, (str is Unicode) is true, and
they are 16 bits. In any case the object knows its own length without
having to scan for a terminator. Thus, a string can contain NULs.

Having said all that, the CPython str implementation does have an
additional byte at the end; this is set to zero and is not counted in
the length. However you never see that and don't really need to know
unless you are writing an extension module in C -- it's handy to know
that you don't have to append a NUL if you want to call a C library
function.

Cheers,
John

Sep 14 '06 #7

Fredrik Lundh

Michael wrote:

Thanks for your reply. Just wondering... how are Python strings
formatted? Evidently they're not 0 terminated.

have you tried *printing* the thing you got via UDP?

to get a programmer-friendly representation of an arbitrary object, use

print repr(obj)

(where obj is your string, in this case).

</F>

Sep 14 '06 #8

John Machin

Fredrik Lundh wrote:

Michael wrote:

Thanks for your reply. Just wondering... how are Python strings
formatted? Evidently they're not 0 terminated.

have you tried *printing* the thing you got via UDP?

to get a programmer-friendly representation of an arbitrary object, use

print repr(obj)

(where obj is your string, in this case).

Probably not; there was no indication after the two messages where I
mentioned repr :-)

Sep 14 '06 #9

Michael

John,

Since I'm new to Python, I'm having trouble understanding what this
means (see below). Would appreciate any help.

if strg[-1] == "\0":
strg = strg[:-1]

Thanks,
MDM

John Machin wrote:

Fredrik Lundh wrote:
Michael wrote:

Thanks for your reply. Just wondering... how are Python strings
formatted? Evidently they're not 0 terminated.
have you tried *printing* the thing you got via UDP?

to get a programmer-friendly representation of an arbitrary object, use

print repr(obj)

(where obj is your string, in this case).

Probably not; there was no indication after the two messages where I
mentioned repr :-)

Sep 14 '06 #10

John Machin

Michael wrote:

John,

Since I'm new to Python, I'm having trouble understanding what this
means (see below). Would appreciate any help.

if strg[-1] == "\0":

If the last (i.e index -1) byte in the string equals ASCII NUL:

strg = strg[:-1]

then take a slice of the string from the start up to but not including
the last byte and assign that to "strg"

In other words, if the last byte of strg is NUL, throw it away.

The truly paranoid would code that as
if strg and strg[-1] etc etc
so that it wouldn't blow up if strg is empty -- strange things can
happen when you are reading other people's data :-)

Perhaps you should work through the tutorial; all the above concepts
are treated in this section:
http://docs.python.org/tut/node5.htm...00000000000000

HTH,
John

Sep 14 '06 #11

Marc 'BlackJack' Rintsch

In <11**********************@k70g2000cwa.googlegroups .com>, John Machin
wrote:

In other words, if the last byte of strg is NUL, throw it away.

The truly paranoid would code that as
if strg and strg[-1] etc etc
so that it wouldn't blow up if strg is empty -- strange things can
happen when you are reading other people's data :-)

I would spell it:

if strg.endswith('\0'):
strg = strg[:-1]

Ciao,
Marc 'BlackJack' Rintsch

Sep 14 '06 #12

Richard Brodie

"Marc 'BlackJack' Rintsch" <bj****@gmx.netwrote in message
news:pa****************************@gmx.net...

I would spell it:

if strg.endswith('\0'):
strg = strg[:-1]

I would just go with: strg = strg.rstrip('\0')

Sep 14 '06 #13

Michael

I guess, I still don't see how this will work. I'm receiving a C
zero-terminated string in my Python program as a 1K byte block (UDP
datagram). If the string sent was "abc", then what I receive in Python
is <a><c><0><garbage><garbage>...<last_garbage_byt e>. How is Python
going to know where in this 1K byte block the end of the string is? It
seems that what I need to do is tell Python that the string ends at
zero-relative index 3. What am I missing here?

Marc 'BlackJack' Rintsch wrote:

In <11**********************@k70g2000cwa.googlegroups .com>, John Machin
wrote:

In other words, if the last byte of strg is NUL, throw it away.

The truly paranoid would code that as
if strg and strg[-1] etc etc
so that it wouldn't blow up if strg is empty -- strange things can
happen when you are reading other people's data :-)

I would spell it:

if strg.endswith('\0'):
strg = strg[:-1]

Ciao,
Marc 'BlackJack' Rintsch

Sep 14 '06 #14

Robert Kern

Michael wrote:

I guess, I still don't see how this will work. I'm receiving a C
zero-terminated string in my Python program as a 1K byte block (UDP
datagram). If the string sent was "abc", then what I receive in Python
is <a><c><0><garbage><garbage>...<last_garbage_byt e>. How is Python
going to know where in this 1K byte block the end of the string is? It
seems that what I need to do is tell Python that the string ends at
zero-relative index 3. What am I missing here?

Nothing. This is what I would do:
In [34]: s
Out[34]: 'abc\x00garbage'

In [35]: s.split('\x00', 1)[0]
Out[35]: 'abc'

In [36]: s.split?
Type: builtin_function_or_method
Base Class: <type 'builtin_function_or_method'>
String Form: <built-in method split of str object at 0x6ada2c8>
Namespace: Interactive
Docstring:
S.split([sep [,maxsplit]]) -list of strings

Return a list of the words in the string S, using sep as the
delimiter string. If maxsplit is given, at most maxsplit
splits are done. If sep is not specified or is None, any
whitespace string is a separator.
Using the maxsplit argument saves split from having to do unnecessary work
splitting the garbage portion if there are nulls there, too.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma
that is made terrible by our own mad attempt to interpret it as though it had
an underlying truth."
-- Umberto Eco

Sep 14 '06 #15

Robert Kern

Robert Kern wrote:

Michael wrote:
>I guess, I still don't see how this will work. I'm receiving a C
zero-terminated string in my Python program as a 1K byte block (UDP
datagram). If the string sent was "abc", then what I receive in Python
is <a><c><0><garbage><garbage>...<last_garbage_byt e>. How is Python
going to know where in this 1K byte block the end of the string is? It
seems that what I need to do is tell Python that the string ends at
zero-relative index 3. What am I missing here?

Nothing. This is what I would do:
In [34]: s
Out[34]: 'abc\x00garbage'

In [35]: s.split('\x00', 1)[0]
Out[35]: 'abc'

And I see that this is the advice that John Machin already gave you. Shame on me
for not reading the thread before replying.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma
that is made terrible by our own mad attempt to interpret it as though it had
an underlying truth."
-- Umberto Eco

Sep 14 '06 #16

Fredrik Lundh

Marc 'BlackJack' Rintsch wrote:

I would spell it:

if strg.endswith('\0'):
strg = strg[:-1]

unless you're in a hurry; startswith and endswith are horribly
inefficient compared to ordinary indexing/slicing.

(as I've pointed out elsewhere, even "s[:len(t)] == t" is usually faster
than "s.startswith(t)" for short prefixes, where "short" is a lot longer
than you may think).

</F>

Sep 14 '06 #17

Gabriel Genellina

At Thursday 14/9/2006 11:45, Michael wrote:

>Since I'm new to Python, I'm having trouble understanding what this
means (see below). Would appreciate any help.

if strg[-1] == "\0":
strg = strg[:-1]

The Python Tutorial will answer your questions, it is enlightning and
easy to follow.

Gabriel Genellina
Softlab SRL

__________________________________________________
Preguntá. Respondé. Descubrí.
Todo lo que querías saber, y lo que ni imaginabas,
está en Yahoo! Respuestas (Beta).
¡Probalo ya!
http://www.yahoo.com.ar/respuestas

Sep 14 '06 #18

John Machin

Michael wrote:

I guess, I still don't see how this will work. I'm receiving a C
zero-terminated string in my Python program as a 1K byte block (UDP
datagram). If the string sent was "abc", then what I receive in Python
is <a><c><0><garbage><garbage>...<last_garbage_byt e>. How is Python
going to know where in this 1K byte block the end of the string is? It
seems that what I need to do is tell Python that the string ends at
zero-relative index 3. What am I missing here?

You are missing this, contained in the third message in this thread
i.e. my first reply to you:
"""
It's possible you may be talking about a fixed length string which
contains useful_stuff + "\0" + padding -- in that case you need

strg = strg.split("\0")[0] # grab upto (but not including) the first
NUL (if any)
"""

Please note that Robert Kern's solution is better than the above (as he
says, it won't waste time splitting up the garbage, the mind-boggling
relative size of which I hadn't catered for).

You are also missing the frequent (3 at last count) exhortations to:
print repr(your_string)
which is much more precise than verbal descriptions.

Cheers,
John

Sep 14 '06 #19

Michael

Robert,

Thanks to you and everyone else for the help. The "s.split('\x00',
1)[0] " solved the problem.

Thanks again,
MDM

Robert Kern wrote:

Michael wrote:
I guess, I still don't see how this will work. I'm receiving a C
zero-terminated string in my Python program as a 1K byte block (UDP
datagram). If the string sent was "abc", then what I receive in Python
is <a><c><0><garbage><garbage>...<last_garbage_byt e>. How is Python
going to know where in this 1K byte block the end of the string is? It
seems that what I need to do is tell Python that the string ends at
zero-relative index 3. What am I missing here?

Nothing. This is what I would do:
In [34]: s
Out[34]: 'abc\x00garbage'

In [35]: s.split('\x00', 1)[0]
Out[35]: 'abc'

In [36]: s.split?
Type: builtin_function_or_method
Base Class: <type 'builtin_function_or_method'>
String Form: <built-in method split of str object at 0x6ada2c8>
Namespace: Interactive
Docstring:
S.split([sep [,maxsplit]]) -list of strings

Return a list of the words in the string S, using sep as the
delimiter string. If maxsplit is given, at most maxsplit
splits are done. If sep is not specified or is None, any
whitespace string is a separator.
Using the maxsplit argument saves split from having to do unnecessary work
splitting the garbage portion if there are nulls there, too.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma
that is made terrible by our own mad attempt to interpret it as though it had
an underlying truth."
-- Umberto Eco

Sep 15 '06 #20

George Sakkis

Michael wrote:

Robert,

Thanks to you and everyone else for the help. The "s.split('\x00',
1)[0] " solved the problem.

And a probably faster version: s[:s.index('\x00')]

George

Sep 15 '06 #21

Robert Kern

George Sakkis wrote:

Michael wrote:

>Robert,

Thanks to you and everyone else for the help. The "s.split('\x00',
1)[0] " solved the problem.

And a probably faster version: s[:s.index('\x00')]

Yup. About twice as fast for at least one dataset:
In [182]: import timeit

In [183]: t1 = timeit.Timer("s.split('\\x00', 1)[0]", "s='abc\\x00'*256")

In [184]: t2 = timeit.Timer("s[:s.index('\\x00')]", "s='abc\\x00'*256")

In [192]: t1.repeat(3, 100000)
Out[192]: [0.68063879013061523, 0.67146611213684082, 0.66347002983093262]

In [193]: t2.repeat(3, 100000)
Out[193]: [0.35819387435913086, 0.35968899726867676, 0.37595295906066895]

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma
that is made terrible by our own mad attempt to interpret it as though it had
an underlying truth."
-- Umberto Eco

Sep 15 '06 #22

Similar topics

fgets() and embedded null characters

by: David Mathog | last post by:

Every so often one of my fgets() based programs encounters an input file containing embedded nulls. fgets is happy to read these but the embedded nulls subsequently cause problems elsewhere in...

C / C++

Null character and fixed strings

by: ehabaziz2001 | last post by:

Hi, Till now I do not understand how the null character automatically added to the end of the string and it is not an element of the string array . All books said the null character (\0) added...

C / C++

win32com, BSTR, and null terminated strings

by: Matt Helm | last post by:

I am using win32com to access a third party COM interface but am having trouble using the string that is returned. The vendor's docs show the following method: HRESULT CookString(BSTR param_a,...

Python

MySQLDB - parameterised SQL - "TypeError: not all arguments converted during string formatting"

by: shearichard | last post by:

Hi - I have written some python to insert a row into a table using MySQLDB. I have never before written SQL/Python using embedded parameters in the SQL and I'm having some difficulties. Could...

Python

Null terminated strings...

by: Steve Barnett | last post by:

I need to call a Win32 function that returns a set of string values. Each value is terminated by a null and the last value is terminated by two nulls. How do I code for this in C#? The call is:...

C# / C Sharp

What could normally causes the string to have no null terminated

by: semut | last post by:

Given that the string is of null terminated type. What could be the possible causes (by experience) the string to have no null terminated and cause buffer overflow later. I know it is quite broad,...

C / C++

What are the common causes to null-terminated string for not having \0 character and later buffer overflow

by: semut | last post by:

Given that the string is of null-terminated type. What could be the possible causes (by experience) the string to have no null character (\0) and cause buffer overflow later. I know it is quite...

C / C++

ctypes and using c_char_p without NULL terminated string

by: kpoman | last post by:

Hi to all, I am trying to use some dll which needs some data types that I can't find in python. From the dll documentation, I am trying to use this: HRESULT IMKWsq::Compress ( VARIANT ...

Python

transforming from a non-null terminated string into a null terminatedstring

by: ssylee | last post by:

If I'm being supplied with a char* that is not null-terminated, is it impossible to transform it into a null terminated char* with only abstract information about the char* information?

C / C++

How to turn on java script in a villaon keypad mobile phone

by: Charles Arthur | last post by:

How do i turn on java script on a villaon, callus and itel keypad mobile phone

Java

Looking to do Android software development, any suggestions? Is flutter better?

by: nemocccc | last post by:

hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?

General

Is that possible of reading the .csv file in column wise and the column have different lengths ?

by: Sonnysonu | last post by:

This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...

C / C++

How to build RAID in BIOS?

by: Hystou | last post by:

There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...

Computer Hardware

What is ONU?

by: marktang | last post by:

ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...

General

Problem With Comparison Operator <=> in G++

by: Oralloy | last post by:

Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...

C / C++

Maximizing Business Potential: The Nexus of Website Design and Digital Marketing

by: jinu1996 | last post by:

In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...

Online Marketing

Discussion: How does Zigbee compare with other wireless protocols in smart home applications?

by: tracyyun | last post by:

Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...

General

AI Job Threat for Devs

by: agi2029 | last post by:

Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing,...

Career Advice