Reading PODs Cross-Compiler

Bill Woessner wrote:
....

>
Is there a guaranteed way to accomplish this?

Test and see ?

... I'm guessing the reason
it's not working has to do with padding and byte-alignment. Is there
some way to force the two compilers to agree on padding and byte-
alignment?

Not by the C++ standard.

However, I have done exactly what you describe across platforms and
endianness for many different compilers.

Avoid bitfields.

Many (if not most) compilers support #pragma pack(). Even compilers
that require word alignment for reading and writing words allow you to
read and write from structs that contain words that are not aligned.
(albeit using more cycles).

In dealing with endianness I have used a template class - check this out:

http://groups.google.com/group/comp....16f3add?hl=en&

Is it guaranteed to work on every platform ever - probably not. It will
more than likely work on every platform you care about for now.

I've used different techniques to serialize/deserialize since I used
this so I can't say that this is the best way to go but if you're
already reading/writing POD structs then this might work for you.

NO guarentees but likely to work and not break for a long long time.

May 18 '07 #4

James Kanze

On May 18, 7:59 pm, Bill Woessner <woess...@gmail.comwrote:

Suppose I have a structure, foo, which is a POD. I would like to read
and write it to disk as follows:

std::ofstream outs;
foo bar;
outs.write(reinterpret_cast<char*>(&bar), sizeof(foo));
...
std::ifstream ins;
foo bar;
ins.read(reinterpret_cas<char*>(&bar), sizeof(foo));

This works fine if the code doing the reading and writing are compiled
with the same compiler. However, I now have a situation where I would
need to write with one compiler and read with another. Fortunately,
the reading and writing will occur on the same platform, so endianness
is not a problem.

Is there a guaranteed way to accomplish this? I'm guessing the reason
it's not working has to do with padding and byte-alignment. Is there
some way to force the two compilers to agree on padding and byte-
alignment?

No. There's not even a means of forcing two different
compilers, or two different versions of the same compiler, to
agree on byte order. Note too that if you are writing the data
to disk, you presumably want to read it later in time.
Including after an upgrade of the system. So you have to write
something that is compatible with all future compilers on as yet
unknown systems as well.

The solution to this is well known. Just define a format (or
use XDR, which is more or less a quasi standard for this sort of
thing), and format the data to whatever format you define. It's
rather simple, in fact (as long as you don't have floating
point), although it does involve writing code.

--
James Kanze (Gabi Software) email: ja*********@gmail.com
Conseils en informatique orientée objet/
Beratung in objektorientierter Datenverarbeitung
9 place Sémard, 78210 St.-Cyr-l'École, France, +33 (0)1 30 23 00 34

May 19 '07 #5

James Kanze

On May 19, 12:33 am, Gianni Mariani <gi3nos...@mariani.wswrote:

Bill Woessner wrote:

...

Is there a guaranteed way to accomplish this?

Test and see ?

Test what? How do you test that something will work with the
next release of the compiler, or when you upgrade hardware (from
32 bits to 64).

The real problem here, of course, isn't whether to test, but
what to test. He definitly should test the code, once he's
written it. But before even starting to write it, he should
define what it is supposed to do, i.e. the actual format on the
disk. Without doing that, he doesn't know what to test. (A
test for this sort of thing might involve writing known data to
disk, then reading it using something like "od -t x1", and
verifying that the file contains exactly the bytes he expects.)

Obviously, whatever format he chooses, and whatever shortcuts he
chooses to implement it:

-- he needs to document it, so that future programmers will
know what he is doing, and

-- he needs very rigorous tests of the code (especially if he
is cutting corners), so that the code can be validated on
any future platforms.

(As an example of what I mean by cutting corners: if all of his
current target machines use 32 bit 2's complement integers, and
are little endian, then he might decide to use this as his
integral format, knowing that memcpy will format correctly.
While probably not worth the bother for integral types, I
regularly write code which supposes that floats and doubles are
IEEE---formatting the output for IEEE when this is not the case
is a significant amount of work, and as long as my code doesn't
have to be ported to anything but the mainstream Unix machines
or PC's, there's no point in it.)

[...]

Is it guaranteed to work on every platform ever - probably not. It will
more than likely work on every platform you care about for now.

Sort of a superficial point of view, don't you think. You
should at least document the restrictions.

--
James Kanze (Gabi Software) email: ja*********@gmail.com
Conseils en informatique orientée objet/
Beratung in objektorientierter Datenverarbeitung
9 place Sémard, 78210 St.-Cyr-l'École, France, +33 (0)1 30 23 00 34

May 19 '07 #6

James Kanze wrote:
....

>Is it guaranteed to work on every platform ever - probably not. It will
more than likely work on every platform you care about for now.

Sort of a superficial point of view, don't you think. You
should at least document the restrictions.

I suspect you can write a unit test for that as well !

May 19 '07 #7

James Kanze wrote:

On May 18, 7:59 pm, Bill Woessner <woess...@gmail.comwrote:

....

>
>Is there a guaranteed way to accomplish this? I'm guessing the reason
it's not working has to do with padding and byte-alignment. Is there
some way to force the two compilers to agree on padding and byte-
alignment?

No. There's not even a means of forcing two different
compilers, or two different versions of the same compiler, to
agree on byte order. Note too that if you are writing the data
to disk, you presumably want to read it later in time.
Including after an upgrade of the system. So you have to write
something that is compatible with all future compilers on as yet
unknown systems as well.

The solution to this is well known. Just define a format (or
use XDR, which is more or less a quasi standard for this sort of
thing), and format the data to whatever format you define. It's
rather simple, in fact (as long as you don't have floating
point), although it does involve writing code.

I just thought - cool. Let's check it out. I'm somewhat dissapointed.

XDR has some significant issues AFAICT.

a) Limited to 32 bit lengths. Yes - laugh. The last project I needed
to support serialization required greater than 4 gig files. Ran on a 64
bit machine.

b) Why the null on the string when you have a length ? Hugh ?

c) when you have large numbers of small strings you can have a big file.

d) Why the rounding to 4 bytes ? Really, this is a file format.

e) Container type is limited to array. I don't want to have to specify
yet another transformation once the data is ready into internal data
structures. When you're pushing large amounts of data, this can be an
issue.

f) Forward and reverse compatability. I don't see (I could be mistaken)
that an older version of the spec can read a file's data, modify some
values and write out the file and still preserve the data that was not
decipherable.

In the last system I wrote, it was important that the system could be
downgraded and upgraded and data preserved. Using the system we
designed, it was possible, as long as the definition of the serialized
data was never regressive, i.e. nothing in the data spec was backed out.

In a file format specification, it is also important to specify a unique
header and possibly footer. In a system where you have multiple data
formats, it's a type safety check.

In the next system I write, I think I will specify that all integers be
a more compressed format, especially lengths. Most lengths are less
than 247 so wasting 3 extra bytes everywhere means you have a file with
lots of zero bytes. If it's not a file but a packet you're pushing over
a net, it adds up to a significant overhead for some data streams. A
format something like :

0-247 - 1 byte - the value.
first byte is 248 - next 1 byte + 248.
first byte is 249 - next 2 bytes + 256 + 248
first byte is 250 - next 3 bytes + 65536 + 256 + 248
first byte is 251 - next 4 bytes + 2^24 + 65536 + 256 + 248
first byte is 252 - next 5 bytes + 2^32 + 2^24 + 65536 + 256 + 248
first byte is 253 - next 6 bytes + 2^40 + 2^32 + 2^24 + 65536 + 256 + 248
etc ...

- Every number has a single representation (critical).
- Supports encoding 128 bit ints (possibly modify to support very large
numbers)

As for a "language", it's important to have a common data definition
format, however I think that can be much simpler than XDR.

G

May 19 '07 #8

James Kanze

On May 19, 7:44 pm, Gianni Mariani <gi3nos...@mariani.wswrote:

James Kanze wrote:
On May 18, 7:59 pm, Bill Woessner <woess...@gmail.comwrote:

[...]

The solution to this is well known. Just define a format (or
use XDR, which is more or less a quasi standard for this sort of
thing), and format the data to whatever format you define. It's
rather simple, in fact (as long as you don't have floating
point), although it does involve writing code.

I just thought - cool. Let's check it out. I'm somewhat dissapointed.

With XDR? I'm not really surprised. It's nothing exceptional.
But it does cover a lot of useful cases, and in cases where it
is sufficient, it's easier to refer to the XDR specification
than to invent and to write your own. And it has the advantage
of being wide spread.

Of course, if you want something more complete, there's always
BER:-).

--
James Kanze (Gabi Software) email: ja*********@gmail.com
Conseils en informatique orientée objet/
Beratung in objektorientierter Datenverarbeitung
9 place Sémard, 78210 St.-Cyr-l'École, France, +33 (0)1 30 23 00 34

May 19 '07 #9

New idea for internet documents reading

James Kanze wrote:

On May 19, 7:44 pm, Gianni Mariani <gi3nos...@mariani.wswrote:
>James Kanze wrote:
>>On May 18, 7:59 pm, Bill Woessner <woess...@gmail.comwrote:

[...]

>>The solution to this is well known. Just define a format (or
use XDR, which is more or less a quasi standard for this sort of
thing), and format the data to whatever format you define. It's
rather simple, in fact (as long as you don't have floating
point), although it does involve writing code.

>I just thought - cool. Let's check it out. I'm somewhat dissapointed.

With XDR? I'm not really surprised. It's nothing exceptional.
But it does cover a lot of useful cases, and in cases where it
is sufficient, it's easier to refer to the XDR specification
than to invent and to write your own. And it has the advantage
of being wide spread.

Of course, if you want something more complete, there's always
BER:-).

I don't remember why I gave up on ASN.1 ...

May 19 '07 #10

Similar topics

by: Luca | last post by:

Hello Everybody, I'm a 26 years old Italian "Florentine" Computer technician :) I'm writing you about an idea that I've got of a function that could be introduced in new web browsers (or even...

.NET Framework

classes wrting/reading to binary file

by: nightflyer | last post by:

Hi all, [code snippet appended at the end.) my question: A class has a few string variables with not know length at design time. Now I declare lets say a 1000 of those classes and put them...

Reading the value of an "xsi:type" attribute [unfortunate, but necessary cross-posting]

by: Carl Lindmark | last post by:

*Cross-posting from microsoft.public.dotnet.languages.csharp, since I believe the question is better suited in this XML group* Hello all, I'm having some problems understanding all the ins and...

.NET Framework

Reading file contents to unmanaged memory block (IntPtr) from C#

by: TT (Tom Tempelaere) | last post by:

Hi, In my project I need to use VirtualAlloc (kernel32) to allocate memory to ensure that data is 4K aligned. I need to fill the data block with file contents. I notice that there are no 'Read'...

C# / C Sharp

memset on structs with non-PODs

by: Patrick Kowalzick | last post by:

Dear all, I have an existing piece of code with a struct with some PODs. struct A { int x; int y; };

Reading the clipboard in Mozilla

by: Phil Endecott | last post by:

Dear All, I'm trying to read the content of the clipboard in a cross-browser way. Google will find various scripts such as this one: ...

Javascript

Reading text files with javascript

by: paulnightingale | last post by:

Hi I've got a ticker tape that is written in Java Script 1.2 which displays text that has to be currently changed in the program code. What I want to do is to find the bit of javascript to get the...

Javascript

Cross Thread Exception after reading Asynchronous Serial Port

by: Mo | last post by:

I am trying to set a text box value when data is received from the com port (barcode reader). I am getting the following error when I try to set the text box TXNumber after data is received ...

C# / C Sharp

Why can't PODs have constructors?

by: JohnQ | last post by:

There must be something at the implementation level that makes the standard disallow constructors in PODs (?). What is that? Don't most implementations just break out the constructor member...

Reading an array of numbers

by: The Ax | last post by:

I want to read this array backwards. http://hiscore.runescape.com/index_lite.ws?player=grimmstriker Its not my site so I cant find the arrays name instead of: 1319710,847,1620107 -1,-1,-1 i...

Javascript

Migrating Website to Cloud - Emmanuel Katto

by: emmanuelkatto | last post by:

Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel

Navigating the Data Structures and Algorithms (DSA)

by: BarryA | last post by:

What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...

Algorithms / Advanced Math

Looking to do Android software development, any suggestions? Is flutter better?

by: nemocccc | last post by:

hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?

How to build RAID in BIOS?

by: Hystou | last post by:

There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...

Computer Hardware

What is ONU?

by: marktang | last post by:

ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...

Problem With Comparison Operator <=> in G++

by: Oralloy | last post by:

Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...

Maximizing Business Potential: The Nexus of Website Design and Digital Marketing

by: jinu1996 | last post by:

In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...

Online Marketing

Discussion: How does Zigbee compare with other wireless protocols in smart home applications?

by: tracyyun | last post by:

Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...