reading Java floats from C

sbalko

Hi,

I am trying to read Java-floats (IEEE 754 encoding) stored in a binary
file from C (gcc on linux/i386, more specifically). Unfortunately, C
seems to expect floats to be stored somewhat differently than Java
does. I suspected an endianess problem and tried out ntohl/htonl but it
doesn't help.

Any clues?

Thanks,
Sören

Nov 14 '05 #1

Subscribe Reply

5466

Lawrence Kirby

On Mon, 20 Jun 2005 12:52:31 -0700, sbalko wrote:

Hi,

I am trying to read Java-floats (IEEE 754 encoding) stored in a binary
file
The question there would be how Java stores floats in a file, which would
depend in the code used to store them. From the information given there's
no reason to assume that the Java code is storing them in the same binary
format used internally. I think you'll need to discuss this in a Java
related newsgroup.
from C (gcc on linux/i386, more specifically). Unfortunately, C
seems to expect floats to be stored somewhat differently than Java
does. I suspected an endianess problem and tried out ntohl/htonl but it
doesn't help.

C doesn't specify the representation used by floating point types,
although IEEE 754 is typical. If you give some information about the
file format the Java code is using, and the C code you are using to read
the values we should be able to help you.

Lawrence

Nov 14 '05 #2

sbalko

Lawrence Kirby schrieb:

I am trying to read Java-floats (IEEE 754 encoding) stored in a binary
file

The question there would be how Java stores floats in a file, which would
depend in the code used to store them. From the information given there's
no reason to assume that the Java code is storing them in the same binary
format used internally. I think you'll need to discuss this in a Java
related newsgroup.

On the java side, I am using DataOutputStrea m's writeFloat method which
explicitly uses IEEE 754 to encode a float into 4 bytes.

from C (gcc on linux/i386, more specifically). Unfortunately, C
seems to expect floats to be stored somewhat differently than Java
does. I suspected an endianess problem and tried out ntohl/htonl but it
doesn't help.

C doesn't specify the representation used by floating point types,
although IEEE 754 is typical. If you give some information about the
file format the Java code is using, and the C code you are using to read
the values we should be able to help you.

Actually the java file is a plain format with intermixed ASCII and
subsequently stored floats . On the C side, things are a bit more
complex. I am using mmap to map the file to a main memory address
(casted to a char* pointer). Then I memcpy 4 bytes from certain offsets
in the buffer to a float variable. I also tried to apply ntohl on that
float but that doesn't solve my problem either.

Nov 14 '05 #3

Malcolm

<sb****@gmail.c om> wrote

Actually the java file is a plain format with intermixed ASCII and
subsequently stored floats . On the C side, things are a bit more
complex. I am using mmap to map the file to a main memory address
(casted to a char* pointer). Then I memcpy 4 bytes from certain offsets
in the buffer to a float variable. I also tried to apply ntohl on that
float but that doesn't solve my problem either.

Do you know how floating point numbers are generally constructed?

By doing some experiments you ought to be able to work out what
representation your Java platform and C compiler uses, and to convert. Watch
out for special cases like nan, infinity, and very small numbers.

Nov 14 '05 #4

Lawrence Kirby

On Mon, 20 Jun 2005 15:34:53 -0700, sbalko wrote:

....

Actually the java file is a plain format with intermixed ASCII and
subsequently stored floats .
I suggest you log the representation of the data you've read in. You
access the representation of an object by treating it as an array of
unsigned char e.g.

TYPE var = value;
const unsigned char *ptr = (const unsigned char *)&var;

for (i = 0; i < sizeof var; i++)
printf(" %02x", ptr[i]);

Also do this for the same values set in the C environment. You should then
be able to see if

a) you've read the data in correctly

b) how the Java and C representations correspond.
On the C side, things are a bit more
complex. I am using mmap to map the file to a main memory address
(casted to a char* pointer). Then I memcpy 4 bytes from certain offsets
in the buffer to a float variable. I also tried to apply ntohl on that
float but that doesn't solve my problem either.

ntohl isn't a standard C library function. Given the common socket related
definition of it you can't apply it directly to a float, you would be
converting the value to a long, swapping bytes and converting back again
which will give a completely wrong result.

Instead of using memcpy() to copy into the float try code that copies the
bytes to the float object in reverse order. E.g.

void unmarshall_floa t(float *fl, const unsigned char *data)
{
unsigned char *flrep = (unsigned char *)fl;
int i;

for (i = 0; i < sizeof(float); i++)
flrep[i] = data[sizeof(float)-1-i];
}

Lawrence

Nov 14 '05 #5

Joe Wright

Lawrence Kirby wrote:

On Mon, 20 Jun 2005 15:34:53 -0700, sbalko wrote:

...

Actually the java file is a plain format with intermixed ASCII and
subsequentl y stored floats .

I suggest you log the representation of the data you've read in. You
access the representation of an object by treating it as an array of
unsigned char e.g.

TYPE var = value;
const unsigned char *ptr = (const unsigned char *)&var;

for (i = 0; i < sizeof var; i++)
printf(" %02x", ptr[i]);

Also do this for the same values set in the C environment. You should then
be able to see if

a) you've read the data in correctly

b) how the Java and C representations correspond.

On the C side, things are a bit more
complex. I am using mmap to map the file to a main memory address
(casted to a char* pointer). Then I memcpy 4 bytes from certain offsets
in the buffer to a float variable. I also tried to apply ntohl on that
float but that doesn't solve my problem either.

ntohl isn't a standard C library function. Given the common socket related
definition of it you can't apply it directly to a float, you would be
converting the value to a long, swapping bytes and converting back again
which will give a completely wrong result.

Instead of using memcpy() to copy into the float try code that copies the
bytes to the float object in reverse order. E.g.

void unmarshall_floa t(float *fl, const unsigned char *data)
{
unsigned char *flrep = (unsigned char *)fl;
int i;

for (i = 0; i < sizeof(float); i++)
flrep[i] = data[sizeof(float)-1-i];
}

Lawrence

I would, if possible, coerce Java to write text like '1.23456789e2' for
floats. Convert them with strtod() on the C side.

--
Joe Wright
"Everything should be made as simple as possible, but not simpler."
--- Albert Einstein ---

Nov 14 '05 #6

Robert Maas, see http://tinyurl.com/uh3t

(I've cross-posted this to comp.programmin g where it's more relevant.
Also I've blacked out the specific names of programming languages
because that's irrelevant to my general answer.)

From: sb****@gmail.co m
I am trying to read ###-floats (IEEE ??? encoding) stored in a binary
file from %%% (??? on ???, more specifically). Unfortunately, %%% seems
to expect floats to be stored somewhat differently than ### does. I
suspected an endianess problem and tried out ntohl/htonl but it
doesn't help. Any clues?

If you can't find such an answer from online documents, why didn't you
just do some experiments? For example, try this to see how ### writes
floats in binary mode: Write a test program that writes out exactly
five values of exactly 0.0, then write out these values in sequence:
9.0 0.0 10.0 0.0 11.0 0.0 12.0 0.0 13.0 0.0 14.0 0.0 15.0 0.0, and then
examine the resultant file to see if you can find:
- The same exact pattern repeating exactly five times before it's
broken by other patterns not the same, to show you what the 0.0 looks
like in binary file format.
- Alternating original pattern and other patterns the same length, to
make sure you haven't accidently used different precision for the
non-zero values generated from the index variable in your loop and the
zero values generated by literals.
- Among those non-zero groups of bytes, see if you can find a bit
pattern that goes somewhat like this:
1001
1010
1011
1100
1101
1110
1111
The '1' might be missing if it's in a notation where the 1 is assumed
rather than explicit, but the other bits should follow that pattern.

At that point you have a good idea where the mantissa is located. Now
to find where the exponent is located, generate this sequence:
0.0 1.0 0.0 2.0 0.0 4.0 0.0 8.0 0.0 16.0 0.0 32.0 0.0 64.0 0.0
You should see a similar pattern in the bits.

Finally you need to know how negative numbers are expressed.
I leave that as an exercise for the reader.

Once you know all that for ###, do the same for %%%.
Write that test data from the program that will be doing reading.
(Unless it's totally broken, it should write data in the same layout
that it expects to read it in.)

Now compare what you learned about ### and %%%, whether sequence of
bytes is the only difference, or there's a more complicated difference
in representation.

Nov 15 '05 #7

Similar topics

2905

Need speed increase while reading large chunk of data.

by: Darsant | last post by:

I'm currently reading 1-n number of binary files, each with 3 different arrays of floats containing about 10,000 values a piece for a total of about 30,000 values per file. I'm looking for a way to load them all into memory. I've tried using vector pushback with reserving, but it was horribly slow. The current method I am using is upon opening the file and reading the number of values, resizing the vectors (I have 3, one for each data...

C / C++

4221

reading a binary file in C++ and having trouble.

by: laclac01 | last post by:

So I am converting some matlab code to C++. I am stuck at one part of the code. The matlab code uses fread() to read in to a vector a file. It's a binary file. The vector is made up of floats, which in matlab is 32 bits. How do I get this binary file in to floats in c++? I try reading the file using the ifstream>>myFloat. But nothing ever goes in to the float. So the closest I have come is having 4 unsigned char store the binary data....

by: napi | last post by:

I think you would agree with me that a C compiler that directly produces Java Byte Code to be run on any JVM is something that is missing to software programmers so far. With such a tool one could stay with C and still be able to produce Java byte code for platform independent apps. Also, old programs (with some tweaking) could be re-compiled and ported to the JVM. We have been developing such a tool over the last 2 years and currently...

C / C++

5984

Reading a large number of text files into an array

by: Matthew Crema | last post by:

Hello, Say I have 1000 text files and each is a list of 32768 integers. I have written a C program to read this data into a large matrix. I am using fopen in combination with fscanf to read the data in. However, it takes about 20 seconds to complete and I wonder if there is a faster way. For example, I found that I could use 'fread' to read the data into a string that looks like this:

C / C++

3180

Reading floats from a binary file

by: Matt McGonigle | last post by:

Hi all, Please help me out with this. Perhaps it is a dumb question, but I can't seem to make it work. I am doing a file conversion using an unformatted binary file for input and outputting to a normal text file. I need to read in a float from the binary file, but it sets my input stream to failbit. Is there a special way I can read in floats? All I am doing is

.NET Framework

1812

c file reading

by: stewart_bristol | last post by:

can someone direct me to some compilable example code which reads in floating points or integers from a tab delimited file (either to variables or arrays) please? this is driving me mad today thanks stew

C / C++

3700

Dive Into Java?

by: erikcw | last post by:

DiveIntoPython.org was the first book I read on python, and I really got a lot out of it. I need to start learning Java (to maintain a project I've inherited), and was wondering if anyone knew of any similar books for Java? Maybe once I know my way around the language, I can sneak Jython in... :-) Thanks! Erik

Python

14992

Problem: "java.lang.OutOfMemoryError: Java heap space" while reading xml using SAX

by: blazedaces | last post by:

Ok, so you know my problem, java is running out of memory reading with SAX, the event-based xml parser intended more-so than DOM for extremely large files. I'll try to explain what I've been doing and why I have to do it. Hopefully someone has a suggestion... Alright, so I'm using a gps-simulation program that outputs gps data, like longitude, lattitude, altitude, etc. (hundreds of terms, these are just the well known ones). In the newer...

Java

4134

strings (dollar.cents) into floats

by: luca bertini | last post by:

Hi, i have strings which look like money values (ie 34.45) is there a way to convert them into float variables? everytime i try I get this error: "numb = float(my_line) ValueError: empty string for float()" " here's the code ************

Python

9617

What is ONU?

by: marktang | last post by:

ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However, people are often confused as to whether an ONU can Work As a Router. In this blog post, we’ll explore What is ONU, What Is Router, ONU & Router’s main usage, and What is the difference between ONU and Router. Let’s take a closer look ! Part I. Meaning of...

General

9453

Changing the language in Windows 10

by: Hystou | last post by:

Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can effortlessly switch the default language on Windows 10 without reinstalling. I'll walk you through it. First, let's disable language synchronization. With a Microsoft account, language settings sync across devices. To prevent any complications,...

Windows Server

8929

AI Job Threat for Devs

by: agi2029 | last post by:

Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing, and deployment—without human intervention. Imagine an AI that can take a project description, break it down, write the code, debug it, and then launch it, all on its own.... Now, this would greatly impact the work of software developers. The idea...

Career Advice

7451

Access Europe - Using VBA to create a class based on a table - Wed 1 May

by: isladogs | last post by:

The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new presenter, Adolph Dupré who will be discussing some powerful techniques for using class modules. He will explain when you may want to use classes instead of User Defined Types (UDT). For example, to manage the data in unbound forms. Adolph will...

Microsoft Access / VBA

6710

Couldn’t get equations in html when convert word .docx file to html file in C#.

by: conductexam | last post by:

I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and then checking html paragraph one by one. At the time of converting from word file to html my equations which are in the word document file was convert into image. Globals.ThisAddIn.Application.ActiveDocument.Select();...

C# / C Sharp

5354

Trying to create a lan-to-lan vpn between two differents networks

by: TSSRALBI | last post by:

Hello I'm a network technician in training and I need your help. I am currently learning how to create and manage the different types of VPNs and I have a question about LAN-to-LAN VPNs. The last exercise I practiced was to create a LAN-to-LAN VPN between two Pfsense firewalls, by using IPSEC protocols. I succeeded, with both firewalls in the same network. But I'm wondering if it's possible to do the same thing, with 2 Pfsense firewalls...

Networking - Hardware / Configuration

4007

transfer the data from one system to another through ip address

by: 6302768590 | last post by:

Hai team i want code for transfer the data from one system to another through IP address by using C# our system has to for every 5mins then we have to update the data what the data is updated we have to send another system

C# / C Sharp

3607

How to add payments to a PHP MySQL app.

by: muto222 | last post by:

How can i add a mobile payment intergratation into php mysql website.

PHP

2849

Comprehensive Guide to Website Development in Toronto: Expert Insights from BSMN Consultancy

by: bsmnconsultancy | last post by:

In today's digital era, a well-designed website is crucial for businesses looking to succeed. Whether you're a small business owner or a large corporation in Toronto, having a strong online presence can significantly impact your brand's success. BSMN Consultancy, a leader in Website Development in Toronto offers valuable insights into creating effective websites that not only look great but also perform exceptionally well. In this comprehensive...

General