472,374 Members | 1,376 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 472,374 software developers and data experts.

Floating point arithmetic.

Hi there.
I am cross posting this on comp.lang.c as well: sorry for same.
The problem I am facing is as follows:
For example:
double a= 0.15;
double b=2.4;
const double VERYTINY =1.e-10;
I know b/a = 16 and hence the remainder is zero; but I am not
able to find any suitable thing to encode it into in c.
for example (fmod(b,a)>VERYTINY) returns true!
Now for this particular instance, (fmodf(b,a)>VERYTINY)
does return false.
But now if
a=0.15;
b=4.5;
then fmodf and fmod both don't help...

any suggestions on this?
I was pointed to a reference on floating point arithmetic, where they talk
of ulps etc, but is there a small function or fix to deal with this problem
available somewhere?

thanks,
amit.
Jul 22 '05 #1
14 2460

"Amit Bhatia" <bh*****@nospam.com> wrote:
The problem I am facing is as follows:
For example:
Example of what? It makes more sense to state the
thing you want to give an example of, before giving
examples of it.
double a= 0.15;
double b=2.4;
const double VERYTINY =1.e-10;
I know b/a = 16 and hence the remainder is zero;
but I am not able to find any suitable thing to
encode it into in c.
Define "encode".

Define "it".

If you want to code b/a in C, the code is:

b/a
for example
Example of what? Please give some clue as to the thing
you're trying to give an example OF, before giving
examples!
(fmod(b,a)>VERYTINY) returns true!
It returns false on my system. However, it seems to me
you're pushing the granularity of type double to the max.
Even if you WERE off by one part per 1e10, so what?
That's one part per ten billion. Who cares?

But if you really DO care, try acquiring a compiler that
supports type "long double". Or, perhaps, use a computer
with a 64-bit processor and compiler that defines "double"
to be 64 bits. Even if 4 or so of those bits were used
for sign and exponent, you'd still have a granularity of
about 1 part per quintillion (1e18). More precision than
you'll EVER need.
Now for this particular instance, (fmodf(b,a)>VERYTINY)
does return false.
What is fmodf? That's not a part of the C or C++ std.
libraries. I don't have it on my compiler (DJGPP) either.
But now if
a=0.15;
b=4.5;
then fmodf and fmod both don't help...
I get b/a = 30 and (fmod(b,a)>VERYTINY) = false.

What do YOU get?
any suggestions on this?
Suggestions on what, precisely?
I was pointed to a reference on floating point
arithmetic, where they talk of ulps etc,
What's "ulps"?
is there a small function or fix to deal with this problem
available somewhere?


What problem are you trying to fix?

--
Cheers,
Robbie Hatley
Tustin, CA, USA
lo**********@pacbell.net
http://home.pacbell.net/earnur/

Jul 22 '05 #2
On Sun, 11 Jul 2004 01:17:09 -0700, Robbie Hatley <lonewolfintj at pacbell
dot net> wrote:
I was pointed to a reference on floating point
arithmetic, where they talk of ulps etc,


What's "ulps"?


ULP stands for unit in the last place. Its a measure of rounding error and
a common enough term in floating point arithmetic.

john
Jul 22 '05 #3
On Sat, 10 Jul 2004 23:08:01 -0500, Amit Bhatia <bh*****@nospam.com> wrote:
Hi there.
I am cross posting this on comp.lang.c as well: sorry for same.
The problem I am facing is as follows:
For example:
double a= 0.15;
double b=2.4;
const double VERYTINY =1.e-10;
I know b/a = 16 and hence the remainder is zero; but I am not
able to find any suitable thing to encode it into in c.
for example (fmod(b,a)>VERYTINY) returns true!
Now for this particular instance, (fmodf(b,a)>VERYTINY)
does return false.
But now if
a=0.15;
b=4.5;
then fmodf and fmod both don't help...

any suggestions on this?
Maybe use a bigger value for VERYTINY.
Maybe use integral or rational aritmetic.
Maybe use fixed point arithmetic.
I was pointed to a reference on floating point arithmetic, where they
talk
of ulps etc, but is there a small function or fix to deal with this
problem
available somewhere?


There is not easy answer to the problem of floating point rounding errors.
The simplest thing is not to write code that depends on any particular
exactness of floating point arithmetic. Obviously this is not always
possible but it is possible more often than people think.

So the asnwer to your question really depends on what kind of problem you
are actually trying to solve.

john
Jul 22 '05 #4
On Sat, 10 Jul 2004 23:08:01 -0500 in comp.lang.c++, Amit Bhatia
<bh*****@nospam.com> wrote,
Hi there.
I am cross posting this on comp.lang.c as well: sorry for same.
In fact you did so a few days ago and got some good tips.
The problem I am facing is as follows:
For example:
double a= 0.15;
double b=2.4;
const double VERYTINY =1.e-10;
b = two and two fifths. Fifths can not be represented exactly in base
two floating point format, so this number cannot be stored exactly.
a is 16 times that, so of course has a similar approximation.

And so on. Never expect any form of exactitude from any floating point
operation, and your life will be less frustrating.
I know b/a = 16 and hence the remainder is zero; but I am not
able to find any suitable thing to encode it into in c.


It may be close to zero, or it may be just less than 0.15.

If you persist in this folly then I guess you need to test for both
possibilities.
Jul 22 '05 #5
Amit Bhatia <bh*****@nospam.com> wrote in message news:<cc**********@news.ks.uiuc.edu>...
Hi there.
I am cross posting this on comp.lang.c as well: sorry for same.
The problem I am facing is as follows:
For example:
double a= 0.15;
double b=2.4;
const double VERYTINY =1.e-10;
I know b/a = 16 and hence the remainder is zero; but I am not
able to find any suitable thing to encode it into in c.
for example (fmod(b,a)>VERYTINY) returns true!
Now for this particular instance, (fmodf(b,a)>VERYTINY)
does return false.
But now if
a=0.15;
b=4.5;
then fmodf and fmod both don't help...

any suggestions on this?
I was pointed to a reference on floating point arithmetic, where they talk
of ulps etc, but is there a small function or fix to deal with this problem
available somewhere?

thanks,
amit.

int main() {
double a = 0.15;
double b = 2.4;
const double VERYTINY =1.e-10;
printf("%d", (fmod(b,a) > VERYTINY));
return 0;
}

This gives me false!
Jul 22 '05 #6
Amit Bhatia <bh*****@nospam.com> wrote in message news:<cc**********@news.ks.uiuc.edu>...
Hi there.
I am cross posting this on comp.lang.c as well: sorry for same.
The problem I am facing is as follows:
For example:
double a= 0.15;
double b=2.4;
const double VERYTINY =1.e-10;
I know b/a = 16 and hence the remainder is zero; but I am not
able to find any suitable thing to encode it into in c.
for example (fmod(b,a)>VERYTINY) returns true!
Now for this particular instance, (fmodf(b,a)>VERYTINY)
does return false.
But now if
a=0.15;
b=4.5;
then fmodf and fmod both don't help...

any suggestions on this?
I was pointed to a reference on floating point arithmetic, where they talk
of ulps etc, but is there a small function or fix to deal with this problem
available somewhere?

thanks,
amit.

Try the following loop

cout.precision(20);
cout << b << endl << a << endl;
while(b>a)
b -= a;
cout << b << endl << a << endl;

and see the value of a and b

Your problem is that the method of not comparing floating
point values but comparing their difference to a certain limit
does not work for modulo arithmetic. The reason is simple.

Assume x(mod y) = 0
That is, x = yp

In your case, x and y are floating point values,
while p is an integer. Since p is an integer, you
can say x is divisible by y.

But a computer cannot store floating point numbers exactly.
so a floating point number x is actually stored as (x+dx)
or as (x-dx). In your case, 4.5 can be exactly stored, but
neither can 2.4 or 0.15 . Now consider what happens when you
divide (x+dx) or (x-dx) by y. (x,y,p > 0)

(x+dx)(mod y) = dx
This is ok, your comparison will work in this case, as your
VERYTINY is > dx

However, (x-dx)(mod y) = (yp-dx)(mod y)
= (y(p-1) + y - dx)(mod y) = y-dx
This y-dx is definitely mmuch greater than your VERYTINY
and therefore you do not get the desired result.

I'll give you an example with integers.

Consider dividing 57 by 19. 57(mod 19) = 0
58(mod 19) = 1 ( x+dx case)
56(mod 19) = 18 (x-dx case)

So you should write
( fmod(b,a)<VERYTINY || (a-fmod(b,a))<VERYTINY )

-Arijit
Jul 22 '05 #7

When using floating-point, usually we don't test for equality, instead do
the kind of test you do (fmod(b, a) > VERYTINY)...
But if we take a look at the case in particular, the test doesn't fail on my
machine, and I suspect the > VERYTINY doesn't fail in yours, except for
negative numbers. That is,

2.4 can only be finitely represented in binary as 2.3999... that is
10.0110011(0011)
0.15 as 0.14(9) (or 0.00100110011(0011))

in my machine, perhaps yours too, fmod(2.4, 0.15) is exactly 0. This is not
surprising if we examine the mantissa bits of both numbers, they are equal.

but 4.5 does have a finite binary representation as 100.1

but fmod(4.5, 0.15) is only 1.66533e-16, and so is smaller than 1e-10... and
I don't know why the test should fail....
Since the case of the denominator being represented exactly implies that a
multiple is represented exactly too

So, given best possible representations of the input values the test
(abs(fmod(b, a)) < VERYTINY)) should work.

One exception arises when input values are themselves approximations, and,
in particular, the dividend is slightly less than it would take to be an
exact multiple. In that case, the remainder will be less than, but very
close to the denominator. So, we must test the result also against it being
very close to the denominator.

So, the full test, that should work for all cases would be (assuming we want
to know about approximate multiples):

(abs(fmod(b,a)) < VERYTINY) || (abs(fmod(b,a) - a) < VERYTINY)

Now, the value you choose for VERYTINY, depends on the application and how
many operations you have done on the input values. Of course, because the
result of fmod is in absolute value always less than a, if you know your
inputs are positive numbers, you can drop both abs function calls.
Miguel Ramos
Jul 22 '05 #8
What is fmodf? That's not a part of the C or C++ std.
libraries. I don't have it on my compiler (DJGPP) either.


actually fmodf does exist, it is part of the C99 standard, and most modern
compilers have it.
but I think we shouldn't be very harsh, it's obvious he meant modf.
Jul 22 '05 #9

"Miguel" <mi****@exstare.com> wrote in message news:40***********************@news.telepac.pt...
What is fmodf? That's not a part of the C or C++ std.
libraries. I don't have it on my compiler (DJGPP) either.
actually fmodf does exist, it is part of the C99 standard,


"fmodf" is a not a keyword in C or C++, nor is it
a part of their standard libraries, according to my
reading. I found reference to "fmodf" in the
documentation for my compiler (DJGPP), where it is
listed as a "non-ANSI extention to fmod". So unless
fmodf has been added to the standard libraries very
recently, it's not standard.
and most modern compilers have it.
Most compilers have lots of added non-ANSI functions.
Useful, but not standard.
it's obvious he meant modf.


Probably.

--
Cheers,
Robbie Hatley
Tustin, CA, USA
lo**********@pacbell.net
http://home.pacbell.net/earnur/
Jul 22 '05 #10
On Sun, 11 Jul 2004 20:17:52 -0700, Robbie Hatley <lonewolfintj at pacbell
dot net> wrote:

"Miguel" <mi****@exstare.com> wrote in message
news:40***********************@news.telepac.pt...
> What is fmodf? That's not a part of the C or C++ std.
> libraries. I don't have it on my compiler (DJGPP) either.


actually fmodf does exist, it is part of the C99 standard,


"fmodf" is a not a keyword in C or C++, nor is it
a part of their standard libraries, according to my
reading. I found reference to "fmodf" in the
documentation for my compiler (DJGPP), where it is
listed as a "non-ANSI extention to fmod". So unless
fmodf has been added to the standard libraries very
recently, it's not standard.
and most modern compilers have it.


Most compilers have lots of added non-ANSI functions.
Useful, but not standard.
it's obvious he meant modf.


Probably.


Did you not read Miguel's post? fmodf is defined in the C99 standard,
section 7.12.10.1. Therefore it is a standard C function. You are talking
about the older ANSI standard I guess.

john
Jul 22 '05 #11
"John Harrison" <jo*************@hotmail.com> wrote in message
news:opsa0d3euo212331@andronicus...
On Sun, 11 Jul 2004 20:17:52 -0700, Robbie Hatley <lonewolfintj at pacbell
dot net> wrote:

"Miguel" <mi****@exstare.com> wrote in message
news:40***********************@news.telepac.pt...

> What is fmodf? That's not a part of the C or C++ std.
> libraries. I don't have it on my compiler (DJGPP) either.

actually fmodf does exist, it is part of the C99 standard,


"fmodf" is a not a keyword in C or C++, nor is it
a part of their standard libraries, according to my
reading. I found reference to "fmodf" in the
documentation for my compiler (DJGPP), where it is
listed as a "non-ANSI extention to fmod". So unless
fmodf has been added to the standard libraries very
recently, it's not standard.
and most modern compilers have it.


Most compilers have lots of added non-ANSI functions.
Useful, but not standard.
it's obvious he meant modf.


Probably.


Did you not read Miguel's post? fmodf is defined in the C99 standard,
section 7.12.10.1. Therefore it is a standard C function. You are talking
about the older ANSI standard I guess.


Even C89 provided for *f and *l versions of the standard math functions.
They were permitted but not required. So fmodf has been around for a
long time. None of which alters the fact that it *still* probably
should have been modf in the posting.

P.J. Plauger
Dinkumware, Ltd.
http://www.dinkumware.com
Jul 22 '05 #12
Guys,
I was indeed using fmodf and fmod: I use g++ version 3.2 on rh 9.0 (with
-ansi compilation flag) to find the remainder when two double precision
values are divided one of which is exact multiple of the other.
thanks, for pointing me out that it is not exactly computable and I need to
be a bit careful while doing this stuff. ;) Anyway, I am using a slightly
different check now which was suggested by Arijit and which works well.

thanks,
amit.
Jul 22 '05 #13
John Harrison chastised me thusly:
Did you not read Miguel's post?
Oh, I read it; I just didn't believe it, because the
books available to me said otherwise.
fmodf is defined in the C99 standard, section
7.12.10.1. Therefore it is a standard C function.
Quoting section numbers on my ass, eh? :-)
You are talking about the older ANSI standard I guess.


Yes, I suppose my books are all based on C90, not C99.

I see you're going to throw the book at me, so I'd
better get this standard you guys keep talking about.

::: gets standard :::

Now, let me see... Yes, here we go...

Pursuant to ISO/IEC 9899:1999, 7.12.10.1, paragraph 1,
the header math.h shall contain:

double fmod (double x, double y)
float fmodf (float x, float y)
long double fmodl (long double x, long double y)

OK, it seems that you guys were right.

But wait, that's the C standard, not C++.

Pursuant to ISO/IEC 14882, 26.5, paragraph 6, I see
that in standard C++, the <cmath> header includes:

double fmod (double x, double y)
float fmod (float x, float y)
long double fmod (long double x, long double y)

Uses overloaded versions of fmod, instead of fmodf and
fmodl.

26.5, paragraph 2 does say "the contents of these
headers (<cmath> and <cstdlib>) are the same as the C
Standard Library headers math.h and stdlib.h, with the
following additions...", which would seem to imply that
fmodf, fmodl, and many other such library functions
are a part of standard C++. But then, 14882 was
written in 1998, so "Standard C" actually meant C90,
so one could argue that fmodf, fmodl, etc. are therefore
NOT part of standard C++. So perhaps I was right, after
all. :-)

--
Feeling argumentative,
Robbie Hatley
Tustin, CA, USA
email: lonewolfintj at pacbell dot net
web: home dot pacbell dot net slant earnur slant
Jul 22 '05 #14
"Robbie Hatley" <lonewolfintj at pacbell dot net> wrote in message
news:40**********@127.0.0.1...
Pursuant to ISO/IEC 9899:1999, 7.12.10.1, paragraph 1,
the header math.h shall contain:

double fmod (double x, double y)
float fmodf (float x, float y)
long double fmodl (long double x, long double y)

OK, it seems that you guys were right.

But wait, that's the C standard, not C++.

Pursuant to ISO/IEC 14882, 26.5, paragraph 6, I see
that in standard C++, the <cmath> header includes:

double fmod (double x, double y)
float fmod (float x, float y)
long double fmod (long double x, long double y)

Uses overloaded versions of fmod, instead of fmodf and
fmodl.

26.5, paragraph 2 does say "the contents of these
headers (<cmath> and <cstdlib>) are the same as the C
Standard Library headers math.h and stdlib.h, with the
following additions...", which would seem to imply that
fmodf, fmodl, and many other such library functions
are a part of standard C++. But then, 14882 was
written in 1998, so "Standard C" actually meant C90,
so one could argue that fmodf, fmodl, etc. are therefore
NOT part of standard C++. So perhaps I was right, after
all. :-)


Except that you aren't. The float and long double math
functions that are optional in C90 are explicitly required
in C++98.

P.J. Plauger
Dinkumware, Ltd.
http://www.dinkumware.com
Jul 22 '05 #15

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

4
by: Roger Leigh | last post by:
Hello, I'm writing a fixed-precision floating point class, based on the ideas in the example fixed_pt class in the "Practical C++ Programming" book by Steve Oualline (O' Reilly). This uses a...
3
by: Mantorok Redgormor | last post by:
What have some of you guys read to have a solid understanding of how floating-point numbers are represented or handled by the processor and what the difference between single and double precision...
687
by: cody | last post by:
no this is no trollposting and please don't get it wrong but iam very curious why people still use C instead of other languages especially C++. i heard people say C++ is slower than C but i can't...
24
by: j0mbolar | last post by:
C supports single precision floating point and double precision floating point but does it support fixed floating point? i've read that fixed floating point is more accurate than single precision...
10
by: Shawn | last post by:
Hello all, I apologize as I am sure this has probably been dealth with before... but I am doing an exercise from "Practical C Programming" and I have been unable to get it to work perfectly due to...
7
by: Vinoth | last post by:
I'm working in an ARM (ARM9) system which does not have Floating point co-processor or Floating point libraries. But it does support long long int (64 bits). Can you provide some link that would...
1
by: Satpreet | last post by:
I'm looking to simulate the behavior of a digital hardware arithmetic block in a C/C++ program. I was just wondering if there are any libraries (with datatypes and overloaded operators etc.)...
32
by: ma740988 | last post by:
template <class T> inline bool isEqual( const T& a, const T& b, const T epsilon = std::numeric_limits<T>::epsilon() ) { const T diff = a - b; return ( diff <= epsilon ) && ( diff >= -epsilon );...
70
by: Robert Gamble | last post by:
9899:1999 5.1.2.3 Example 4 reads: "EXAMPLE 4 Implementations employing wide registers have to take care to honor appropriate semantics. Values are independent of whether they are represented in a...
2
by: Kemmylinns12 | last post by:
Blockchain technology has emerged as a transformative force in the business world, offering unprecedented opportunities for innovation and efficiency. While initially associated with cryptocurrencies...
0
hi
by: WisdomUfot | last post by:
It's an interesting question you've got about how Gmail hides the HTTP referrer when a link in an email is clicked. While I don't have the specific technical details, Gmail likely implements measures...
0
Oralloy
by: Oralloy | last post by:
Hello Folks, I am trying to hook up a CPU which I designed using SystemC to I/O pins on an FPGA. My problem (spelled failure) is with the synthesis of my design into a bitstream, not the C++...
0
by: Carina712 | last post by:
Setting background colors for Excel documents can help to improve the visual appeal of the document and make it easier to read and understand. Background colors can be used to highlight important...
0
BLUEPANDA
by: BLUEPANDA | last post by:
At BluePanda Dev, we're passionate about building high-quality software and sharing our knowledge with the community. That's why we've created a SaaS starter kit that's not only easy to use but also...
0
by: Rahul1995seven | last post by:
Introduction: In the realm of programming languages, Python has emerged as a powerhouse. With its simplicity, versatility, and robustness, Python has gained popularity among beginners and experts...
1
by: Johno34 | last post by:
I have this click event on my form. It speaks to a Datasheet Subform Private Sub Command260_Click() Dim r As DAO.Recordset Set r = Form_frmABCD.Form.RecordsetClone r.MoveFirst Do If...
1
by: ezappsrUS | last post by:
Hi, I wonder if someone knows where I am going wrong below. I have a continuous form and two labels where only one would be visible depending on the checkbox being checked or not. Below is the...
0
by: jack2019x | last post by:
hello, Is there code or static lib for hook swapchain present? I wanna hook dxgi swapchain present for dx11 and dx9.

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.