C-faq q13.20 - C / C++

Thomas Lumley

One of the examples in q13.20 in the FAQ
(http://c-faq.com/lib/sd16.html, linked from
http://c-faq.com/lib/gaussian.html) seems to be wrong.

The full code is at that link. The part of the code that actually does
the work is

y = (double) rand() / (RAND_MAX + 1.0); /* 0.0 <= y < 1.0 */
bin = (y < 0.5) ? 0 : 1;
y = fabs(y - 1.0); /* 0.0 < y <= 1.0 */
y = std_dev * sqrt((-2.0) * log(y));

return bin ? (mean + y) : (mean - y);

The code is supposed to generate Gaussian random variables, but with
mean=0 and std_dev=1 the smallest number it can generate is
sqrt(-2*log(0.5)), about -1.177. The distribution is two-humped, and
so not even vaguely Gaussian.

In the interests of making this message more c.l.c-conforming I should
probably also add a coding quibble:
bin = (y < 0.5) ? 0 : 1;
should just be
bin = y<0.5;
-thomas

Nov 18 '06 #1

Subscribe Post Reply

1374

Spiros Bousbouras

Thomas Lumley wrote:

One of the examples in q13.20 in the FAQ
(http://c-faq.com/lib/sd16.html, linked from
http://c-faq.com/lib/gaussian.html) seems to be wrong.

The full code is at that link. The part of the code that actually does
the work is

y = (double) rand() / (RAND_MAX + 1.0); /* 0.0 <= y < 1.0 */
bin = (y < 0.5) ? 0 : 1;
y = fabs(y - 1.0); /* 0.0 < y <= 1.0 */
y = std_dev * sqrt((-2.0) * log(y));

return bin ? (mean + y) : (mean - y);

Your first link doesn't contain any code. The code
at the second link does not contain anywhere fabs().
So I have no idea where you got the code you posted
above.

Nov 19 '06 #2

Registered User

Spiros Bousbouras wrote:

Thomas Lumley wrote:
One of the examples in q13.20 in the FAQ
(http://c-faq.com/lib/sd16.html, linked from
http://c-faq.com/lib/gaussian.html) seems to be wrong.

The full code is at that link. The part of the code that actually does
the work is

y = (double) rand() / (RAND_MAX + 1.0); /* 0.0 <= y < 1.0 */
bin = (y < 0.5) ? 0 : 1;
y = fabs(y - 1.0); /* 0.0 < y <= 1.0 */
y = std_dev * sqrt((-2.0) * log(y));

return bin ? (mean + y) : (mean - y);

Your first link doesn't contain any code.

Look again. There's a link to the code he's talking about.

Nov 19 '06 #3

Spiros Bousbouras

Registered User wrote:

Spiros Bousbouras wrote:
Thomas Lumley wrote:
One of the examples in q13.20 in the FAQ
(http://c-faq.com/lib/sd16.html, linked from
http://c-faq.com/lib/gaussian.html) seems to be wrong.
>
The full code is at that link. The part of the code that actually does
the work is
>
y = (double) rand() / (RAND_MAX + 1.0); /* 0.0 <= y < 1.0 */
bin = (y < 0.5) ? 0 : 1;
y = fabs(y - 1.0); /* 0.0 < y <= 1.0 */
y = std_dev * sqrt((-2.0) * log(y));
>
return bin ? (mean + y) : (mean - y);
Your first link doesn't contain any code.

Look again. There's a link to the code he's talking about.

I looked again ; I can't find it.

Nov 19 '06 #4

Registered User

Spiros Bousbouras wrote:

Registered User wrote:
Spiros Bousbouras wrote:
Thomas Lumley wrote:
One of the examples in q13.20 in the FAQ
(http://c-faq.com/lib/sd16.html, linked from
http://c-faq.com/lib/gaussian.html) seems to be wrong.

The full code is at that link. The part of the code that actually does
the work is

y = (double) rand() / (RAND_MAX + 1.0); /* 0.0 <= y < 1.0 */
bin = (y < 0.5) ? 0 : 1;
y = fabs(y - 1.0); /* 0.0 < y <= 1.0 */
y = std_dev * sqrt((-2.0) * log(y));

return bin ? (mean + y) : (mean - y);
>
Your first link doesn't contain any code.
Look again. There's a link to the code he's talking about.

I looked again ; I can't find it.

Ok, I'll be more precise. The page at <http://c-faq.com/lib/sd16.html>
(the first link) says:

<quote>another technique, using explictly the inverse of the Normal
density function, contributed by ``Luben'' </quote>

The phrase "another technique" has got a hyperlink to
<http://c-faq.com/lib/gaussrand.luben.htmlwhich contains the code the
OP's talking about.

Nov 19 '06 #5

Ben Bacarisse

"Thomas Lumley" <th****@drizzle.netwrites:

One of the examples in q13.20 in the FAQ
(http://c-faq.com/lib/sd16.html, linked from
http://c-faq.com/lib/gaussian.html) seems to be wrong.

The full code is at that link. The part of the code that actually does
the work is

y = (double) rand() / (RAND_MAX + 1.0); /* 0.0 <= y < 1.0 */
bin = (y < 0.5) ? 0 : 1;
y = fabs(y - 1.0); /* 0.0 < y <= 1.0 */
y = std_dev * sqrt((-2.0) * log(y));

return bin ? (mean + y) : (mean - y);

Indeed. The above will not produce a distribution that is symmetric
about "mean" because the decision as to which side of the mean we
return is correlated to the size of the uniform random number y. One
needs either another random sample:

return rand() RAND_MAX/2 ? mean + y : mean - y;

or (if the your source in random bits is rather precious) the decision
must be based on a bit or bits in the original sample that are not
correlated to the size of the original y:

int r = rand();
y = (double)(r + 1) / (RAND_MAX + 1);
y = std_dev * sqrt(-2 * log(y));
return r & 1 ? mean + y : mean - y;

I can't see the advantage of the fabs. I think the above allows y to
be as small as (randomly) possible whist remaining 0. Of course,
this last version raises the question of the suitability of using the
bottom bits returned by rand() which was well thrashed out about a
year ago.

I should add that even then I don't think the result is normally
distributed (though it will at least be symmetrical).

--
Ben.

Nov 19 '06 #6

Ben Bacarisse

Ben Bacarisse <be********@bsb.me.ukwrites:

"Thomas Lumley" <th****@drizzle.netwrites:

>One of the examples in q13.20 in the FAQ
(http://c-faq.com/lib/sd16.html, linked from
http://c-faq.com/lib/gaussian.html) seems to be wrong.

The full code is at that link. The part of the code that actually does
the work is

y = (double) rand() / (RAND_MAX + 1.0); /* 0.0 <= y < 1.0 */
bin = (y < 0.5) ? 0 : 1;
y = fabs(y - 1.0); /* 0.0 < y <= 1.0 */
y = std_dev * sqrt((-2.0) * log(y));

return bin ? (mean + y) : (mean - y);

Indeed. The above will not produce a distribution that is symmetric
about "mean" because the decision as to which side of the mean we
return is correlated to the size of the uniform random number y.

I should add that even then I don't think the result is normally
distributed (though it will at least be symmetrical).

Drat! Posted by mistake. After writing, I decided that the original
linked code was so broken (this last phrase of mine is far too week --
the result is *definitely* not Gaussian) that it was best to say
nothing!

Anyway, having said more the nothing, the best thing to say is to
suggest the link be removed lest anyone try to use it. (And don't use
the above unless you want some strange, unspecified, symmetric
distribution or numbers!)

--
Ben.

Nov 19 '06 #7

CBFalconer

Ben Bacarisse wrote:

>

.... snip ...

>
or (if the your source in random bits is rather precious) the
decision must be based on a bit or bits in the original sample
that are not correlated to the size of the original y:

int r = rand();
y = (double)(r + 1) / (RAND_MAX + 1);
y = std_dev * sqrt(-2 * log(y));
return r & 1 ? mean + y : mean - y;

I can't see the advantage of the fabs. I think the above allows
y to be as small as (randomly) possible whist remaining 0. Of
course, this last version raises the question of the suitability
of using the bottom bits returned by rand() which was well
thrashed out about a year ago.

I should add that even then I don't think the result is normally
distributed (though it will at least be symmetrical).

Cfaq 13.20 covers it.

13.20: How can I generate random numbers with a normal or Gaussian
distribution?

A: Here is one method, recommended by Knuth and due originally
to Marsaglia:

#include <stdlib.h>
#include <math.h>

double gaussrand()
{
static double V1, V2, S;
static int phase = 0;
double X;

if(phase == 0) {
do {
double U1 = (double)rand() / RAND_MAX;
double U2 = (double)rand() / RAND_MAX;

V1 = 2 * U1 - 1;
V2 = 2 * U2 - 1;
S = V1 * V1 + V2 * V2;
} while(S >= 1 || S == 0);

X = V1 * sqrt(-2 * log(S) / S);
} else
X = V2 * sqrt(-2 * log(S) / S);

phase = 1 - phase;

return X;
}

See the extended versions of this list (see question 20.40)
for other ideas.

References: Knuth Sec. 3.4.1 p. 117; Marsaglia and Bray,
"A Convenient Method for Generating Normal Variables";
Press et al., _Numerical Recipes in C_ Sec. 7.2 pp.
288-290.

--
Chuck F (cbfalconer at maineline dot net)
Available for consulting/temporary embedded and systems.
<http://cbfalconer.home.att.net>

Nov 19 '06 #8

Ben Bacarisse

CBFalconer <cb********@yahoo.comwrites:

Ben Bacarisse wrote:
>>
... snip ...

>>
I should add that even then I don't think the result is normally
distributed (though it will at least be symmetrical).

Cfaq 13.20 covers it.

Well, yes. That's how I (and probably the OP) got to the faulty
linked code in the first place!

The problem is it covers it then links to "another technique" that is
wrong (or maybe is another technique for something else).

--
Ben.

Nov 19 '06 #9

Thomas Lumley

Ben Bacarisse wrote:

"Thomas Lumley" <th****@drizzle.netwrites:

One of the examples in q13.20 in the FAQ
(http://c-faq.com/lib/sd16.html, linked from
http://c-faq.com/lib/gaussian.html) seems to be wrong.

The full code is at that link. The part of the code that actually does
the work is

y = (double) rand() / (RAND_MAX + 1.0); /* 0.0 <= y < 1.0 */
bin = (y < 0.5) ? 0 : 1;
y = fabs(y - 1.0); /* 0.0 < y <= 1.0 */
y = std_dev * sqrt((-2.0) * log(y));

return bin ? (mean + y) : (mean - y);

Indeed. The above will not produce a distribution that is symmetric
about "mean" because the decision as to which side of the mean we
return is correlated to the size of the uniform random number y. One
needs either another random sample:

return rand() RAND_MAX/2 ? mean + y : mean - y;

or (if the your source in random bits is rather precious) the decision
must be based on a bit or bits in the original sample that are not
correlated to the size of the original y:

int r = rand();
y = (double)(r + 1) / (RAND_MAX + 1);
y = std_dev * sqrt(-2 * log(y));
return r & 1 ? mean + y : mean - y;

I can't see the advantage of the fabs.

My guess is that
y = fabs(2*y -1.0)
was intended. This makes bin and y independent and is
basically the same as your first suggestion.

I think the above allows y to
be as small as (randomly) possible whist remaining 0. Of course,
this last version raises the question of the suitability of using the
bottom bits returned by rand() which was well thrashed out about a
year ago.

I should add that even then I don't think the result is normally
distributed (though it will at least be symmetrical).

No, it's not remotely normal. It is bimodal, for a start.
In fact, each tail is a Weibull distribution. It also does not have the

claimed standard deviation.

The basic problem is that the original author was trying to invert
the normal density but should have been trying to invert the
normal cumulative distribution function. Inverting the CDF does
give slightly better results (less discreteness in the extreme tails)
than the two valid techniques in q13.20, but it is slower and quite a
lot more complicated.

Fortunately anyone who does any testing of the code will quickly
find out that it is wrong. The link is still unfortunate.
-thomas

Nov 19 '06 #10

Thad Smith

Thomas Lumley wrote:

One of the examples in q13.20 in the FAQ
(http://c-faq.com/lib/sd16.html, linked from
http://c-faq.com/lib/gaussian.html) seems to be wrong.

The full code is at that link. The part of the code that actually does
the work is

y = (double) rand() / (RAND_MAX + 1.0); /* 0.0 <= y < 1.0 */
bin = (y < 0.5) ? 0 : 1;
y = fabs(y - 1.0); /* 0.0 < y <= 1.0 */
y = std_dev * sqrt((-2.0) * log(y));

return bin ? (mean + y) : (mean - y);

The code is supposed to generate Gaussian random variables, but with
mean=0 and std_dev=1 the smallest number it can generate is
sqrt(-2*log(0.5)), about -1.177. The distribution is two-humped, and
so not even vaguely Gaussian.

I am not familiar with the formula for Gaussian generators, but suspect
that y = fabs(y - 1.0) should be y = fabs(y - 0.5). That would make it
single humped.

The two algorithms on http://c-faq.com/lib/gaussian.html seem to not
generate independent points. The code attributed to Abramowitz and
Stegun apparently returns identical points for two consecutive calls!
The code from Knuth has correlations between the call with phase==0 and
the following call since S is the same for both.

--
Thad

Nov 28 '06 #11

Eric Sosman

Thad Smith wrote On 11/28/06 01:21,:

>
[...]
The two algorithms on http://c-faq.com/lib/gaussian.html seem to not
generate independent points. The code attributed to Abramowitz and
Stegun apparently returns identical points for two consecutive calls!

Yes, whenever sin(2 * PI * V) == cos(2 * PI * V) ...

The code from Knuth has correlations between the call with phase==0 and
the following call since S is the same for both.

No, the values are independent (if you're willing to call
deterministically-generated pseudo-random numbers "independent").
The fact that S is identical is of no consequence; it's just a
scaling factor that normalizes the independent V1,V2. As the
FAQ suggests, "See the references for more information."

--
Er*********@sun.com

Nov 28 '06 #12

Similar topics

C/C++ code beautifier

by: Diego Andres Alvarez Marin | last post by:

Hi all! Is there a linux tool that uses as input my ugly C/C++ code and outputs a pretty and nice formated source code? Regards, Diego

C / C++

Large arrays: malloc or 'just declare'?

by: Hal Styli | last post by:

Hello, Can someone please help. I need to use a large array and I'm not sure when one should use (A) and when one should use (B) below:- #define MAX 10000000 (A) int x;

C / C++

free() is useless...

by: Matt Gostick | last post by:

Well, it's not useless, it just doesn't seem to be working for me. Basically, when I'm returning a pointer from a function, free does not seem to be freeing the memory for me. Here is an example...

C / C++

C FAQ 3.1

by: ccwork | last post by:

Hi all, In CFAQ 3.1, it says: A: Under my compiler, the code int i = 7; printf("%d\n", i++ * i++); prints 49. Regardless of the order of evaluation, shouldn't it print 56?

C / C++

Is this valid?

by: AC | last post by:

Is the following code valid? #include <stdio.h> #include <stdlib.h> #include <string.h> typedef struct { int a; char s;

C / C++

C FAQ wiki

by: Giannis Papadopoulos | last post by:

Although Steve Summit's C FAQ is really good, are there any C FAQ wikis? -- one's freedom stops where others' begin Giannis Papadopoulos http://dop.users.uth.gr/ University of Thessaly...

C / C++

Can you recommend any C programming books?

by: omar khan | last post by:

Any books that helped you progress in C programming? Where do you begin with C programming? Can you recommend any websites? How many months or years did it take you to become knowledge to...

C / C++

2000 Web Templates (Including Customizable Flash Templates)

by: Audrey Pratt | last post by:

Happy Holidays to you and allow us to play Santa this year with these awsome deals that in anyway you can refuse: 2000 Web Templates for only $18.00 (Savings Over $1,000.00) ...

C / C++

File handling question

by: Anunay | last post by:

Hi all, Suppose a text file is given and we have to write a program that will return a random line from the file. This can be easily done. But what if the text file is too big and can't fit...

C / C++

Batch import of multiple excel files into the database

by: ryjfgjl | last post by:

If we have dozens or hundreds of excel to import into the database, if we use the excel import function provided by database editors such as navicat, it will be extremely tedious and time-consuming...

Data Management

Merging data from multiple Excel files

by: ryjfgjl | last post by:

In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...

Data Management

Navigating the Data Structures and Algorithms (DSA)

by: BarryA | last post by:

What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...

Algorithms / Advanced Math

Looking to do Android software development, any suggestions? Is flutter better?

by: nemocccc | last post by:

hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?

General

How to build RAID in BIOS?

by: Hystou | last post by:

There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...

Computer Hardware

Changing the language in Windows 10

by: Hystou | last post by:

Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...

Windows Server

Problem With Comparison Operator <=> in G++

by: Oralloy | last post by:

Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...

C / C++

Maximizing Business Potential: The Nexus of Website Design and Digital Marketing

by: jinu1996 | last post by:

In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...

Online Marketing

Discussion: How does Zigbee compare with other wireless protocols in smart home applications?

by: tracyyun | last post by:

Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...

General