malloc()/realloc() - have I got this right?

Dave wrote:

>
I'm teaching myself C by working my way through Steve Summit's
tutorial (http://www.eskimo.com/~scs/cclass/cclass.html). In one
of the questions (assignment 6, exercise 7), you have to write a
function to read lines of arbitrary length from the command line,
using malloc() and realloc() to allocate the necessary memory to
hold the lines. I came up with this:

Instead of all that just get ggets.zip from:

<http://cbfalconer.home.att.net/download/ggets.zip>

and read ggets.c (less code that yours, all standard). Then decide
if you want to use it (I have put it in the public domain) or adapt
it to your purposes.

--
[mail]: Chuck F (cbfalconer at maineline dot net)
[page]: <http://cbfalconer.home.att.net>
Try the download section.

** Posted from http://www.teranews.com **

Jun 27 '08 #5

CBFalconer said:

Dave wrote:
>>
I'm teaching myself C by working my way through Steve Summit's
tutorial (http://www.eskimo.com/~scs/cclass/cclass.html). In one
of the questions (assignment 6, exercise 7), you have to write a
function to read lines of arbitrary length from the command line,
using malloc() and realloc() to allocate the necessary memory to
hold the lines. I came up with this:

Instead of all that just get ggets.zip from:

<http://cbfalconer.home.att.net/download/ggets.zip>

Be aware, if you do so, that the above routine has what some people
consider to be serious design flaws. And of course you'll learn more by
writing your own than you will by pinching someone else's.

--
Richard Heathfield <http://www.cpax.org.uk>
Email: -http://www. +rjh@
Google users: <http://www.cpax.org.uk/prg/writings/googly.php>
"Usenet is a strange place" - dmr 29 July 1999

Jun 27 '08 #6

CBFalconer <cb********@yahoo.comwrites:

Dave wrote:
>I'm teaching myself C by working my way through Steve Summit's
tutorial (http://www.eskimo.com/~scs/cclass/cclass.html). In one
of the questions (assignment 6, exercise 7), you have to write a
function to read lines of arbitrary length from the command line,
using malloc() and realloc() to allocate the necessary memory to
hold the lines. I came up with this:

Instead of all that just get ggets.zip from:

<http://cbfalconer.home.att.net/download/ggets.zip>

and read ggets.c (less code that yours, all standard). Then decide
if you want to use it (I have put it in the public domain) or adapt
it to your purposes.

Which would not serve the purpose of teaching himself C nearly as well
as implementing his own. Reading code and writing code are both
valuable skills, but they're not interchangeable.

--
Keith Thompson (The_Other_Keith) ks***@mib.org <http://www.ghoti.net/~kst>
Nokia
"We must do something. This is something. Therefore, we must do this."
-- Antony Jay and Jonathan Lynn, "Yes Minister"

Jun 27 '08 #7

Richard Heathfield wrote:

CBFalconer said:
>Dave wrote:
>>>
I'm teaching myself C by working my way through Steve Summit's
tutorial (http://www.eskimo.com/~scs/cclass/cclass.html). In one
of the questions (assignment 6, exercise 7), you have to write a
function to read lines of arbitrary length from the command line,
using malloc() and realloc() to allocate the necessary memory to
hold the lines. I came up with this:

Instead of all that just get ggets.zip from:

<http://cbfalconer.home.att.net/download/ggets.zip>

Be aware, if you do so, that the above routine has what some people
consider to be serious design flaws. And of course you'll learn
more by writing your own than you will by pinching someone else's.

Here, without the documentation, .h file, test routines, etc. is
the actual (standard C) code for ggets.c. Instead of listening to
the mewling of people who don't like to see malloc called within a
routine, read the code, and make your own decision. ggets is a
macro in ggets.h that uses the FILE stdin.

#include <stdio.h>
#include <stdlib.h>
#include "ggets.h"

#define INITSIZE 112 /* power of 2 minus 16, helps malloc */
#define DELTASIZE (INITSIZE + 16)

enum {OK = 0, NOMEM};

int fggets(char* *ln, FILE *f)
{
int cursize, ch, ix;
char *buffer, *temp;

*ln = NULL; /* default */
if (NULL == (buffer = malloc(INITSIZE))) return NOMEM;
cursize = INITSIZE;

ix = 0;
while ((EOF != (ch = getc(f))) && ('\n' != ch)) {
if (ix >= (cursize - 1)) { /* extend buffer */
cursize += DELTASIZE;
if (NULL == (temp = realloc(buffer, (size_t)cursize))) {
/* ran out of memory, return partial line */
buffer[ix] = '\0';
*ln = buffer;
return NOMEM;
}
buffer = temp;
}
buffer[ix++] = ch;
}
if ((EOF == ch) && (0 == ix)) {
free(buffer);
return EOF;
}

buffer[ix] = '\0';
if (NULL == (temp = realloc(buffer, (size_t)ix + 1))) {
*ln = buffer; /* without reducing it */
}
else *ln = temp;
return OK;
} /* fggets */
/* End of ggets.c */

--
[mail]: Chuck F (cbfalconer at maineline dot net)
[page]: <http://cbfalconer.home.att.net>
Try the download section.
** Posted from http://www.teranews.com **

Jun 27 '08 #8

CBFalconer said:

<snip>

Here, without the documentation, .h file, test routines, etc. is
the actual (standard C) code for ggets.c. Instead of listening to
the mewling of people who don't like to see malloc called within a
routine,

....of whom I am not one...

read the code, and make your own decision.

Okay.

ggets is a macro in ggets.h that uses the FILE stdin.

Fair enough, although presumably you mean FILE *.

>
#include <stdio.h>
#include <stdlib.h>
#include "ggets.h"

#define INITSIZE 112 /* power of 2 minus 16, helps malloc */

MIGHT help malloc. It depends on the implementation.

#define DELTASIZE (INITSIZE + 16)

enum {OK = 0, NOMEM};

Are those the only two failure conditions? What about end of file? Or a
stream error? Why not make it possible to report those?

int fggets(char* *ln, FILE *f)
{
int cursize, ch, ix;
char *buffer, *temp;

*ln = NULL; /* default */

The problem with this approach, as I have explained before, is that it
doesn't allow the caller to re-use an existing buffer. If they try, the
above line leaks the previous memory!

if (NULL == (buffer = malloc(INITSIZE))) return NOMEM;
cursize = INITSIZE;

ix = 0;
while ((EOF != (ch = getc(f))) && ('\n' != ch)) {
if (ix >= (cursize - 1)) { /* extend buffer */
cursize += DELTASIZE;

The problem with this approach is that it doesn't scale well. Remember that
stdin doesn't necessarily mean keyboard input. The input might, for
example, be coming from a very, very fast fixed disk. Consider changing to
an exponential approach, e.g. cursize *= 2; (or even cursize *= 1.1 or
something like that, which is fine if cursize starts with a value of at
least 10, which it does in this case).

if (NULL == (temp = realloc(buffer, (size_t)cursize))) {

The problem with this approach is that it runs a significant risk of
exhausting memory without the programmer having any control over this.
Consider accepting a parameter that specifies the largest buffer size this
call is allowed to allocate.

<snip>

These are genuine concerns. Your continued failure to address them is, of
course, entirely your decision, but I am at a loss to understand how you
can possibly recommend such a function (which you do, continually) whilst
these flaws remain.

--
Richard Heathfield <http://www.cpax.org.uk>
Email: -http://www. +rjh@
Google users: <http://www.cpax.org.uk/prg/writings/googly.php>
"Usenet is a strange place" - dmr 29 July 1999

Jun 27 '08 #9

Richard Heathfield wrote:

CBFalconer said:

.... snip ...

>>
enum {OK = 0, NOMEM};

Are those the only two failure conditions? What about end of file?
Or a stream error? Why not make it possible to report those?

You didn't read the whole routine. It also returns EOF, which is
not defined here. I did point out that this listing omitted the
documentation etc.

>
The problem with this approach, as I have explained before, is
that it doesn't allow the caller to re-use an existing buffer.
If they try, the above line leaks the previous memory!

In your opinion. Similarly, never use malloc to set a pre-existing
pointer variable. Buffer reuse would require additional
parameters, and user confusion. Avoided.

>>
ix = 0;
while ((EOF != (ch = getc(f))) && ('\n' != ch)) {
if (ix >= (cursize - 1)) { /* extend buffer */
cursize += DELTASIZE;

The problem with this approach is that it doesn't scale well.
Remember that stdin doesn't necessarily mean keyboard input. The
input might, for example, be coming from a very, very fast fixed
disk. Consider changing to an exponential approach, e.g. cursize
*= 2; (or even cursize *= 1.1 or something like that, which is
fineat if cursize starts with a value of least 10, which it does
in this case).

Which it is not intended to. It will work for any size, but is
optimized for those sizes expected as interactive input. Note that
the user can alter this part of the algorithm easily.

>
> if (NULL == (temp = realloc(buffer, (size_t)cursize))) {

The problem with this approach is that it runs a significant risk
of exhausting memory without the programmer having any control over
this. Consider accepting a parameter that specifies the largest
buffer size this call is allowed to allocate.

The point is to avoid the need for any extra parameters and
associated confusion.

>
These are genuine concerns. Your continued failure to address them
is, of course, entirely your decision, but I am at a loss to
understand how you can possibly recommend such a function (which you
do, continually) whilst these flaws remain.

Yes, similarly it is a great evil to have file open functions,
since failure to close them may cause data loss. You are confusing
faults with design objectives. All these points have been answered
before, and you are perfectly free not to use ggets.

--
[mail]: Chuck F (cbfalconer at maineline dot net)
[page]: <http://cbfalconer.home.att.net>
Try the download section.
** Posted from http://www.teranews.com **

Jun 27 '08 #10

CBFalconer said:

Richard Heathfield wrote:
>CBFalconer said:

... snip ...

>>>
enum {OK = 0, NOMEM};

Are those the only two failure conditions? What about end of file?
Or a stream error? Why not make it possible to report those?

You didn't read the whole routine. It also returns EOF, which is
not defined here. I did point out that this listing omitted the
documentation etc.

Your point is well-taken, although it does seem that you fail to
distinguish between genuine end-of-file and a stream error.

>The problem with this approach, as I have explained before, is
that it doesn't allow the caller to re-use an existing buffer.
If they try, the above line leaks the previous memory!

In your opinion.

It is true that the claim that it is a problem is a matter of opinion. It
is not a matter of opinion, however, that an attempt to re-use an existing
buffer *will* leak memory.

Similarly, never use malloc to set a pre-existing
pointer variable. Buffer reuse would require additional
parameters, and user confusion. Avoided.

Yes, it would require additional parameters. User confusion can stem from
many sources, though: for example, "why is my program leaking like a
sieve?" or (for those who've worked this out, perhaps by reading the docs)
"why can't I re-use this perfectly good buffer?"

>> ix = 0;
while ((EOF != (ch = getc(f))) && ('\n' != ch)) {
if (ix >= (cursize - 1)) { /* extend buffer */
cursize += DELTASIZE;

The problem with this approach is that it doesn't scale well.
Remember that stdin doesn't necessarily mean keyboard input. The
input might, for example, be coming from a very, very fast fixed
disk. Consider changing to an exponential approach, e.g. cursize
*= 2; (or even cursize *= 1.1 or something like that, which is
fineat if cursize starts with a value of least 10, which it does
in this case).

Which it is not intended to.

I'm puzzled. Do you mean that it's not intended to scale well?

It will work for any size,

This suggests that it /is/ expected to scale well, which it fails to do.

but is
optimized for those sizes expected as interactive input.

I disagree. If input is interactive and therefore relatively thin, asking
for a multiple of the existing memory isn't going to cause a delay any
more noticeable than that suffered when asking for a fixed additional
amount. So in fact it's not (usefully) optimised for interactive input.
Rather, it's pessimised for non-interactive input.

Note that
the user can alter this part of the algorithm easily.

If by "user" you mean "programmer", yes, a reasonably skilled programmer
can do that quite easily. But why should he or she have to, when it's so
easy for the problem to be fixed at source?

>> if (NULL == (temp = realloc(buffer, (size_t)cursize))) {

The problem with this approach is that it runs a significant risk
of exhausting memory without the programmer having any control over
this. Consider accepting a parameter that specifies the largest
buffer size this call is allowed to allocate.

The point is to avoid the need for any extra parameters and
associated confusion.

I think you under-estimate the ability of computer programmers to deal with
what is, after all, a pretty simple interface even /with/ the extra
parameters.

>These are genuine concerns. Your continued failure to address them
is, of course, entirely your decision, but I am at a loss to
understand how you can possibly recommend such a function (which you
do, continually) whilst these flaws remain.

Yes, similarly it is a great evil to have file open functions,
since failure to close them may cause data loss.

No, but it would be silly to have a function that, if passed a stream
pointer, immediately closes it just so that it can have the fun of opening
it again.

You are confusing faults with design objectives.

No, I'm saying that the design objective is flawed. For a very slight
change in the design, you can make the function considerably more useful,
but you choose not to do that.

All these points have been answered before,

To say, in effect, "tisn't" is not what I consider a sensible answer.

and you are perfectly free not to use ggets.

It is a freedom that I guard jealously, and I commend the same approach to
others until ggets is fixed to deal with the problems I've raised.

--
Richard Heathfield <http://www.cpax.org.uk>
Email: -http://www. +rjh@
Google users: <http://www.cpax.org.uk/prg/writings/googly.php>
"Usenet is a strange place" - dmr 29 July 1999

Jun 27 '08 #11

Richard Heathfield <rj*@see.sig.invalidwrites:

CBFalconer said:
>Richard Heathfield wrote:
>>CBFalconer said:

... snip ...
>>>>
enum {OK = 0, NOMEM};

Are those the only two failure conditions? What about end of file?
Or a stream error? Why not make it possible to report those?

You didn't read the whole routine. It also returns EOF, which is
not defined here. I did point out that this listing omitted the
documentation etc.

Your point is well-taken, although it does seem that you fail to
distinguish between genuine end-of-file and a stream error.

So does fgets(). That's what feof() and ferror() are for.

[snip]

--
Keith Thompson (The_Other_Keith) ks***@mib.org <http://www.ghoti.net/~kst>
Nokia
"We must do something. This is something. Therefore, we must do this."
-- Antony Jay and Jonathan Lynn, "Yes Minister"

Jun 27 '08 #12

Keith Thompson said:

Richard Heathfield <rj*@see.sig.invalidwrites:

<snip>

>Your point is well-taken, although it does seem that you fail to
distinguish between genuine end-of-file and a stream error.

So does fgets().

Yes, which is another of its faults.

That's what feof() and ferror() are for.

A recoverable fault, therefore, but still a fault.

--
Richard Heathfield <http://www.cpax.org.uk>
Email: -http://www. +rjh@
Google users: <http://www.cpax.org.uk/prg/writings/googly.php>
"Usenet is a strange place" - dmr 29 July 1999

Jun 27 '08 #13

santosh

Keith Thompson wrote:

Richard Heathfield <rj*@see.sig.invalidwrites:
>CBFalconer said:
>>Richard Heathfield wrote:
CBFalconer said:

... snip ...
>
enum {OK = 0, NOMEM};

Are those the only two failure conditions? What about end of file?
Or a stream error? Why not make it possible to report those?

You didn't read the whole routine. It also returns EOF, which is
not defined here. I did point out that this listing omitted the
documentation etc.

Your point is well-taken, although it does seem that you fail to
distinguish between genuine end-of-file and a stream error.

So does fgets(). That's what feof() and ferror() are for.

[snip]

Nothing can be done about fgets but a new function /could/ disambiguate
between these two conditions thus freeing the caller from some more
repetitive work.

Jun 27 '08 #14

Richard Heathfield <rj*@see.sig.invalidwrites:

Keith Thompson said:

>Richard Heathfield <rj*@see.sig.invalidwrites:

<snip>

>>Your point is well-taken, although it does seem that you fail to
distinguish between genuine end-of-file and a stream error.

So does fgets().

Yes, which is another of its faults.

>That's what feof() and ferror() are for.

A recoverable fault, therefore, but still a fault.

Point taken.

On the other hand, fgets()'s failure to distinguish between
end-of-file and an error isn't all *that* bad, and there's some virtue
in sticking to the model used by the standard library.

--
Keith Thompson (The_Other_Keith) ks***@mib.org <http://www.ghoti.net/~kst>
Nokia
"We must do something. This is something. Therefore, we must do this."
-- Antony Jay and Jonathan Lynn, "Yes Minister"

Jun 27 '08 #15

Flash Gordon

Keith Thompson wrote, On 30/05/08 20:00:

Richard Heathfield <rj*@see.sig.invalidwrites:
>Keith Thompson said:

>>Richard Heathfield <rj*@see.sig.invalidwrites:
<snip>

>>>Your point is well-taken, although it does seem that you fail to
distinguish between genuine end-of-file and a stream error.
So does fgets().
Yes, which is another of its faults.

>>That's what feof() and ferror() are for.
A recoverable fault, therefore, but still a fault.

Point taken.

On the other hand, fgets()'s failure to distinguish between
end-of-file and an error isn't all *that* bad, and there's some virtue
in sticking to the model used by the standard library.

I don't think using two different negative values to distinguish between
error and end-of-file would be too much of a variation since some
functions which return an int like CBFs ggets (e.g. getc) use EOF for
failure (guaranteed to be negative) and non-negative values for success.
Of course, selecting the two negative values is a bit of a pain, but...

#if EOF==-1
#define FERROR (EOF-1)
#else
#define FERROR (EOF+1)
#endif
--
Flash Gordon

Jun 27 '08 #16

santosh wrote:

Keith Thompson wrote:
>Richard Heathfield <rj*@see.sig.invalidwrites:
>>CBFalconer said:
Richard Heathfield wrote:
CBFalconer said:
>
... snip ...
>>
>enum {OK = 0, NOMEM};
>
Are those the only two failure conditions? What about end of file?
Or a stream error? Why not make it possible to report those?

You didn't read the whole routine. It also returns EOF, which is
not defined here. I did point out that this listing omitted the
documentation etc.

Your point is well-taken, although it does seem that you fail to
distinguish between genuine end-of-file and a stream error.

So does fgets(). That's what feof() and ferror() are for.

[snip]

Nothing can be done about fgets but a new function /could/
disambiguate between these two conditions thus freeing the caller
from some more repetitive work.

Ridiculous. Most use of the function simply runs until a non-zero
is returned, after which the operation ends. It may be because of
EOF, or because of error. In either case, no further input is
available. If there is a need to distinguish EOF from errors, it
can be done at that point.

This is not a matter of correctness, but of design philosophy.

--
[mail]: Chuck F (cbfalconer at maineline dot net)
[page]: <http://cbfalconer.home.att.net>
Try the download section.
** Posted from http://www.teranews.com **

Jun 27 '08 #17

Antoninus Twink

On 30 May 2008 at 21:53, CBFalconer wrote:

Ridiculous. Most use of the function simply runs until a non-zero
is returned, after which the operation ends. It may be because of
EOF, or because of error. In either case, no further input is
available.

If you're in a restaurant and can't get your meal either because they've
run out of salmon or because the kitchen's on fire, you might find it
useful to be able to distinguish between those two error conditions.

Jun 27 '08 #18

rio

"CBFalconer" <cb********@yahoo.comha scritto nel messaggio #include
<stdio.h>

#include <stdlib.h>
#include "ggets.h"

#define INITSIZE 112 /* power of 2 minus 16, helps malloc */
#define DELTASIZE (INITSIZE + 16)

enum {OK = 0, NOMEM};

#define OK 0
#define NOMEM 1
#define EOF 2
#define EOFOK 4

int fggets(char* *ln, FILE *f)
{
int cursize, ch, ix;
char *buffer, *temp;

*ln = NULL; /* default */

if(ln==0||f==0) return ERROR;

if there is an error: better segfault
but if there is a segfault: better return error
All functions have to do the 'possible' for not seg fault in them

if (NULL == (buffer = malloc(INITSIZE))) return NOMEM;
cursize = INITSIZE;

ix = 0;
while ((EOF != (ch = getc(f))) && ('\n' != ch)) {
if (ix >= (cursize - 1)) { /* extend buffer */
cursize += DELTASIZE;

"jump if "+=" overflow -error"

if (NULL == (temp = realloc(buffer, (size_t)cursize))) {
/* ran out of memory, return partial line */
buffer[ix] = '\0';
*ln = buffer;
return NOMEM;
}
buffer = temp;
}
buffer[ix++] = ch;
}
if ((EOF == ch) && (0 == ix)) {
free(buffer);
return EOF;

return ferror(f)? EOF: EOFOK;

}

buffer[ix] = '\0';
if (NULL == (temp = realloc(buffer, (size_t)ix + 1))) {
*ln = buffer; /* without reducing it */
}
else *ln = temp;
return OK;

return ch==EOF? (ferror(f)?EOF:EOFOK) : OK;
Why not signal the EOF?

} /* fggets */
/* End of ggets.c */

Jun 27 '08 #19

santosh

CBFalconer wrote:

santosh wrote:
>Keith Thompson wrote:
>>Richard Heathfield <rj*@see.sig.invalidwrites:
CBFalconer said:
Richard Heathfield wrote:
>CBFalconer said:
>>
... snip ...
>>>
>>enum {OK = 0, NOMEM};
>>
>Are those the only two failure conditions? What about end of
>file? Or a stream error? Why not make it possible to report
>those?
>
You didn't read the whole routine. It also returns EOF, which is
not defined here. I did point out that this listing omitted the
documentation etc.

Your point is well-taken, although it does seem that you fail to
distinguish between genuine end-of-file and a stream error.

So does fgets(). That's what feof() and ferror() are for.

[snip]

Nothing can be done about fgets but a new function /could/
disambiguate between these two conditions thus freeing the caller
from some more repetitive work.

Ridiculous. Most use of the function

I was not talking about ggets in particular, but instead about a line
reading function in general.

simply runs until a non-zero
is returned, after which the operation ends. It may be because of
EOF, or because of error. In either case, no further input is
available. If there is a need to distinguish EOF from errors, it
can be done at that point.

C provides standard functions to disambiguate between end-of-file and
error. The relevant functions need to be called right after fgetc has
returned EOF. I think it's mostly a matter of design whether this is
done at all, and if so, whether in the caller or callee.

I don't what's ridiculous about any of these alternatives.

This is not a matter of correctness, but of design philosophy.

Yes. So characterising an alternative design as "ridiculous" might be a
bit premature.

Jun 27 '08 #20

Antoninus Twink wrote:

On 30 May 2008 at 21:53, CBFalconer wrote:
>Ridiculous. Most use of the function simply runs until a non-zero
is returned, after which the operation ends. It may be because of
EOF, or because of error. In either case, no further input is
available.

If you're in a restaurant and can't get your meal either because
they've run out of salmon or because the kitchen's on fire, you might
find it useful to be able to distinguish between those two error
conditions.

For some reason that is beound me you elected to ignore CBF's next sentence,
which addresses exaclty that:

>>If there is a need to distinguish EOF from errors, it
can be done at that point.

In you analogy: just ask the waiter for the reason or listen to the fire
alarm.

Bye, Jojo

Jun 27 '08 #21

santosh

Joachim Schmitz wrote:

Antoninus Twink wrote:
>On 30 May 2008 at 21:53, CBFalconer wrote:
>>Ridiculous. Most use of the function simply runs until a non-zero
is returned, after which the operation ends. It may be because of
EOF, or because of error. In either case, no further input is
available.

If you're in a restaurant and can't get your meal either because
they've run out of salmon or because the kitchen's on fire, you might
find it useful to be able to distinguish between those two error
conditions.
For some reason that is beound me you elected to ignore CBF's next
sentence, which addresses exaclty that:

>>>If there is a need to distinguish EOF from errors, it
can be done at that point.

The debate was whether the caller or the callee should disambiguate
between end-of-file and error. IMHO doing it in the caller saves a
small amount of otherwise extra work in the calling code, which is
after all, the main purpose of library code. It's unclear from the
above sentence by CBFalconer whether he means the caller or the callee
when he says "at that point". One might assume from the general tone of
his reply and the use of the word "ridiculous" that he prefers this to
be done by the calling code. Either way is fine but I personally prefer
to have the line reading function do this low-level chore.

In you analogy: just ask the waiter for the reason or listen to the
fire alarm.

The analogy is flawed. The client (line reading function) has to report
the reason to someone else, (perhaps someone at his home). So should
the client ask the waiter for the reason and go home and report that,
or go home and simply say "the salmon was unavailable" and leave it to
that person to go to the restaurant and ask the waiter for the reason
why Salmon was not available?

Jun 27 '08 #22

santosh wrote:

Joachim Schmitz wrote:

>Antoninus Twink wrote:
>>On 30 May 2008 at 21:53, CBFalconer wrote:
Ridiculous. Most use of the function simply runs until a non-zero
is returned, after which the operation ends. It may be because of
EOF, or because of error. In either case, no further input is
available.

If you're in a restaurant and can't get your meal either because
they've run out of salmon or because the kitchen's on fire, you
might find it useful to be able to distinguish between those two
error conditions.
For some reason that is beound me you elected to ignore CBF's next
sentence, which addresses exaclty that:
>>>If there is a need to distinguish EOF from errors, it
can be done at that point.

The debate was whether the caller or the callee should disambiguate
between end-of-file and error. IMHO doing it in the caller saves a
small amount of otherwise extra work in the calling code, which is
after all, the main purpose of library code. It's unclear from the
above sentence by CBFalconer whether he means the caller or the callee
when he says "at that point". One might assume from the general tone
of his reply and the use of the word "ridiculous" that he prefers
this to be done by the calling code. Either way is fine but I
personally prefer to have the line reading function do this low-level
chore.

>In you analogy: just ask the waiter for the reason or listen to the
fire alarm.

The analogy is flawed. The client (line reading function) has to

s/has/may have/

report the reason to someone else, (perhaps someone at his home). So
should the client ask the waiter for the reason and go home and
report that, or go home and simply say "the salmon was unavailable"
and leave it to that person to go to the restaurant and ask the
waiter for the reason why Salmon was not available?

Either is fine and at the discretion of the client (customer).
If someone else (at home) needs to know and the client didn't bother to ask,
that someone else better uses a different client next time.

Bye, Jojo

Jun 27 '08 #23

Joachim Schmitz wrote:

santosh wrote:
>Joachim Schmitz wrote:

>>Antoninus Twink wrote:
On 30 May 2008 at 21:53, CBFalconer wrote:
Ridiculous. Most use of the function simply runs until a non-zero
is returned, after which the operation ends. It may be because of
EOF, or because of error. In either case, no further input is
available.

If you're in a restaurant and can't get your meal either because
they've run out of salmon or because the kitchen's on fire, you
might find it useful to be able to distinguish between those two
error conditions.
For some reason that is beound me you elected to ignore CBF's next
sentence, which addresses exaclty that:
If there is a need to distinguish EOF from errors, it
can be done at that point.

The debate was whether the caller or the callee should disambiguate
between end-of-file and error. IMHO doing it in the caller saves a
small amount of otherwise extra work in the calling code, which is
after all, the main purpose of library code. It's unclear from the
above sentence by CBFalconer whether he means the caller or the
callee when he says "at that point". One might assume from the
general tone of his reply and the use of the word "ridiculous" that
he prefers this to be done by the calling code. Either way is fine
but I personally prefer to have the line reading function do this
low-level chore.

>>In you analogy: just ask the waiter for the reason or listen to the
fire alarm.

The analogy is flawed. The client (line reading function) has to
s/has/may have/

>report the reason to someone else, (perhaps someone at his home). So
should the client ask the waiter for the reason and go home and
report that, or go home and simply say "the salmon was unavailable"
and leave it to that person to go to the restaurant and ask the
waiter for the reason why Salmon was not available?
Either is fine and at the discretion of the client (customer).
If someone else (at home) needs to know and the client didn't bother
to ask, that someone else better uses a different client next time.

To extend the anaoly: If I go to a restaurant to eat and nothing is
available, I may not care why, the fact that I'm still hungry will just lead
me to the next restaurant.
Or I might care to ask why no salmon ia available and it the reason is a
burning kitchen, I'd leave hungry too (and quicly), otherwise I might pick a
diferent choice from the menu. Provided it's not the salmon that I'm
longinhg for.

So there are good reasons for both designs, both with pros and cons, but
none invalid or superior to the other.

Bye, Jojo

Jun 27 '08 #24

santosh

Joachim Schmitz wrote:

Joachim Schmitz wrote:
>santosh wrote:
>>Joachim Schmitz wrote:

Antoninus Twink wrote:
On 30 May 2008 at 21:53, CBFalconer wrote:
>Ridiculous. Most use of the function simply runs until a
>non-zero
>is returned, after which the operation ends. It may be because
>of
>EOF, or because of error. In either case, no further input is
>available.
>
If you're in a restaurant and can't get your meal either because
they've run out of salmon or because the kitchen's on fire, you
might find it useful to be able to distinguish between those two
error conditions.
For some reason that is beound me you elected to ignore CBF's next
sentence, which addresses exaclty that:
>If there is a need to distinguish EOF from errors, it
>can be done at that point.

The debate was whether the caller or the callee should disambiguate
between end-of-file and error. IMHO doing it in the caller saves a
small amount of otherwise extra work in the calling code, which is
after all, the main purpose of library code. It's unclear from the
above sentence by CBFalconer whether he means the caller or the
callee when he says "at that point". One might assume from the
general tone of his reply and the use of the word "ridiculous" that
he prefers this to be done by the calling code. Either way is fine
but I personally prefer to have the line reading function do this
low-level chore.

In you analogy: just ask the waiter for the reason or listen to the
fire alarm.

The analogy is flawed. The client (line reading function) has to
s/has/may have/

>>report the reason to someone else, (perhaps someone at his home). So
should the client ask the waiter for the reason and go home and
report that, or go home and simply say "the salmon was unavailable"
and leave it to that person to go to the restaurant and ask the
waiter for the reason why Salmon was not available?
Either is fine and at the discretion of the client (customer).
If someone else (at home) needs to know and the client didn't bother
to ask, that someone else better uses a different client next time.
To extend the anaoly: If I go to a restaurant to eat and nothing is
available, I may not care why, the fact that I'm still hungry will
just lead me to the next restaurant.
Or I might care to ask why no salmon ia available and it the reason is
a burning kitchen, I'd leave hungry too (and quicly), otherwise I
might pick a diferent choice from the menu. Provided it's not the
salmon that I'm longinhg for.

So there are good reasons for both designs, both with pros and cons,
but none invalid or superior to the other.

I agree. That is why I still don't understand why CBFalconer responded
to an earlier post as "ridiculous".

Jun 27 '08 #25

santosh

Joachim Schmitz wrote:

santosh wrote:
>Joachim Schmitz wrote:

>>Antoninus Twink wrote:
On 30 May 2008 at 21:53, CBFalconer wrote:
Ridiculous. Most use of the function simply runs until a non-zero
is returned, after which the operation ends. It may be because of
EOF, or because of error. In either case, no further input is
available.

If you're in a restaurant and can't get your meal either because
they've run out of salmon or because the kitchen's on fire, you
might find it useful to be able to distinguish between those two
error conditions.
For some reason that is beound me you elected to ignore CBF's next
sentence, which addresses exaclty that:
If there is a need to distinguish EOF from errors, it
can be done at that point.

The debate was whether the caller or the callee should disambiguate
between end-of-file and error. IMHO doing it in the caller saves a
small amount of otherwise extra work in the calling code, which is
after all, the main purpose of library code. It's unclear from the
above sentence by CBFalconer whether he means the caller or the
callee when he says "at that point". One might assume from the
general tone of his reply and the use of the word "ridiculous" that
he prefers this to be done by the calling code. Either way is fine
but I personally prefer to have the line reading function do this
low-level chore.

>>In you analogy: just ask the waiter for the reason or listen to the
fire alarm.

The analogy is flawed. The client (line reading function) has to
s/has/may have/

>report the reason to someone else,

Why? If a library function /could/ fail, then returning a status code is
IMHO more or less essential. Otherwise the function becomes close to
ususable.

<snip>

Jun 27 '08 #26

santosh wrote:

Joachim Schmitz wrote:

>santosh wrote:
>>Joachim Schmitz wrote:

Antoninus Twink wrote:
On 30 May 2008 at 21:53, CBFalconer wrote:
>Ridiculous. Most use of the function simply runs until a
>non-zero is returned, after which the operation ends. It may be
>because of EOF, or because of error. In either case, no further
>input is available.
>
If you're in a restaurant and can't get your meal either because
they've run out of salmon or because the kitchen's on fire, you
might find it useful to be able to distinguish between those two
error conditions.
For some reason that is beound me you elected to ignore CBF's next
sentence, which addresses exaclty that:
>If there is a need to distinguish EOF from errors, it
>can be done at that point.

The debate was whether the caller or the callee should disambiguate
between end-of-file and error. IMHO doing it in the caller saves a
small amount of otherwise extra work in the calling code, which is
after all, the main purpose of library code. It's unclear from the
above sentence by CBFalconer whether he means the caller or the
callee when he says "at that point". One might assume from the
general tone of his reply and the use of the word "ridiculous" that
he prefers this to be done by the calling code. Either way is fine
but I personally prefer to have the line reading function do this
low-level chore.

In you analogy: just ask the waiter for the reason or listen to the
fire alarm.

The analogy is flawed. The client (line reading function) has to
s/has/may have/

>>report the reason to someone else,

Why? If a library function /could/ fail, then returning a status code
is IMHO more or less essential. Otherwise the function becomes close
to ususable.

The client, your funktion, _may_ need to report failure to someone else.
The (Standard) Library indeed has to keep the reason some place.

Bye, Jojo

Jun 27 '08 #27

santosh wrote:

Joachim Schmitz wrote:

>Joachim Schmitz wrote:
>>santosh wrote:
Joachim Schmitz wrote:

Antoninus Twink wrote:
>On 30 May 2008 at 21:53, CBFalconer wrote:
>>Ridiculous. Most use of the function simply runs until a
>>non-zero
>>is returned, after which the operation ends. It may be because
>>of
>>EOF, or because of error. In either case, no further input is
>>available.
>>
>If you're in a restaurant and can't get your meal either because
>they've run out of salmon or because the kitchen's on fire, you
>might find it useful to be able to distinguish between those two
>error conditions.
For some reason that is beound me you elected to ignore CBF's next
sentence, which addresses exaclty that:
>>If there is a need to distinguish EOF from errors, it
>>can be done at that point.

The debate was whether the caller or the callee should disambiguate
between end-of-file and error. IMHO doing it in the caller saves a
small amount of otherwise extra work in the calling code, which is
after all, the main purpose of library code. It's unclear from the
above sentence by CBFalconer whether he means the caller or the
callee when he says "at that point". One might assume from the
general tone of his reply and the use of the word "ridiculous" that
he prefers this to be done by the calling code. Either way is fine
but I personally prefer to have the line reading function do this
low-level chore.

In you analogy: just ask the waiter for the reason or listen to
the fire alarm.

The analogy is flawed. The client (line reading function) has to
s/has/may have/

report the reason to someone else, (perhaps someone at his home).
So should the client ask the waiter for the reason and go home and
report that, or go home and simply say "the salmon was unavailable"
and leave it to that person to go to the restaurant and ask the
waiter for the reason why Salmon was not available?
Either is fine and at the discretion of the client (customer).
If someone else (at home) needs to know and the client didn't bother
to ask, that someone else better uses a different client next time.
To extend the anaoly: If I go to a restaurant to eat and nothing is
available, I may not care why, the fact that I'm still hungry will
just lead me to the next restaurant.
Or I might care to ask why no salmon ia available and it the reason
is a burning kitchen, I'd leave hungry too (and quicly), otherwise I
might pick a diferent choice from the menu. Provided it's not the
salmon that I'm longinhg for.

So there are good reasons for both designs, both with pros and cons,
but none invalid or superior to the other.

I agree. That is why I still don't understand why CBFalconer responded
to an earlier post as "ridiculous".

Indeed. Calling another ones opinion or needs ridiculous is just that...

Bye, Jojo

Jun 27 '08 #28

santosh said:

Joachim Schmitz wrote:

<snip>

>>
So there are good reasons for both designs, both with pros and cons,
but none invalid or superior to the other.

I agree. That is why I still don't understand why CBFalconer responded
to an earlier post as "ridiculous".

For a considerable number of months, Chuck's responses have been degrading
in quality. Recently, for example, he suggested that strspn is a
non-standard function - which, as any self-respecting C programmer ought
to know, really /is/ ridiculous.

He still gets stuff right occasionally, and I even think it's still
possible to hold a reasonable discussion with him, but it's getting more
difficult all the time.

--
Richard Heathfield <http://www.cpax.org.uk>
Email: -http://www. +rjh@
Google users: <http://www.cpax.org.uk/prg/writings/googly.php>
"Usenet is a strange place" - dmr 29 July 1999

Jun 27 '08 #29

Richard Tobin

In article <sl*******************@nospam.invalid>,
Antoninus Twink <no****@nospam.invalidwrote:

>Ridiculous. Most use of the function simply runs until a non-zero
is returned, after which the operation ends. It may be because of
EOF, or because of error. In either case, no further input is
available.

>If you're in a restaurant and can't get your meal either because they've
run out of salmon or because the kitchen's on fire, you might find it
useful to be able to distinguish between those two error conditions.

On the other hand, I've been to hundreds of restaurants and have on
several occasions had to make a new choice because they'd run out of
what I'd ordered, but I've never had the problem that their kitchen
was on fire. It's not a possibility that I take into account when
choosing restaurants. Similarly, for some purposes the possibility of
an i/o error while reading is so insignificant that it can just be
ignored - just as you typically take no precautions to guard against
out-of-stack errors (which in my experience are much more common).

Obviously a program which, say, deletes the original file after
processing it should distinguish between an i/o error and EOF.
But not all programs are like that, and typically an i/o error
will produce a user-visible indication.

Errors when writing, rather than reading, are much more common because
they can be caused by a full disk.

-- Richard
--
In the selection of the two characters immediately succeeding the numeral 9,
consideration shall be given to their replacement by the graphics 10 and 11 to
facilitate the adoption of the code in the sterling monetary area. (X3.4-1963)

Jun 27 '08 #30

santosh wrote:

Joachim Schmitz wrote:

.... snip ...

>
>So there are good reasons for both designs, both with pros and
cons, but none invalid or superior to the other.

I agree. That is why I still don't understand why CBFalconer
responded to an earlier post as "ridiculous".

Because complicating a simple routine to allow for 'everything' is
ridiculous, especially when no impediment has been placed in the
way of handling that 'everything' when needed. Also, this is a
binary world, so many things go on the 'ridiculous' side to me.
:-)

--
[mail]: Chuck F (cbfalconer at maineline dot net)
[page]: <http://cbfalconer.home.att.net>
Try the download section.
** Posted from http://www.teranews.com **

Jun 27 '08 #31

Flash Gordon

rio wrote, On 31/05/08 06:51:

"CBFalconer" <cb********@yahoo.comha scritto nel messaggio #include
<stdio.h>
>#include <stdlib.h>
#include "ggets.h"

#define INITSIZE 112 /* power of 2 minus 16, helps malloc */
#define DELTASIZE (INITSIZE + 16)

enum {OK = 0, NOMEM};

#define OK 0
#define NOMEM 1
#define EOF 2

Rather problematic seeing as stdlib.h already defines EOF

#define EOFOK 4

>int fggets(char* *ln, FILE *f)
{
int cursize, ch, ix;
char *buffer, *temp;

*ln = NULL; /* default */

if(ln==0||f==0) return ERROR;

if there is an error: better segfault
but if there is a segfault: better return error

The above two statements seem to directly contradict each other so I'm
not sure what you are trying to say.

<snip>
--
Flash Gordon

Jun 27 '08 #32

rio

"Flash Gordon" <sp**@flash-gordon.me.ukha scritto nel messaggio
news:qe************@news.flash-gordon.me.uk...

rio wrote, On 31/05/08 06:51:
>"CBFalconer" <cb********@yahoo.comha scritto nel messaggio #include
if there is an error: better segfault
but if there is a segfault: better return error

The above two statements seem to directly contradict each other so I'm not
sure what you are trying to say.

if there is an data error in a function is better
segfault the function
but
is *much better* [than seg fault] the function return an error value

<snip>
--
Flash Gordon

Jun 27 '08 #33

rio wrote:

"Flash Gordon" <sp**@flash-gordon.me.ukha scritto nel messaggio
news:qe************@news.flash-gordon.me.uk...
>rio wrote, On 31/05/08 06:51:
>>"CBFalconer" <cb********@yahoo.comha scritto nel messaggio >
#include if there is an error: better segfault
but if there is a segfault: better return error

The above two statements seem to directly contradict each other so
I'm not sure what you are trying to say.

if there is an data error in a function is better
segfault the function
but
is *much better* [than seg fault] the function return an error value

A segvault might be better han continue processing with bogus data but
better than a segvault is an assert(), as it would easily tell you in which
file and line the error occured, rather than having you to run it thru a
debugger to find some stack trace. Also0 a segvault might not give you what
you need to debug, e.g. if the stack itself got corrupted.
But still a segvault as well as a failed assertion is a bug in the program
IMHO.

Bye, Jojo

Jun 27 '08 #34

Richard

santosh <sa*********@gmail.comwrites:

CBFalconer wrote:

>santosh wrote:
>>Keith Thompson wrote:
Richard Heathfield <rj*@see.sig.invalidwrites:
CBFalconer said:
>Richard Heathfield wrote:
>>CBFalconer said:
>>>
>... snip ...
>>>>
>>>enum {OK = 0, NOMEM};
>>>
>>Are those the only two failure conditions? What about end of
>>file? Or a stream error? Why not make it possible to report
>>those?
>>
>You didn't read the whole routine. It also returns EOF, which is
>not defined here. I did point out that this listing omitted the
>documentation etc.
>
Your point is well-taken, although it does seem that you fail to
distinguish between genuine end-of-file and a stream error.

So does fgets(). That's what feof() and ferror() are for.

[snip]

Nothing can be done about fgets but a new function /could/
disambiguate between these two conditions thus freeing the caller
from some more repetitive work.

Ridiculous. Most use of the function

I was not talking about ggets in particular, but instead about a line
reading function in general.

>simply runs until a non-zero
is returned, after which the operation ends. It may be because of
EOF, or because of error. In either case, no further input is
available. If there is a need to distinguish EOF from errors, it
can be done at that point.

C provides standard functions to disambiguate between end-of-file and
error. The relevant functions need to be called right after fgetc has
returned EOF. I think it's mostly a matter of design whether this is
done at all, and if so, whether in the caller or callee.

I don't what's ridiculous about any of these alternatives.

>This is not a matter of correctness, but of design philosophy.

Yes. So characterising an alternative design as "ridiculous" might be a
bit premature.

You must remember that "Chuck" is a world authority on everything and
has publicly stated that it would hard to find "better" than his
libraries...

Jun 27 '08 #35

santosh

CBFalconer wrote:

santosh wrote:
>Joachim Schmitz wrote:

... snip ...
>>
>>So there are good reasons for both designs, both with pros and
cons, but none invalid or superior to the other.

I agree. That is why I still don't understand why CBFalconer
responded to an earlier post as "ridiculous".

Because complicating a simple routine to allow for 'everything' is
ridiculous,

Actually the function only has to return one additional status code, but
yes, point taken.

especially when no impediment has been placed in the
way of handling that 'everything' when needed. Also, this is a
binary world, so many things go on the 'ridiculous' side to me.
:-)

I noticed.

Jun 27 '08 #36

santosh

Joachim Schmitz wrote:

<snip>

A segvault might be better han continue processing with bogus data but
better than a segvault is an assert(), as it would easily tell you in
which file and line the error occured, rather than having you to run
it thru a debugger to find some stack trace.

Also behaviour on an assertion failure is defined by the standard while
a segfault is not guaranteed to occur on all platforms.

<snip>

Jun 27 '08 #37

santosh <sa*********@gmail.comwrites:

Joachim Schmitz wrote:

[...]

>In you analogy: just ask the waiter for the reason or listen to the
fire alarm.

The analogy is flawed. The client (line reading function) has to report
the reason to someone else, (perhaps someone at his home). So should
the client ask the waiter for the reason and go home and report that,
or go home and simply say "the salmon was unavailable" and leave it to
that person to go to the restaurant and ask the waiter for the reason
why Salmon was not available?

Speaking of flawed analogies ...

I don't think the difference is all that big a deal. You're not going
home, then going back to the restaurant to ask whether it's on fire.
It's just a difference between (a) getting a result back from the
input function that tells you whether and how it failed, or (b)
getting a result back from the input function that tells you it
failed, then calling another (presumably very cheap) function to tell
you why.

(On the other hand, separating the error determination this way does
make it too easy to ignore error conditions. The most straightforward
idioms for reading from a file just read until they can't, then stop.)

--
Keith Thompson (The_Other_Keith) ks***@mib.org <http://www.ghoti.net/~kst>
Nokia
"We must do something. This is something. Therefore, we must do this."
-- Antony Jay and Jonathan Lynn, "Yes Minister"

Jun 27 '08 #38

"rio" <a@b.cwrites:

"Flash Gordon" <sp**@flash-gordon.me.ukha scritto nel messaggio
news:qe************@news.flash-gordon.me.uk...
>rio wrote, On 31/05/08 06:51:
>>"CBFalconer" <cb********@yahoo.comha scritto nel messaggio #include
if there is an error: better segfault
but if there is a segfault: better return error

The above two statements seem to directly contradict each other so I'm not
sure what you are trying to say.

if there is an data error in a function is better
segfault the function
but
is *much better* [than seg fault] the function return an error value

"rio", are you the same person who previously posted as
"RoSsIaCrIiLoIA" and as "av <av@ala.a>"?

--
Keith Thompson (The_Other_Keith) ks***@mib.org <http://www.ghoti.net/~kst>
Nokia
"We must do something. This is something. Therefore, we must do this."
-- Antony Jay and Jonathan Lynn, "Yes Minister"

Jun 27 '08 #39

Keith Thompson wrote:

>
"rio" <a@b.cwrites:

.... snip ...

>
>if there is an data error in a function is better segfault the
function but is *much better* [than seg fault] the function
return an error value

"rio", are you the same person who previously posted as
"RoSsIaCrIiLoIA" and as "av <av@ala.a>"?

I doubt it. He isn't pushing stupid obfuscative macros.

--
[mail]: Chuck F (cbfalconer at maineline dot net)
[page]: <http://cbfalconer.home.att.net>
Try the download section.
** Posted from http://www.teranews.com **

Jun 27 '08 #40