Bytes IT Community

code using more memory than required

Hi all,
I am trying to write code for my experiments. My work involves huge
datasets, and as such my code needs to be memory efficient. I did some
hand calculations regarding the amount of dynamic memory required for
my data structures (I used sizeof() to get the size of the structures).
But the mallinfo() function shows that approximately double that amount
of memory is being allocated. I am working with gcc 3.2.2 on a Red Hat 9
machine. I come from an electrical engineering background and don't know
much about memory allocation or overheads. Can anyone please
enlighten me?
Thanks in advance.
Nov 13 '05 #1
3 Replies


so****@lycos.com (Sourin) wrote:
# [snip: question quoted above]

There's no one storage allocator for unix. You get one free with libc, but different
libc implementations can have different allocation strategies. Some implementations
will round the allocated block size up to a conveniently alignable size, say 8 bytes.
Others allocate a page at a time where all blocks on the page are the same size;
a different page (perhaps 4096 bytes at a time) will be allocated for every different
block size. The buddy system rounds the block size up to the next power of two.
Some allocators need a linked-list overhead hidden from the caller; some provide
overflow and underflow zones around each block.

With virtual memory, you can often afford less efficient use of the address space
to get faster code. If you're only going to use unix, you can bypass all other
allocation libraries and write your own with the kernel functions brk()
and sbrk().

--
Derk Gwen http://derkgwen.250free.com/html/index.html
I have no respect for people with no shopping agenda.
Nov 13 '05 #2

On Mon, 25 Aug 2003 01:56:36 -0700, Sourin wrote:
Hi all, [snip: program using more memory than expected] Thanks in advance.


OK, this can happen with any compiler, not just gcc:

The compiler may decide to optimize. Say you have a struct of size 9. Then
you create an array of that struct. Now the compiler may know that the
processor works faster if it accesses data at every 4th byte. So instead
of packing your structs end to end you get 3 bytes unused between them.

Basically it's a speed / memory tradeoff.

Read your compiler's documentation for ways to force it to pack the data
together without padding with unused bytes.

(for gcc, look for the packed attribute: __attribute__((packed)))

Be aware that this may make the memory access slower making your program
run slower. Maybe you can rethink your design to avoid the problem: do you
really need to have all the data in memory or can you proces it piece by
piece?

hth
NPV
Nov 13 '05 #3

Sourin wrote:

[snip: question quoted above]


Others have already mentioned the possibility that some
of your struct types contain padding, but nobody so far has
suggested what you might do about it. One way to get rid of
the padding is to use non-Standard language extensions to tell
the compiler not to pad certain struct types; many compilers
support such extensions, although the details differ from one
compiler to the next. Unfortunately, this usually results in
slower and/or larger code; the padding existed for a reason,
and eliminating it exacts a penalty.

Another approach, if you've got large arrays of these
padded structs, is to break the structs apart and "factor" the
data into parallel arrays. For example, instead of

    struct {
        short shrift;
        /* padding likely here */
        double trouble;
    } array[10000];

... you could write

    short shrift[10000];
    double trouble[10000];

... and the ten thousand padding slots simply vanish. Of
course, you've sacrificed some notational and mnemonic
convenience -- it's obvious that array[42].shrift and
array[42].trouble are somehow related, but in the revised
program the relationship between shrift[42] and trouble[42]
is less apparent. Functions like qsort() won't work with
this scheme, for example, because they don't know about
the relationship between these parallel arrays.

One other possible contributor to greater-than-expected
memory consumption is the overhead of keeping track of each
allocated area. malloc() implementations differ, but most
will add somewhere between four and sixteen bytes to each
allocation, in addition to rounding the total up to a nice
multiple of (usually) four, eight, or sixteen bytes. In a
malloc(1) call, the overhead due to bookkeeping and rounding
will probably be much greater than the one-byte "payload,"
and if you make a hundred thousand such calls, most of your
program's memory will be tied up in this useless (to you)
overhead. malloc(1) is an extreme case, but the effect can
be significant even for moderate-sized structs -- an eight-
byte struct may incur eight bytes of overhead, for only
fifty percent "efficiency."

If you are dealing with many such smallish structs, a
way to reduce the bookkeeping and rounding overhead is to
allocate a whole block of them at once and dole them out
individually, thus incurring only one chunk of overhead
for the whole batch instead of one for each struct:

    struct thing *new_thing(void) {
        static struct thing *avail;
        static int howmany = 0;
        if (howmany == 0) {
            howmany = 1000;
            avail = malloc(howmany * sizeof *avail);
            if (avail == NULL)
                return NULL; /* out of memory */
        }
        return &avail[--howmany];
    }

The disadvantage of this approach is that you can no longer
free() or realloc() the individual structs; you can only do
so on an entire batch at once. Depending on the application,
this may or may not be a problem.

--
Er*********@sun.com
Nov 13 '05 #4

This discussion thread is closed