Matrix storage structure

Duncan M Gunn

Hi,

I need to store the following matrix of values:

T U V
A 2 1 1
B 2 - -
C - - 2

Where (-) is an empty cell.

The obvious solution is to store it as a 2D array, but I would like to store
this as efficiently as possible.
For example, the Matrix object need only store the cells that have a value,
if asked for a cell which is not assigned it can return a default empty
value.

I would also prefer to be able to access it by using only one index, e.g. if
only the y value is known then return a list of cells with that y value.

This could be achieved using a 1D array, where a given cell could be
accessed by multiplying the given x or y value by the respective number of
columns or rows. However, it would still require an array of x * y
elements, even when half of these elements are not required. I presume that
a completely empty x * y array of MyObject would take up x * y *
SizeOf(MyObject) space.

Does anyone know of an efficient data structure for a matrix?

Thanks,
Duncan

Nov 16 '05 #1

Subscribe Post Reply

8689

Dan Bass

Duncan,

An ArrayList of ArrayList's of strings would be an efficient structure.
If there are no values in certain cells, the 2nd ArrayList would simply
reference a null. This is the most minimul an empty cell can be.
It would be a good idea to wrap this in a class.

Daniel.

"Duncan M Gunn" <gu***@gaelqualityNOSPAM.co.uk> wrote in message
news:uS**************@TK2MSFTNGP15.phx.gbl...

Hi,

I need to store the following matrix of values:

T U V
A 2 1 1
B 2 - -
C - - 2

Where (-) is an empty cell.

The obvious solution is to store it as a 2D array, but I would like to
store this as efficiently as possible.
For example, the Matrix object need only store the cells that have a
value, if asked for a cell which is not assigned it can return a default
empty value.

I would also prefer to be able to access it by using only one index, e.g.
if only the y value is known then return a list of cells with that y
value.

This could be achieved using a 1D array, where a given cell could be
accessed by multiplying the given x or y value by the respective number of
columns or rows. However, it would still require an array of x * y
elements, even when half of these elements are not required. I presume
that a completely empty x * y array of MyObject would take up x * y *
SizeOf(MyObject) space.

Does anyone know of an efficient data structure for a matrix?

Thanks,
Duncan

Nov 16 '05 #2

Marcus

Duncan,

Take a look at sparse matrix data structures. This link gives a nice
intro to them http://www.math.uu.nl/people/bisselin/PSC/psc4_2.pdf
It only make senses to use a sparse matrix if more than half your data
is empty.

Marcus

Duncan M Gunn wrote:

Hi,

I need to store the following matrix of values:

T U V
A 2 1 1
B 2 - -
C - - 2

Where (-) is an empty cell.

The obvious solution is to store it as a 2D array, but I would like to store
this as efficiently as possible.
For example, the Matrix object need only store the cells that have a value,
if asked for a cell which is not assigned it can return a default empty
value.

I would also prefer to be able to access it by using only one index, e.g. if
only the y value is known then return a list of cells with that y value.

This could be achieved using a 1D array, where a given cell could be
accessed by multiplying the given x or y value by the respective number of
columns or rows. However, it would still require an array of x * y
elements, even when half of these elements are not required. I presume that
a completely empty x * y array of MyObject would take up x * y *
SizeOf(MyObject) space.

Does anyone know of an efficient data structure for a matrix?

Thanks,
Duncan

Nov 16 '05 #3

Peter N Roth

Efficiency is a tricky beast. In this case, you are making
a tradeoff between storage used and computation
time to access an element. If you have a lot of cheap
memory, you might find 2D storage a winner after all.

--
Grace + Peace,
Peter N Roth
Engineering Objects International
http://engineeringobjects.com
Home of Matrix.NET
"Duncan M Gunn" <gu***@gaelqualityNOSPAM.co.uk> wrote in message
news:uS**************@TK2MSFTNGP15.phx.gbl...

Hi,

I need to store the following matrix of values:

T U V
A 2 1 1
B 2 - -
C - - 2

Where (-) is an empty cell.

The obvious solution is to store it as a 2D array, but I would like to
store this as efficiently as possible.
For example, the Matrix object need only store the cells that have a
value, if asked for a cell which is not assigned it can return a default
empty value.

I would also prefer to be able to access it by using only one index, e.g.
if only the y value is known then return a list of cells with that y
value.

This could be achieved using a 1D array, where a given cell could be
accessed by multiplying the given x or y value by the respective number of
columns or rows. However, it would still require an array of x * y
elements, even when half of these elements are not required. I presume
that a completely empty x * y array of MyObject would take up x * y *
SizeOf(MyObject) space.

Does anyone know of an efficient data structure for a matrix?

Thanks,
Duncan

Nov 16 '05 #4

Bruce Wood

Peter is right. The first question you have to ask is, "How big do I
expect this thing to be?"

Any sparse-matrix data structure will come with two penalties: a CPU
time hit to look up elements (_nothing_ is faster than a 2D array) and
memory overhead when storing the elements that are there.

Marcus was correct in saying that sparse data structures make sense
only if less than half your data is there, but he forgot to mention
that they also make sense only if you have lots and lots of data. If
you have only, say, a 20x20 half-full array, even of 64-bit values,
then the 12K you're going to save by not storing the empty values can
easily be eaten up (and more) by the overhead needed to maintain a more
complex data structure.

As the size of your matrix mounts, sparse array data structures look
more and more appealing. For example, those same 64-bit values in a
100x100 half-full array are now wasting 320K, which probably wouldn't
be eaten up by overhead in a sparse structure.

Just be sure that by using an alternate data structure you are not, in
fact, creating something even bigger than the original 2D array would
have been. :)

Nov 16 '05 #5

Marcus

> Peter is right. The first question you have to ask is, "How big do I

expect this thing to be?" Yep, that shoudl be the first question.
(_nothing_ is faster than a 2D array) In my experience, accessing elements in matrix that is represented by a
1D array is usual faster than a 2D array C#. (comparing matrix[i*cols+j]
to matrix[i,j] or matrix[i][j]).

If you can exploit the fact that most elements are null or zero
(depending the data type), sparse matrices can outperform dense matrices
(so they can be faster and use less memory). For a trivial example, sum
all elements of a data matrix.

Bruce Wood wrote: Peter is right. The first question you have to ask is, "How big do I
expect this thing to be?"

Any sparse-matrix data structure will come with two penalties: a CPU
time hit to look up elements (_nothing_ is faster than a 2D array) and
memory overhead when storing the elements that are there.

Marcus was correct in saying that sparse data structures make sense
only if less than half your data is there, but he forgot to mention
that they also make sense only if you have lots and lots of data. If
you have only, say, a 20x20 half-full array, even of 64-bit values,
then the 12K you're going to save by not storing the empty values can
easily be eaten up (and more) by the overhead needed to maintain a more
complex data structure.

As the size of your matrix mounts, sparse array data structures look
more and more appealing. For example, those same 64-bit values in a
100x100 half-full array are now wasting 320K, which probably wouldn't
be eaten up by overhead in a sparse structure.

Just be sure that by using an alternate data structure you are not, in
fact, creating something even bigger than the original 2D array would
have been. :)

Nov 16 '05 #6

Bruce Wood

Yes, that's true. I was being sloppy. I should have said, "Finding an
specific element is fastest in a 2D array." Other operations such as
summing the elements may indeed be faster in a sparse structure.

I find it odd that a 1D array would "outperform" a 2D array. I would
expect them to perform exactly the same, as most languages use the 1D
implementation under the covers, so usually it's just a difference
between the language doing the math behind the scenes or you doing the
math in code. That experience, however, comes from C, so .NET may be
different.

Nov 16 '05 #7

Duncan M Gunn

Thanks for all the help. I need to do a bit of investigation to find out
likely data sizes and the number of zero entries that are likely, before I
decide on a storage structure. It seems unlikely I would gain any advantage
on computing summary values (this will already have been done) so therefore
space is my only other consideration.

Does anyone know if there exists a public domain C# (or C++/C)
implementation of a sparse matrix structure I could use?

"Bruce Wood" <br*******@canada.com> wrote in message
news:11**********************@z14g2000cwz.googlegr oups.com...

Yes, that's true. I was being sloppy. I should have said, "Finding an
specific element is fastest in a 2D array." Other operations such as
summing the elements may indeed be faster in a sparse structure.

I find it odd that a 1D array would "outperform" a 2D array. I would
expect them to perform exactly the same, as most languages use the 1D
implementation under the covers, so usually it's just a difference
between the language doing the math behind the scenes or you doing the
math in code. That experience, however, comes from C, so .NET may be
different.

Nov 16 '05 #8

Dan Bass

This shouldn't be hard to port:
http://www.codeproject.com/cpp/sparse_matrices.asp

"Duncan M Gunn" <gu***@gaelqualityNOSPAM.co.uk> wrote in message
news:e0**************@TK2MSFTNGP09.phx.gbl...

Thanks for all the help. I need to do a bit of investigation to find out
likely data sizes and the number of zero entries that are likely, before I
decide on a storage structure. It seems unlikely I would gain any
advantage on computing summary values (this will already have been done)
so therefore space is my only other consideration.

Does anyone know if there exists a public domain C# (or C++/C)
implementation of a sparse matrix structure I could use?

"Bruce Wood" <br*******@canada.com> wrote in message
news:11**********************@z14g2000cwz.googlegr oups.com...
Yes, that's true. I was being sloppy. I should have said, "Finding an
specific element is fastest in a 2D array." Other operations such as
summing the elements may indeed be faster in a sparse structure.

I find it odd that a 1D array would "outperform" a 2D array. I would
expect them to perform exactly the same, as most languages use the 1D
implementation under the covers, so usually it's just a difference
between the language doing the math behind the scenes or you doing the
math in code. That experience, however, comes from C, so .NET may be
different.

Nov 16 '05 #9

Marcus

check out http://nmath.sourceforge.net/wiki/index.php/Main_Page

Duncan M Gunn wrote:

Thanks for all the help. I need to do a bit of investigation to find out
likely data sizes and the number of zero entries that are likely, before I
decide on a storage structure. It seems unlikely I would gain any advantage
on computing summary values (this will already have been done) so therefore
space is my only other consideration.

Does anyone know if there exists a public domain C# (or C++/C)
implementation of a sparse matrix structure I could use?

"Bruce Wood" <br*******@canada.com> wrote in message
news:11**********************@z14g2000cwz.googlegr oups.com...
Yes, that's true. I was being sloppy. I should have said, "Finding an
specific element is fastest in a 2D array." Other operations such as
summing the elements may indeed be faster in a sparse structure.

I find it odd that a 1D array would "outperform" a 2D array. I would
expect them to perform exactly the same, as most languages use the 1D
implementation under the covers, so usually it's just a difference
between the language doing the math behind the scenes or you doing the
math in code. That experience, however, comes from C, so .NET may be
different.

Nov 16 '05 #10

Duncan M Gunn

Thanks for your help, this will give me a good starting point if we require
to use such a structure.

Nov 16 '05 #11

by: Web Science | last post by:

Site and Features: http://www.eigensearch.com Search engine, eigenMethod, eigenvector, mathematical, manifolds, science, technical, search tools, eigenmath, Jacobian, quantum, mechanics,...

HTML / CSS

41585 Mining the Web: Jacobian Matrix Constructs with eigenVectorSearching 41585

by: Web Science | last post by:

Site and Features: http://www.eigensearch.com Search engine, eigenMethod, eigenvector, mathematical, manifolds, science, technical, search tools, eigenmath, Jacobian, quantum, mechanics,...

Microsoft Access / VBA

Sprase matrix

by: nestini | last post by:

Hello everybody, I am student who just begin learning about programming I want to know what is Sprase Matrix. And would you please show me how to add the 2 Sprase Matrix in C soursecode.

C / C++

Returning pointers to different items and memory storage

by: CptDondo | last post by:

I am working on an embedded platform which has a block of battery-backed RAM. I need to store various types of data in this block of memory - for example, bitmapped data for control registers,...

C / C++

05154 Mining the Web: Jacobian Matrix Constructs with eigenVectorSearching 05154

by: Web Science | last post by:

Site and Features: http://www.eigensearch.com Search engine, eigenMethod, eigenvector, mathematical, manifolds, science, technical, search tools, eigenmath, Jacobian, quantum, mechanics,...

.NET Framework

Have data to fill a triangular matrix, but how do I graph it???? - Please help

by: Nevets Steprock | last post by:

I'm writing a web program where one of the sections is supposed to output a correlation matrix. The typical correlation matrix looks like this: ..23 ..34 .54 ..76 .44 .28 ..02 .77 ...

ASP.NET

Sparse Matrix implemented as a doubly-linked list

by: adam.kleinbaum | last post by:

Hi there, I'm a novice C programmer working with a series of large (30,000 x 30,000) sparse matrices on a Linux system using the GCC compiler. To represent and store these matrices, I'd like to...

C / C++

Recover data from a bitmap and filling a matrix

by: Marco Biagioni | last post by:

After i've tried to update a vb 6.0 project to vb.net, using visual studio utility,i can't read correctly data bytes from a .bmp file to insert them in a matrix to operate on. Using vb 6.0 the...

.NET Framework

Matrix Inversion Algorithm or Memory Storage Issue?

by: PeteJ | last post by:

Hello all.. I wrote code in C to invert n x n matrices, where 1<n<39 and the matrices are guaranteed to be invertible. Prior to this task, I haven't really seen linear algebra since my sophmore...

C / C++

Easy Steps to Fix "Canon Printer Won't Connect to WiFi Network"

by: taylorcarr | last post by:

A Canon printer is a smart device known for being advanced, efficient, and reliable. It is designed for home, office, and hybrid workspace use and can also be used for a variety of purposes. However,...

General

How to turn on java script in a villaon keypad mobile phone

by: Charles Arthur | last post by:

How do i turn on java script on a villaon, callus and itel keypad mobile phone

Java

Batch import of multiple excel files into the database

by: ryjfgjl | last post by:

If we have dozens or hundreds of excel to import into the database, if we use the excel import function provided by database editors such as navicat, it will be extremely tedious and time-consuming...

Data Management

Merging data from multiple Excel files

by: ryjfgjl | last post by:

In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...

Data Management

Navigating the Data Structures and Algorithms (DSA)

by: BarryA | last post by:

What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...

Algorithms / Advanced Math

Looking to do Android software development, any suggestions? Is flutter better?

by: nemocccc | last post by:

hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?

General

Is that possible of reading the .csv file in column wise and the column have different lengths ?

by: Sonnysonu | last post by:

This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...

C / C++

What is ONU?

by: marktang | last post by:

ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...

General

Problem With Comparison Operator <=> in G++

by: Oralloy | last post by:

Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...

C / C++

Matrix storage structure

Similar topics