473,386 Members | 2,042 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,386 software developers and data experts.

table size/record limit

I am designing something that may be the size of yahoo, google, ebay, etc.

Just ONE many to many table could possibly have the following
characteristics:

3,600,000,000 records
each record is 9 fields of INT4/DATE

Other tables will have about 5 million records of about the same size.

There are lots of scenarios here to lessson this.

BUT, is postgres on linux, maybe necessarily a 64 bit system, cabable of
this? And there'd be 4-5 indexes on that table.

---------------------------(end of broadcast)---------------------------
TIP 4: Don't 'kill -9' the postmaster

Nov 23 '05 #1
9 6015
Hi,

Am Do, den 21.10.2004 schrieb Dennis Gearon um 1:30:
I am designing something that may be the size of yahoo, google, ebay, etc.

Just ONE many to many table could possibly have the following
characteristics:

3,600,000,000 records
each record is 9 fields of INT4/DATE

Other tables will have about 5 million records of about the same size.

There are lots of scenarios here to lessson this.

BUT, is postgres on linux, maybe necessarily a 64 bit system, cabable of
this? And there'd be 4-5 indexes on that table.


Sure. Why not? 3...5mio records is not really a problem.
We had bigger tables with historic commercial transactions
(even on an old dual PIII/1000) with fine performance.
I bet however, yahoo, google at least are much bigger :-)

Regards
Tino

---------------------------(end of broadcast)---------------------------
TIP 6: Have you searched our list archives?

http://archives.postgresql.org

Nov 23 '05 #2
Google probably is much bigger, and on mainframes, and probably Oracle or DB2.

But the table I am worried about is the one sized >= 3.6 GIGA records.

Tino Wildenhain wrote:
Hi,

Am Do, den 21.10.2004 schrieb Dennis Gearon um 1:30:
I am designing something that may be the size of yahoo, google, ebay, etc.

Just ONE many to many table could possibly have the following
characteristics:

3,600,000,000 records
each record is 9 fields of INT4/DATE

Other tables will have about 5 million records of about the same size.

There are lots of scenarios here to lessson this.

BUT, is postgres on linux, maybe necessarily a 64 bit system, cabable of
this? And there'd be 4-5 indexes on that table.

Sure. Why not? 3...5mio records is not really a problem.
We had bigger tables with historic commercial transactions
(even on an old dual PIII/1000) with fine performance.
I bet however, yahoo, google at least are much bigger :-)

Regards
Tino

---------------------------(end of broadcast)---------------------------
TIP 5: Have you checked our extensive FAQ?

http://www.postgresql.org/docs/faqs/FAQ.html

Nov 23 '05 #3
Dennis Gearon wrote:
Google probably is much bigger, and on mainframes, and probably Oracle
or DB2.
Google uses a Linux cluster and there database is HUGE. I do not know
which database
they use. I bet they built their own specifically for what they do.

Sincerely,

Joshua D. Drake

But the table I am worried about is the one sized >= 3.6 GIGA records.

Tino Wildenhain wrote:
Hi,

Am Do, den 21.10.2004 schrieb Dennis Gearon um 1:30:
I am designing something that may be the size of yahoo, google,
ebay, etc.

Just ONE many to many table could possibly have the following
characteristics:

3,600,000,000 records
each record is 9 fields of INT4/DATE

Other tables will have about 5 million records of about the same size.

There are lots of scenarios here to lessson this.

BUT, is postgres on linux, maybe necessarily a 64 bit system,
cabable of this? And there'd be 4-5 indexes on that table.


Sure. Why not? 3...5mio records is not really a problem.
We had bigger tables with historic commercial transactions
(even on an old dual PIII/1000) with fine performance.
I bet however, yahoo, google at least are much bigger :-)

Regards
Tino

---------------------------(end of broadcast)---------------------------
TIP 5: Have you checked our extensive FAQ?

http://www.postgresql.org/docs/faqs/FAQ.html


--
Command Prompt, Inc., home of Mammoth PostgreSQL - S/ODBC and S/JDBC
Postgresql support, programming shared hosting and dedicated hosting.
+1-503-667-4564 - jd@commandprompt.com - http://www.commandprompt.com
PostgreSQL Replicator -- production quality replication for PostgreSQL
---------------------------(end of broadcast)---------------------------
TIP 1: subscribe and unsubscribe commands go to ma*******@postgresql.org

Nov 23 '05 #4
Actually, now that I think about it, they use a special table type that the INDEX is also the DATUM. It is possible to recover the data, out of the index listing. So go down the index, then decode the indexing value - voila, a whole step saved. I have no idea what engine these table types are in, however.

Joshua D. Drake wrote:
Dennis Gearon wrote:
Google probably is much bigger, and on mainframes, and probably Oracle
or DB2.

Google uses a Linux cluster and there database is HUGE. I do not know
which database
they use. I bet they built their own specifically for what they do.

Sincerely,

Joshua D. Drake

But the table I am worried about is the one sized >= 3.6 GIGA records.

Tino Wildenhain wrote:
Hi,

Am Do, den 21.10.2004 schrieb Dennis Gearon um 1:30:

I am designing something that may be the size of yahoo, google,
ebay, etc.

Just ONE many to many table could possibly have the following
characteristics:

3,600,000,000 records
each record is 9 fields of INT4/DATE

Other tables will have about 5 million records of about the same size.

There are lots of scenarios here to lessson this.

BUT, is postgres on linux, maybe necessarily a 64 bit system,
cabable of this? And there'd be 4-5 indexes on that table.


Sure. Why not? 3...5mio records is not really a problem.
We had bigger tables with historic commercial transactions
(even on an old dual PIII/1000) with fine performance.
I bet however, yahoo, google at least are much bigger :-)

Regards
Tino

---------------------------(end of broadcast)---------------------------
TIP 5: Have you checked our extensive FAQ?

http://www.postgresql.org/docs/faqs/FAQ.html


---------------------------(end of broadcast)---------------------------
TIP 3: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to ma*******@postgresql.org so that your
message can get through to the mailing list cleanly

Nov 23 '05 #5
On Wed, 2004-10-20 at 23:01 -0700, Joshua D. Drake wrote:
Dennis Gearon wrote:
Google probably is much bigger, and on mainframes, and probably Oracle
or DB2.


Google uses a Linux cluster and there database is HUGE. I do not know
which database
they use. I bet they built their own specifically for what they do.


....actually, I heard they were running it off a flat file database on 7
386 machines in some guys garage off a dsl connection. I could be wrong
though. ;-)

-Robby

--
/***************************************
* Robby Russell | Owner.Developer.Geek
* PLANET ARGON | www.planetargon.com
* Portland, OR | ro***@planetargon.com
* 503.351.4730 | blog.planetargon.com
* PHP/PostgreSQL Hosting & Development
****************************************/
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.6 (GNU/Linux)

iD8DBQBBd3Y30QaQZBaqXgwRAm+AAKDT777dkqnCG7M1f5MHGJ mUv8p0YQCbBUFj
DWRKRa9k3vc93yudLQvGI2I=
=OjYA
-----END PGP SIGNATURE-----

Nov 23 '05 #6
On 21. okt 2004, at 01:30, Dennis Gearon wrote:
I am designing something that may be the size of yahoo, google, ebay,
etc.
Grrr. Geek wet-dream.
Just ONE many to many table could possibly have the following
characteristics:

3,600,000,000 records
each record is 9 fields of INT4/DATE
I don't do this myself (my data is only 3 gig, and most of that is in
blobs), but people have repeatedly reported such sizes on this list.

Check
http://archives.postgresql.org/pgsql...1/msg00188.php

.... but the best you can do is just to try it out. With a few commands
in the 'pql' query tool you can easily populate a ridiculously large
database ("insert into foo select * from foo" a few times).

In few hours you'll have some feel of it.
Other tables will have about 5 million records of about the same size.

There are lots of scenarios here to lessson this.


What you'll have to worry about most is the access pattern, and update
frequency.

There's a lot of info out there. You may need any of the following:
• clustering, the 'slony' project seems to be popular around here.
• concurrency of updating
• connnection pooling, maybe via Apache or some java-thingey
• securing yourself from hardware errors

This list is a goldmine of discussions. Search the archives for
discussions and pointers. Search interfaces at

http://archives.postgresql.org/pgsql-general/
http://archives.postgresql.org/pgsql-admin/

..... or download the list archive mbox files into your mail-program and
use that (which is what I do).

d.
--
David Helgason,
Business Development et al.,
Over the Edge I/S (http://otee.dk)
Direct line +45 2620 0663
Main line +45 3264 5049

---------------------------(end of broadcast)---------------------------
TIP 7: don't forget to increase your free space map settings

Nov 23 '05 #7
Dennis Gearon wrote:
I am designing something that may be the size of yahoo, google, ebay, etc.

Just ONE many to many table could possibly have the following
characteristics:

3,600,000,000 records


This is a really huge monster one, and if you don't partition that
table in some way I think you'll have nightmares with it...

Regards
Gaetano Mendola

Nov 23 '05 #8
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Dennis Gearon wrote:
| Gaetano Mendola wrote:
|
|> Dennis Gearon wrote:
|>
|>> I am designing something that may be the size of yahoo, google, ebay,
|>> etc.
|>>
|>> Just ONE many to many table could possibly have the following
|>> characteristics:
|>>
|>> 3,600,000,000 records
|>
|> This is a really huge monster one, and if you don't partition that
|> table in some way I think you'll have nightmares with it...
|>
|> Regards
|> Gaetano Mendola
|>
| thanks for the input, Gaetano.

For partion in some way I don't mean only split it in more tables. You
can use some available tools in postgres and continue to see this table
as one but implemented behind the scenes with more tables.
One usefull and impressive way is to use the inheritance in order to obtain
a vertical partition

0) Decide a partition policy ( based on time stamp for example )
1) Create an empty base table with the name that you want see as "public"
2) Create the partition using the empty table as base table
3) Create a rule on the base table so an insert or the update on it is
~ performed as a insert or an update on the right table ( using the partition
~ policy at step 0 )

in this way you are able to vacuum each partition, reindex each partition and
so on in a more "feseable way" I do not immagine vacuum full or reindex a
3,600,000,000 records table...

Regards
Gaetano Mendola








-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.5 (MingW32)
Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org

iD8DBQFBeLiK7UpzwH2SGd4RAh+TAJ4w89SvkFWgt9DGhQx/aUR6j2wDtwCgtut5
FN0OuoycbI37a8Wouvo3icw=
=Wb6h
-----END PGP SIGNATURE-----
---------------------------(end of broadcast)---------------------------
TIP 7: don't forget to increase your free space map settings

Nov 23 '05 #9
Great Idea! When I get that far, I will try it.

Gaetano Mendola wrote:

<snip>
For partion in some way I don't mean only split it in more tables. You
can use some available tools in postgres and continue to see this table
as one but implemented behind the scenes with more tables.
One usefull and impressive way is to use the inheritance in order to obtain
a vertical partition

0) Decide a partition policy ( based on time stamp for example )
1) Create an empty base table with the name that you want see as "public"
2) Create the partition using the empty table as base table
3) Create a rule on the base table so an insert or the update on it is
~ performed as a insert or an update on the right table ( using the
partition
~ policy at step 0 )

in this way you are able to vacuum each partition, reindex each
partition and
so on in a more "feseable way" I do not immagine vacuum full or reindex a
3,600,000,000 records table...

---------------------------(end of broadcast)---------------------------
TIP 3: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to ma*******@postgresql.org so that your
message can get through to the mailing list cleanly

Nov 23 '05 #10

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

1
by: Kums | last post by:
I have a 4K page size table space, and it has just multiple tables. (I need to know as well if it's a single table in that tablespace, how to handle.) It has reached the maximum size of 64 GB...
3
by: dunleav1 | last post by:
I am trying to write a sql script to estimate size and count of rows of all tables and indexes within a schema. Here's what I have so far but need a little help. #$1 is database name and $2 is...
4
by: bdotson | last post by:
Does anyone know the record limit for a single table in Access 97? I have a client who has over 800,000 records in one table. They are experiencing math problems and timeouts when running reports....
3
by: Alan Carbutt | last post by:
Hi all, I'm running a data warehouse in Postgres and I was wondering what the max table size is column-wise and row-wise. I tried to search the lists and documentation on the...
8
by: Ron | last post by:
Hi all, How do I determine the size of the tables I'm using? I looked under properties and it's not there. The book I just browsed said table is limited to 1GB. How do I find out what size my...
0
by: nassim.bouayad.agha | last post by:
Unique table version record Hello, I am using a table record to store informations about database dump,I use this table : CREATE TABLE dump_version( count BIGINT unsigned NOT NULL DEFAULT 0,...
1
by: VIJENDRA1978 | last post by:
I need to control the size of table in database in Ms Sql server 2005. My objective is to see if the table size grows beyond a certain limit like 300mb or so. If it grows more then this then fire a...
5
by: Stepheno | last post by:
Hi, I am a recently converted Iseries (AS/400) RPG programmer trying to learn HTML/CSS/JavsScript all at the same time (not fun). My problem deals mostly with CSS. I will be reveiving a table,...
8
by: ananthaisin | last post by:
How to reduce the table size for any table while using truncate or delete statements. In oracle 8i it was truncating the storage space but in 10g it is not .... I have given truncate statement in...
0
by: taylorcarr | last post by:
A Canon printer is a smart device known for being advanced, efficient, and reliable. It is designed for home, office, and hybrid workspace use and can also be used for a variety of purposes. However,...
0
by: Charles Arthur | last post by:
How do i turn on java script on a villaon, callus and itel keypad mobile phone
0
by: ryjfgjl | last post by:
If we have dozens or hundreds of excel to import into the database, if we use the excel import function provided by database editors such as navicat, it will be extremely tedious and time-consuming...
0
by: ryjfgjl | last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
0
by: Hystou | last post by:
There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...
0
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...
0
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.