
data directory growing huge

Hello,

It is very possible that this is a well-described problem, but I have
not been able to find the solution.

On two production servers (7.2rc2) of ours the data directory is growing
to a very large size, while the data that is actually in the databases is
neither 1. that large nor 2. growing.

The databases see fairly limited but constant use at the moment. The
data they contain is close to real-time information, and as such they
are continuously updated with new data. The number of queries executed
is low or very low (this will change in the future).

To prevent the database from growing to an unmanageable size, it is
being emptied and vacuumed twice a day. This is only a plain "vacuum",
by the way, not a "vacuum analyze". I did run an analyze by hand to get
the planner to use a plan suited for tables of 100k+ rows, as
performance was horrid before that.
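
For reference, the routine amounts to something like this ("realtime_db"
and "samples" are placeholder names, not the real ones):

    #!/bin/sh
    # Twice-daily maintenance (sketch). A plain VACUUM only marks dead
    # row versions as reusable space; VACUUM ANALYZE would additionally
    # refresh the planner statistics.
    psql -d realtime_db -c "DELETE FROM samples;"   # empty the table
    psql -d realtime_db -c "VACUUM;"                # plain vacuum only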

The problem is that the data directories have reached sizes of 4.1 and
4.5 gigabytes respectively. Recreating the 4.1 GB database from a
pg_dump on a different system yields a total of about 460 MB for ALL
databases on that system, and the dump file itself is 10 MB. Clearly the
4 GB is a bit wasteful...

In addition to the daily automatic vacuums I have done a "vacuum full"
by hand (this took a few hours), but this has not resulted in any
reduction in file sizes.

The bulk of the space is occupied by a dozen huge files in base/16559,
all sufficiently meaninglessly named: 72646, 72653.1, etc.

As vacuuming doesn't help, what's up? Could it be that there are lots
of stuck transactions keeping huge amounts of old, already-removed rows
on disk through MVCC, or something like that?
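
One thing that might show this, I imagine, is listing what every backend
is doing (this assumes stats_command_string is enabled in
postgresql.conf so that the current_query column is populated):

    # Look for backends that sit "<IDLE> in transaction" for a long time.
    psql -d realtime_db -c \
      "SELECT procpid, usename, current_query FROM pg_stat_activity;"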

thanks for any help,

Esger

--
NeoMail - Webmail that doesn't suck... as much.
http://neomail.sourceforge.net


Nov 11 '05 #1
3 Replies


Esger Abbink wrote:
> The databases see fairly limited but constant use at the moment. The
> data they contain is close to real-time information, and as such they
> are continuously updated with new data. The number of queries executed
> is low or very low (this will change in the future).

No wonder the data is growing so fast: if it is updated in real time, it
is going to generate dead tuples like crazy.
> The problem is that the data directories have reached sizes of 4.1 and
> 4.5 gigabytes respectively. Recreating the 4.1 GB database from a
> pg_dump on a different system yields a total of about 460 MB for ALL
> databases on that system, and the dump file itself is 10 MB. Clearly
> the 4 GB is a bit wasteful...
>
> In addition to the daily automatic vacuums I have done a "vacuum full"
> by hand (this took a few hours), but this has not resulted in any
> reduction in file sizes.

Hmm... VACUUM FULL should have done the trick.

> The bulk of the space is occupied by a dozen huge files in base/16559,
> all sufficiently meaninglessly named: 72646, 72653.1, etc.

You can find the corresponding table names by querying pg_class.
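
For example, taking one of the filenode numbers from your post
("realtime_db" stands in for your database name):

    # Map a file in base/<db-oid>/ back to the relation it belongs to.
    # relkind 'r' means table, 'i' means index; a ".1" suffix is just
    # the second 1 GB segment of the same relation.
    psql -d realtime_db -c \
      "SELECT relname, relkind FROM pg_class WHERE relfilenode = 72646;"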
> As vacuuming doesn't help, what's up? Could it be that there are lots
> of stuck transactions keeping huge amounts of old, already-removed rows
> on disk through MVCC, or something like that?


If you can take the database down for some time, you can dump the
entire database with pg_dump, drop the database, and recreate it from
the dump. That would surely take a lot less than a few hours.
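
Roughly like this (database name is a placeholder again; make sure
nothing writes to the database during the window):

    # Dump, drop, and recreate -- this rewrites every table and index
    # from scratch, so all of the dead space is gone afterwards.
    pg_dump realtime_db > realtime_db.dump
    dropdb realtime_db
    createdb realtime_db
    psql -d realtime_db -f realtime_db.dump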

I also think you desperately need pg_autovacuum, a daemon that vacuums
in real time according to need. It's in CVS head and works with 7.3/7.4.
Up your max_fsm settings appropriately too; that should help keep your
database size in check.
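
For example, in postgresql.conf (the numbers below are only
illustrative; size them to how many pages get dirtied between vacuums):

    # Free space map settings; the stock defaults are far too small
    # for heavily updated databases. Illustrative values only:
    max_fsm_pages = 100000
    max_fsm_relations = 1000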

HTH

Shridhar


Nov 12 '05 #2

"Esger Abbink" <pg*******@bumblebeast.com> writes:
> On two production servers (7.2rc2) of ours the data directory is
> growing to a very large size, while the data that is actually in the
> databases is neither 1. that large nor 2. growing.

You're running a release candidate of 7.2? That's pretty old and has
known bugs...

Your problem, however, is probably index bloat, which is not fixed by
VACUUM in the version you're running. If you REINDEX you should see
storage usage go back to normal.
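
For example ("samples" and "realtime_db" are placeholder names; note
that REINDEX takes an exclusive lock on the table while it runs):

    # Rebuild all indexes on one table; repeat for each bloated table.
    psql -d realtime_db -c "REINDEX TABLE samples;"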

The upcoming 7.4 should suffer much less from the index bloat problem.

-Doug


Nov 12 '05 #3

"Esger Abbink" <pg*******@bumblebeast.com> writes:
> The bulk of the space is occupied by a dozen huge files in base/16559,
> all sufficiently meaninglessly named: 72646, 72653.1, etc.

You could find out what they are by consulting pg_class.relfilenode
(or see the contrib/oid2name utility). But my bet is that they are
indexes and you need to REINDEX them to recover the space.
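
With contrib/oid2name the same lookup can be done from the shell (the
exact flags vary between versions; older builds may only accept an OID
via -o, so treat this as a sketch):

    # Map a filenode number back to a relation name.
    oid2name -d realtime_db -f 72646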
> On two production servers (7.2rc2) of ours

[ blanch ] You're running a production server on 7.2rc2? Still?
Get thee to 7.2.4, at least. We do not make dot-releases for idle
amusement.

regards, tom lane


Nov 12 '05 #4
