After a long battle with technology, "Tim Nelson" <ti***************@softhome.net>, an earthling, wrote:
I am a newbie trying to port my applications to Postgres. I have an
application that is bulk loading a table with autocommit off (with
it on, it's way too slow). The logic of the application dictates that
I try the insert, and if it fails because of a duplicate, update the
record instead. The entire bulk load is wrapped in a transaction
for speed purposes. When I hit a duplicate I can detect it, but
when I try the update it fails with:
ERROR: current transaction is aborted, commands ignored until end of
transaction block
I assume this is a result of the duplicate error, but is there a way
I can get around this? In my case a duplicate is fatal to my
transaction...
There is no ready way of blindly doing this.
Several improvements would seem plausible:
1. Try to do groups of 1000 records, committing every 1K, so that if
there's a failure, it is only expected to roll back on the order
of 500 rows.
2. Along with that approach, keep the last 1K records 'buffered' in
memory, so that if you hit a failure, you can quickly go back and
redo the ones that _were_ good, throwing in a COMMIT as soon as
you complete all the records you _know_ to be good.
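   In terms of what goes over the wire, the batching looks something
   like this (batch size, table, and values are illustrative, not from
   the original post):

```sql
begin;
-- rows 1 .. 1000 go in here
insert into mytable (f1,f2,f3,f4) values (...);
commit;   -- the first 1K rows are now safely stored

begin;
-- rows 1001 .. 2000; if one of these fails, only this batch
-- rolls back; the application then replays the buffered rows
-- it knows to be good, and commits those
insert into mytable (f1,f2,f3,f4) values (...);
commit;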
3. Alternatively, before doing each row, do a query that checks to
see if the relevant key is already there. That is regrettably
likely to be rather time-consuming, and you'll be firing in
great gobs of little queries.
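   Concretely, each row would cost an extra round trip along these
   lines (assuming the key column is f1; names are illustrative):

```sql
select 1 from mytable where f1 = 'this';
-- row found:    update mytable set f2 = 'that', f3 = 'other', f4 = 'foo'
--                   where f1 = 'this';
-- no row found: insert into mytable (f1,f2,f3,f4)
--                   values ('this','that','other','foo');
```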
4. You could create a stored procedure that amounts to "insert or
ignore", and pass the data through that.
Thus, instead of
insert into mytable (f1,f2,f3,f4) values ('this', 'that',
'other', 'foo');
You might do...
select add_to_my_table ('this','that','other', 'foo');
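   One plausible shape for such a function, in PL/pgSQL, assuming
   mytable has columns f1..f4 with f1 as the key (names are
   illustrative; this one does update-on-duplicate, which is what you
   want, rather than ignore):

```sql
-- Sketch only: adjust table, columns, and key to your schema
CREATE FUNCTION add_to_my_table (text, text, text, text)
RETURNS integer AS '
BEGIN
    PERFORM 1 FROM mytable WHERE f1 = $1;
    IF FOUND THEN
        UPDATE mytable SET f2 = $2, f3 = $3, f4 = $4
            WHERE f1 = $1;
    ELSE
        INSERT INTO mytable (f1, f2, f3, f4)
            VALUES ($1, $2, $3, $4);
    END IF;
    RETURN 1;
END;
' LANGUAGE 'plpgsql';
```

   Since the function traps the "duplicate" case itself, no error ever
   reaches the transaction, and it never gets aborted.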
5. You can cut down on the number of statements by grouping these
together...
select atmt([row 1 data]), atmt([row 2 data]),
       atmt([row 3 data]), ... atmt([row 20 data]);
That allows you to submit 20 records in one statement, which
is fairly likely to save some query submission overhead.
It oughtn't make parsing too much more expensive.
6. Stage the bulk load.
Load the data into a "staging" table, which has minimal (if any)
integrity checks; this would be most efficiently handled via
COPY, which would load it all in one transaction.
Then take that data and insert the relevant bits into the new
location.
Thus:
-- This'll be way fast
copy stage_table from '/tmp/cruddy_data.txt';
-- Maybe do some validation, fix crud, delete crud, and such...
select * from stage_table;             -- Look for crud
update stage_table set ... where ...;  -- Fix crud
delete from stage_table where ...;     -- Delete crud
-- stage_table now contains gleaming fixed data; load into final table...
insert into ultimate_table
select * from stage_table s where
not exists (select * from ultimate_table u where u.pkey = s.pkey);
-- Alternatively, if you had deleted the cruddy data, it might
-- just be:
insert into ultimate_table
select * from stage_table s;
I rather like approach #6, FYI...
--
output = ("cbbrowne" "@" "ntlug.org")
http://www3.sympatico.ca/cbbrowne/lsf.html
"The wrath of Holloway is nothing compared to the wrath of Moon."
-- Fred Drenckhahn