Mysterious performance of query because of plsql function in where condition

Peter Alberer

Hi there,

i have a problem with a query that uses the result of a plsql function
In the where clause:

SELECT
assignments.assignment_id,
assignments.package_id AS package_id,
assignments.title AS title,
COUNT(*) AS Count
FROM
assignments INNER JOIN submissions ON
(assignments.assignment_id=submissions.assignment_ id)
WHERE
package_id=949589 AND
submission_status(submissions.submission_id)='clos ed'
GROUP BY
assignments.assignment_id, assignments.package_id, assignments.title
ORDER BY
assignments.title;

Postgres seems to execute the function "submission_status" for every row
of the submissions table (~1500 rows). The query therefore takes quite a
lot time, although in fact no row is returned from the assignments table
when the condition package_id=949589 is used.

QUERY PLAN

------------------------------------------------------------------------
------------------------------------------------------
Sort (cost=41.21..41.21 rows=1 width=35) (actual
time=4276.978..4276.978 rows=0 loops=1)
Sort Key: assignments.title
-> HashAggregate (cost=41.19..41.20 rows=1 width=35) (actual
time=4276.970..4276.970 rows=0 loops=1)
-> Hash Join (cost=2.40..41.18 rows=1 width=35) (actual
time=4276.966..4276.966 rows=0 loops=1)
Hash Cond: ("outer".assignment_id =
"inner".assignment_id)
-> Seq Scan on submissions (cost=0.00..38.73 rows=9
width=4) (actual time=10.902..4276.745 rows=38 loops=1)
Filter: (submission_status(submission_id) =
'closed'::text)
-> Hash (cost=2.40..2.40 rows=2 width=35) (actual
time=0.058..0.058 rows=0 loops=1)
-> Seq Scan on assignments (cost=0.00..2.40
rows=2 width=35) (actual time=0.015..0.052 rows=2 loops=1)
Filter: (package_id = 949589)
Total runtime: 4277.078 ms
(11 rows)

I therefore tried to rephrase the query, to make sure that the function
is only used for the rows returned by the join but not even the
following does help (the subselect t1 does not return a single row):

select * from (
SELECT
a.assignment_id, a.package_id, a.title, s.submission_id,
COUNT(*) AS Count
FROM
assignments a INNER JOIN submissions s ON
(a.assignment_id=s.assignment_id)
WHERE
a.package_id=949589
GROUP BY
a.assignment_id, a.package_id, a.title, s.submission_id
) t1
where
submission_status(t1.submission_id)='closed'
order by
title;

QUERY PLAN

------------------------------------------------------------------------
--------------------------------------------------------------
Sort (cost=41.21..41.22 rows=1 width=188) (actual
time=4114.251..4114.251 rows=0 loops=1)
Sort Key: title
-> Subquery Scan t1 (cost=41.20..41.20 rows=1 width=188) (actual
time=4114.242..4114.242 rows=0 loops=1)
-> HashAggregate (cost=41.20..41.20 rows=1 width=39) (actual
time=4114.238..4114.238 rows=0 loops=1)
-> Hash Join (cost=2.40..41.18 rows=1 width=39) (actual
time=4114.235..4114.235 rows=0 loops=1)
Hash Cond: ("outer".assignment_id =
"inner".assignment_id)
-> Seq Scan on submissions s (cost=0.00..38.73
rows=9 width=8) (actual time=7.179..4113.984 rows=38 loops=1)
Filter: (submission_status(submission_id) =
'closed'::text)
-> Hash (cost=2.40..2.40 rows=2 width=35) (actual
time=0.100..0.100 rows=0 loops=1)
-> Seq Scan on assignments a
(cost=0.00..2.40 rows=2 width=35) (actual time=0.045..0.094 rows=2
loops=1)
Filter: (package_id = 949589)
Total runtime: 4114.356 ms
(12 rows)

The function is nevertheless executed for every row in the submissions
table. A simple "select *, submission_status(submission_id) from
submissions" takes about the same time as the 2 queries stated above.

The whole database has been vacuum analysed right before the explain
analyse output has been captured.

What can I do to reduce the time this query takes? And why is the
function executed although there is no row in the result set of t1 in my
rephrased query?

TIA, peter

--
pe***********@wu-wien.ac.at Tel: +43/1/31336/4341
Abteilung für Wirtschaftsinformatik, Wirtschaftsuniversitaet Wien,
Austria

---------------------------(end of broadcast)---------------------------
TIP 3: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to ma*******@postgresql.org so that your
message can get through to the mailing list cleanly

Nov 23 '05 #1

Subscribe Reply

1973

mike g

Do you have any indexes created on the submissions table? If not
postgresql has no choice but read every row in the table.

If you do they are not being used. The tuning masters would really need
to see definition of the table if the indexes are not being used.

You might find a lot of pointers in the pgsql-performance mailing list
instead of this one.

Mike
On Thu, 2004-07-01 at 06:52, Peter Alberer wrote:

Hi there,

i have a problem with a query that uses the result of a plsql function
In the where clause:

SELECT
assignments.assignment_id,
assignments.package_id AS package_id,
assignments.title AS title,
COUNT(*) AS Count
FROM
assignments INNER JOIN submissions ON
(assignments.assignment_id=submissions.assignment_ id)
WHERE
package_id=949589 AND
submission_status(submissions.submission_id)='clos ed'
GROUP BY
assignments.assignment_id, assignments.package_id, assignments.title
ORDER BY
assignments.title;

Postgres seems to execute the function "submission_status" for every row
of the submissions table (~1500 rows). The query therefore takes quite a
lot time, although in fact no row is returned from the assignments table
when the condition package_id=949589 is used.

QUERY PLAN

------------------------------------------------------------------------
------------------------------------------------------
Sort (cost=41.21..41.21 rows=1 width=35) (actual
time=4276.978..4276.978 rows=0 loops=1)
Sort Key: assignments.title
-> HashAggregate (cost=41.19..41.20 rows=1 width=35) (actual
time=4276.970..4276.970 rows=0 loops=1)
-> Hash Join (cost=2.40..41.18 rows=1 width=35) (actual
time=4276.966..4276.966 rows=0 loops=1)
Hash Cond: ("outer".assignment_id =
"inner".assignment_id)
-> Seq Scan on submissions (cost=0.00..38.73 rows=9
width=4) (actual time=10.902..4276.745 rows=38 loops=1)
Filter: (submission_status(submission_id) =
'closed'::text)
-> Hash (cost=2.40..2.40 rows=2 width=35) (actual
time=0.058..0.058 rows=0 loops=1)
-> Seq Scan on assignments (cost=0.00..2.40
rows=2 width=35) (actual time=0.015..0.052 rows=2 loops=1)
Filter: (package_id = 949589)
Total runtime: 4277.078 ms
(11 rows)

I therefore tried to rephrase the query, to make sure that the function
is only used for the rows returned by the join but not even the
following does help (the subselect t1 does not return a single row):

select * from (
SELECT
a.assignment_id, a.package_id, a.title, s.submission_id,
COUNT(*) AS Count
FROM
assignments a INNER JOIN submissions s ON
(a.assignment_id=s.assignment_id)
WHERE
a.package_id=949589
GROUP BY
a.assignment_id, a.package_id, a.title, s.submission_id
) t1
where
submission_status(t1.submission_id)='closed'
order by
title;

QUERY PLAN

------------------------------------------------------------------------
--------------------------------------------------------------
Sort (cost=41.21..41.22 rows=1 width=188) (actual
time=4114.251..4114.251 rows=0 loops=1)
Sort Key: title
-> Subquery Scan t1 (cost=41.20..41.20 rows=1 width=188) (actual
time=4114.242..4114.242 rows=0 loops=1)
-> HashAggregate (cost=41.20..41.20 rows=1 width=39) (actual
time=4114.238..4114.238 rows=0 loops=1)
-> Hash Join (cost=2.40..41.18 rows=1 width=39) (actual
time=4114.235..4114.235 rows=0 loops=1)
Hash Cond: ("outer".assignment_id =
"inner".assignment_id)
-> Seq Scan on submissions s (cost=0.00..38.73
rows=9 width=8) (actual time=7.179..4113.984 rows=38 loops=1)
Filter: (submission_status(submission_id) =
'closed'::text)
-> Hash (cost=2.40..2.40 rows=2 width=35) (actual
time=0.100..0.100 rows=0 loops=1)
-> Seq Scan on assignments a
(cost=0.00..2.40 rows=2 width=35) (actual time=0.045..0.094 rows=2
loops=1)
Filter: (package_id = 949589)
Total runtime: 4114.356 ms
(12 rows)

The function is nevertheless executed for every row in the submissions
table. A simple "select *, submission_status(submission_id) from
submissions" takes about the same time as the 2 queries stated above.

The whole database has been vacuum analysed right before the explain
analyse output has been captured.

What can I do to reduce the time this query takes? And why is the
function executed although there is no row in the result set of t1 in my
rephrased query?

TIA, peter

--
pe***********@wu-wien.ac.at Tel: +43/1/31336/4341
Abteilung fÃ¼r Wirtschaftsinformatik, Wirtschaftsuniversitaet Wien,
Austria

---------------------------(end of broadcast)---------------------------
TIP 3: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to ma*******@postgresql.org so that your
message can get through to the mailing list cleanly

---------------------------(end of broadcast)---------------------------
TIP 9: the planner will ignore your desire to choose an index scan if your
joining column's datatypes do not match

Nov 23 '05 #2

Similar topics

3232

Performance issue with new 9i database

by: Shankar | last post by:

Hello, I am seeing huge performance problems on the queries executed against 9i database. I am not too familiar with 9i, But I would like to ask the DBA to check whether all the parameters are set...

Oracle Database

2487

Mysterious 9.2.0.4 (on HP-UX) problem

by: SteveS | last post by:

Can anyone help with a mysterious problem that has arisen since 'upgrading' from 8 to 9.2.0.4? The situation is this: Queries that worked fine under 8 are now producing *really* strange...

Oracle Database

3761

DEFAULT keyword performance

by: Jason | last post by:

I have a function which performs a query and returns a table. The one parameter that can get passed in is a date which defaults to NULL. There is an IF statement in the function that will set the...

Microsoft SQL Server

5203

Curious performance issue when running a query

by: Paul Mateer | last post by:

Hi, I have been running some queries against a table in a my database and have noted an odd (at least it seems odd to me) performance issue. The table has approximately 5 million rows and...

Microsoft SQL Server

1778

Mysterious mystery -- 2002-11-08/2002-11-09

by: Jim Geissman | last post by:

I have function that returns a table of information about properties. The data comes from three different tables -- addresses (called PropertyID), property characteristics, and events concerning...

Microsoft SQL Server

1306

Which has the best performance?

by: Robin Tucker | last post by:

This: SELECT MAX(TheDate) FROM MyTable or this: SELECT TOP 1 TheDate FROM MyTable ORDER BY TheDate DESC As a follow up question to save me having to post, if I want a different

Microsoft SQL Server

2855

poor query performance

by: Ion | last post by:

Hi all, I have a query that takes almost 1 hour to complete. This is acceptable in certain situations, but unacceptable when no rows should qualify. Something like: Select list >From...

DB2 Database

81964

SQL performance: Nested SELECT vs. INNER JOIN

by: Brian | last post by:

Hello All - I am wondering if anyone has any thoughts on which is better from a performance perspective: a nested Select statement or an Inner Join. For example, I could do either of the...

Microsoft Access / VBA

2036

Performance on access queries with conditions

by: bhbgroup | last post by:

I have a query on one large table. I only add one condition, i.e. a date (the SQL reads like 'where date > parameterdate'. This query is rather quick if 'parameterdate' is either explicitly...

Microsoft Access / VBA

7178

Problem With Comparison Operator <=> in G++

by: Oralloy | last post by:

Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...

C / C++

6899

The easy way to turn off automatic updates for Windows 10/11

by: Hystou | last post by:

Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...

Windows Server

7390

Discussion: How does Zigbee compare with other wireless protocols in smart home applications?

by: tracyyun | last post by:

Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...

General

5475

AI Job Threat for Devs

by: agi2029 | last post by:

Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing,...

Career Advice

4919

Access Europe - Using VBA to create a class based on a table - Wed 1 May

by: isladogs | last post by:

The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new...

Microsoft Access / VBA

4602

Couldn’t get equations in html when convert word .docx file to html file in C#.

by: conductexam | last post by:

I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and...

C# / C Sharp

3103

Trying to create a lan-to-lan vpn between two differents networks

by: TSSRALBI | last post by:

Hello I'm a network technician in training and I need your help. I am currently learning how to create and manage the different types of VPNs and I have a question about LAN-to-LAN VPNs. The...

Networking - Hardware / Configuration

665

How to add payments to a PHP MySQL app.

by: muto222 | last post by:

How can i add a mobile payment intergratation into php mysql website.

PHP

302

Comprehensive Guide to Website Development in Toronto: Expert Insights from BSMN Consultancy

by: bsmnconsultancy | last post by:

In today's digital era, a well-designed website is crucial for businesses looking to succeed. Whether you're a small business owner or a large corporation in Toronto, having a strong online presence...

General