UPDATEs with multiple aggregate functions

Steven An

Howdy,

I need to write an update query with multiple aggregate functions.
Here is an example:

UPDATE t
SET
t.a = ( select avg(f.q) from dbo.foo f where f.p = t.y ),
t.b = ( select sum(f.q) from dbo.foo f where f.p = t.y )
FROM dbo.test t

Basically I need to get some aggregate statistics about the rows of
foo and store them in rows of t. The above statement works fine...but
note how the two subSelect's have the exact same WHERE clause. This
screams at me to combine them...but how? I would like to have
something like this in my query:

SELECT avg(f.q), sum(f.q) FROM dbo.foo f WHERE f.p = 2

...and somehow store the results in t.a and t.b. Is there any way to
do this?

Thanks before hand!

Jul 20 '05 #1

Subscribe Reply

9959

John Bell

Hi

You could try:

UPDATE t
SET a = d.a
b = d.b
FROM dbo.test t JOIN
( SELECT p,avg(q),sum(q) FROM dbo.foo GROUP BY p ) f ON f.p = t.y

John

"Steven An" <st******@uclink.berkeley.edu> wrote in message
news:49**************************@posting.google.c om...

Howdy,

I need to write an update query with multiple aggregate functions.
Here is an example:

UPDATE t
SET
t.a = ( select avg(f.q) from dbo.foo f where f.p = t.y ),
t.b = ( select sum(f.q) from dbo.foo f where f.p = t.y )
FROM dbo.test t

Basically I need to get some aggregate statistics about the rows of
foo and store them in rows of t. The above statement works fine...but
note how the two subSelect's have the exact same WHERE clause. This
screams at me to combine them...but how? I would like to have
something like this in my query:

SELECT avg(f.q), sum(f.q) FROM dbo.foo f WHERE f.p = 2

..and somehow store the results in t.a and t.b. Is there any way to
do this?

Thanks before hand!

Jul 20 '05 #2

--CELKO--

>> I need to write an update query with multiple aggregate functions.
Here is an example: <<

You might want to learn Standard SQL. There is no FROM clause in a
Standard SQL UPDATE statement; it would make no sense. Other products
(SQL Server, Sybase and Ingres) also use the UPDATE .. FROM syntax,
but with different semantics. So it does not port, or even worse,
when you do move it, it trashes your database. Other programmers
cannot read it and maintaining it is harder. And when Microsoft
decides to change it, you will have to do a re-write. Remember the
deprecated "*=" versus "LEFT OUTER JOIN" conversions?

The correct syntax for a searched update statement is

<update statement> ::=
UPDATE <table name>
SET <set clause list>
[WHERE <search condition>]

<set clause list> ::=
<set clause> [{ , <set clause> }...]

<set clause> ::= <object column> = <update source>

<update source> ::= <value expression> | NULL | DEFAULT

<object column> ::= <column name>

The UPDATE clause simply gives the name of the base table or updatable
view to be changed.

Notice that no correlation name is allowed in the UPDATE clause; this
is to avoid some self-referencing problems that could occur. But it
also follows the data model in Standard SQL. When you give a table
expression a correlation name, it is to act as if a materialized table
with that correlation name has been created in the database. That
table then is dropped at the end of the statement. If you allowed
correlation names in the UPDATE clause, you would be updating the
materialized table, which would then disappear and leave the base
table untouched.

The SET clause is a list of columns to be changed or made; the WHERE
clause tells the statement which rows to use. For this discussion, we
will assume the user doing the update has applicable UPDATE privileges
for each <object column>.

* The WHERE Clause

As mentioned, the most important thing to remember about the WHERE
clause is that it is optional. If there is no WHERE clause, all rows
in the table are changed. This is a common error; if you make it,
immediately execute a ROLLBACK statement.

All rows that test TRUE for the <search condition> are marked as a
subset and not as individual rows. It is also possible that this
subset will be empty. This subset is used to construct a new set of
rows that will be inserted into the table when the subset is deleted
from the table. Note that the empty subset is a valid update that
will fire declarative referential actions and triggers.

* The SET Clause

Each assignment in the <set clause list> is executed in parallel and
each SET clause changes all the qualified rows at once. Or at least
that is the theoretical model. In practice, implementations will
first mark all of the qualified rows in the table in one pass, using
the WHERE clause. If there were no problems, then the SQL engine
makes a copy of each marked row in working storage. Each SET clause
is executed based on the old row image and the results are put in the
new row image. Finally, the old rows are deleted and the new rows are
inserted. If an error occurs during all of this, then system does a
ROLLBACK, the table is left unchanged and the errors are reported.
This parallelism is not like what you find in a traditional
third-generation programming language, so it may be hard to learn.
This feature lets you write a statement that will swap the values in
two columns, thus:

UPDATE MyTable
SET a = b, b = a;

This is not the same thing as

BEGIN ATOMIC
UPDATE MyTable
SET a = b;
UPDATE MyTable
SET b = a;
END;

In the first UPDATE, columns a and b will swap values in each row. In
the second pair of UPDATEs, column a will get all of the values of
column b in each row. In the second UPDATE of the pair, a, which now
has the same value as the original value of b, will be written back
into column b -- no change at all. There are some limits as to what
the value expression can be. The same column cannot appear more than
once in a <set clause list> -- which makes sense, given the parallel
nature of the statement. Since both go into effect at the same time,
you would not know which SET clause to use.

If a subquery expression is used in a <set clause>, and it returns a
single value, the result set is cast to a scalar; if it returns an
empty, the result set is cast to a NULL; if it returns multiple rows,
a cardinality violation is raised.

When we finally get T-SQL up to SQL-92 specs, you could use row
constructors and have written:

UPDATE Test
SET ROW(a,b)
= (SELECT AVG(q), SUM(q) FROM FooWHERE Foo.p = Test.y);

Jul 20 '05 #3

Steven An

Just what I needed. This saves me a LOT of time :)

Many thanks!

"John Bell" <jb************@hotmail.com> wrote in message news:<2c*********************@news-text.cableinet.net>...

Hi

You could try:

UPDATE t
SET a = d.a
b = d.b
FROM dbo.test t JOIN
( SELECT p,avg(q),sum(q) FROM dbo.foo GROUP BY p ) f ON f.p = t.y

John

"Steven An" <st******@uclink.berkeley.edu> wrote in message
news:49**************************@posting.google.c om...
Howdy,

I need to write an update query with multiple aggregate functions.
Here is an example:

UPDATE t
SET
t.a = ( select avg(f.q) from dbo.foo f where f.p = t.y ),
t.b = ( select sum(f.q) from dbo.foo f where f.p = t.y )
FROM dbo.test t

Basically I need to get some aggregate statistics about the rows of
foo and store them in rows of t. The above statement works fine...but
note how the two subSelect's have the exact same WHERE clause. This
screams at me to combine them...but how? I would like to have
something like this in my query:

SELECT avg(f.q), sum(f.q) FROM dbo.foo f WHERE f.p = 2

..and somehow store the results in t.a and t.b. Is there any way to
do this?

Thanks before hand!

Jul 20 '05 #4

Steven An

Well, thank you for that very insightful answer. Yes, that ROW
operator is a lot like what I had in mind. I was looking for
something like SET (a,b) = (select ..). I will try that, and see
which one is faster. I have a feeling your method will be MUCH
faster, since unlike the JOIN, it does not require that I calculate
avg's and sum's for ALL rows, only the ones I want. Thank you!

jc*******@earthlink.net (--CELKO--) wrote in message news:<18**************************@posting.google. com>...

I need to write an update query with multiple aggregate functions.

Here is an example: <<

You might want to learn Standard SQL. There is no FROM clause in a
Standard SQL UPDATE statement; it would make no sense. Other products
(SQL Server, Sybase and Ingres) also use the UPDATE .. FROM syntax,
but with different semantics. So it does not port, or even worse,
when you do move it, it trashes your database. Other programmers
cannot read it and maintaining it is harder. And when Microsoft
decides to change it, you will have to do a re-write. Remember the
deprecated "*=" versus "LEFT OUTER JOIN" conversions?

The correct syntax for a searched update statement is

<update statement> ::=
UPDATE <table name>
SET <set clause list>
[WHERE <search condition>]

<set clause list> ::=
<set clause> [{ , <set clause> }...]

<set clause> ::= <object column> = <update source>

<update source> ::= <value expression> | NULL | DEFAULT

<object column> ::= <column name>

The UPDATE clause simply gives the name of the base table or updatable
view to be changed.

Notice that no correlation name is allowed in the UPDATE clause; this
is to avoid some self-referencing problems that could occur. But it
also follows the data model in Standard SQL. When you give a table
expression a correlation name, it is to act as if a materialized table
with that correlation name has been created in the database. That
table then is dropped at the end of the statement. If you allowed
correlation names in the UPDATE clause, you would be updating the
materialized table, which would then disappear and leave the base
table untouched.

The SET clause is a list of columns to be changed or made; the WHERE
clause tells the statement which rows to use. For this discussion, we
will assume the user doing the update has applicable UPDATE privileges
for each <object column>.

* The WHERE Clause

As mentioned, the most important thing to remember about the WHERE
clause is that it is optional. If there is no WHERE clause, all rows
in the table are changed. This is a common error; if you make it,
immediately execute a ROLLBACK statement.

All rows that test TRUE for the <search condition> are marked as a
subset and not as individual rows. It is also possible that this
subset will be empty. This subset is used to construct a new set of
rows that will be inserted into the table when the subset is deleted
from the table. Note that the empty subset is a valid update that
will fire declarative referential actions and triggers.

* The SET Clause

Each assignment in the <set clause list> is executed in parallel and
each SET clause changes all the qualified rows at once. Or at least
that is the theoretical model. In practice, implementations will
first mark all of the qualified rows in the table in one pass, using
the WHERE clause. If there were no problems, then the SQL engine
makes a copy of each marked row in working storage. Each SET clause
is executed based on the old row image and the results are put in the
new row image. Finally, the old rows are deleted and the new rows are
inserted. If an error occurs during all of this, then system does a
ROLLBACK, the table is left unchanged and the errors are reported.
This parallelism is not like what you find in a traditional
third-generation programming language, so it may be hard to learn.
This feature lets you write a statement that will swap the values in
two columns, thus:

UPDATE MyTable
SET a = b, b = a;

This is not the same thing as

BEGIN ATOMIC
UPDATE MyTable
SET a = b;
UPDATE MyTable
SET b = a;
END;

In the first UPDATE, columns a and b will swap values in each row. In
the second pair of UPDATEs, column a will get all of the values of
column b in each row. In the second UPDATE of the pair, a, which now
has the same value as the original value of b, will be written back
into column b -- no change at all. There are some limits as to what
the value expression can be. The same column cannot appear more than
once in a <set clause list> -- which makes sense, given the parallel
nature of the statement. Since both go into effect at the same time,
you would not know which SET clause to use.

If a subquery expression is used in a <set clause>, and it returns a
single value, the result set is cast to a scalar; if it returns an
empty, the result set is cast to a NULL; if it returns multiple rows,
a cardinality violation is raised.

When we finally get T-SQL up to SQL-92 specs, you could use row
constructors and have written:

UPDATE Test
SET ROW(a,b)
= (SELECT AVG(q), SUM(q) FROM FooWHERE Foo.p = Test.y);

Jul 20 '05 #5

Steven An

OK I didn't read the "When we finally" part. So as of now, my best
actual solution seems to be the JOIN'd UPDATE.. I guess I'll have to
break standard. Oh well..

jc*******@earthlink.net (--CELKO--) wrote in message news:<18**************************@posting.google. com>...

I need to write an update query with multiple aggregate functions.

Here is an example: <<

You might want to learn Standard SQL. There is no FROM clause in a
Standard SQL UPDATE statement; it would make no sense. Other products
(SQL Server, Sybase and Ingres) also use the UPDATE .. FROM syntax,
but with different semantics. So it does not port, or even worse,
when you do move it, it trashes your database. Other programmers
cannot read it and maintaining it is harder. And when Microsoft
decides to change it, you will have to do a re-write. Remember the
deprecated "*=" versus "LEFT OUTER JOIN" conversions?

The correct syntax for a searched update statement is

<update statement> ::=
UPDATE <table name>
SET <set clause list>
[WHERE <search condition>]

<set clause list> ::=
<set clause> [{ , <set clause> }...]

<set clause> ::= <object column> = <update source>

<update source> ::= <value expression> | NULL | DEFAULT

<object column> ::= <column name>

The UPDATE clause simply gives the name of the base table or updatable
view to be changed.

Notice that no correlation name is allowed in the UPDATE clause; this
is to avoid some self-referencing problems that could occur. But it
also follows the data model in Standard SQL. When you give a table
expression a correlation name, it is to act as if a materialized table
with that correlation name has been created in the database. That
table then is dropped at the end of the statement. If you allowed
correlation names in the UPDATE clause, you would be updating the
materialized table, which would then disappear and leave the base
table untouched.

The SET clause is a list of columns to be changed or made; the WHERE
clause tells the statement which rows to use. For this discussion, we
will assume the user doing the update has applicable UPDATE privileges
for each <object column>.

* The WHERE Clause

As mentioned, the most important thing to remember about the WHERE
clause is that it is optional. If there is no WHERE clause, all rows
in the table are changed. This is a common error; if you make it,
immediately execute a ROLLBACK statement.

All rows that test TRUE for the <search condition> are marked as a
subset and not as individual rows. It is also possible that this
subset will be empty. This subset is used to construct a new set of
rows that will be inserted into the table when the subset is deleted
from the table. Note that the empty subset is a valid update that
will fire declarative referential actions and triggers.

* The SET Clause

Each assignment in the <set clause list> is executed in parallel and
each SET clause changes all the qualified rows at once. Or at least
that is the theoretical model. In practice, implementations will
first mark all of the qualified rows in the table in one pass, using
the WHERE clause. If there were no problems, then the SQL engine
makes a copy of each marked row in working storage. Each SET clause
is executed based on the old row image and the results are put in the
new row image. Finally, the old rows are deleted and the new rows are
inserted. If an error occurs during all of this, then system does a
ROLLBACK, the table is left unchanged and the errors are reported.
This parallelism is not like what you find in a traditional
third-generation programming language, so it may be hard to learn.
This feature lets you write a statement that will swap the values in
two columns, thus:

UPDATE MyTable
SET a = b, b = a;

This is not the same thing as

BEGIN ATOMIC
UPDATE MyTable
SET a = b;
UPDATE MyTable
SET b = a;
END;

In the first UPDATE, columns a and b will swap values in each row. In
the second pair of UPDATEs, column a will get all of the values of
column b in each row. In the second UPDATE of the pair, a, which now
has the same value as the original value of b, will be written back
into column b -- no change at all. There are some limits as to what
the value expression can be. The same column cannot appear more than
once in a <set clause list> -- which makes sense, given the parallel
nature of the statement. Since both go into effect at the same time,
you would not know which SET clause to use.

If a subquery expression is used in a <set clause>, and it returns a
single value, the result set is cast to a scalar; if it returns an
empty, the result set is cast to a NULL; if it returns multiple rows,
a cardinality violation is raised.

When we finally get T-SQL up to SQL-92 specs, you could use row
constructors and have written:

UPDATE Test
SET ROW(a,b)
= (SELECT AVG(q), SUM(q) FROM FooWHERE Foo.p = Test.y);

Jul 20 '05 #6

Erland Sommarskog

Steven An (st******@uclink.berkeley.edu) writes:

Well, thank you for that very insightful answer. Yes, that ROW
operator is a lot like what I had in mind. I was looking for
something like SET (a,b) = (select ..). I will try that, and see
which one is faster. I have a feeling your method will be MUCH
faster, since unlike the JOIN, it does not require that I calculate
avg's and sum's for ALL rows, only the ones I want. Thank you!

Actually, my experience is that on SQL Server

UPDATE tbl
SET col = (SELECT SUM(x)
FROM tbl2
WHERE tbl2.keycol1 = tbl.keycol)

usually performs less well than:

UPDATE tbl
SET col = x.sum2
FROM tbl a
JOIN (SELECT keycol1, sum2 = SUM(x) FROM tbl2 GROUP BY keycol1) b
ON a.keycol = b.keycol1

The derived table, is only logical and SQL Server does not have to
compute all sums, even if the looks so. We had one case in our
system where a colleague rewrote from the first syntax to second,
and execution time fell from two minutes to a few seconds. But

If you have more than one column to update, using a derived table is
almost a guaranteed winner.

Of course, this is all up to the optimizer, and if you need to know
for the specific case, you have to benchmark.

--
Erland Sommarskog, SQL Server MVP, es****@sommarskog.se

Books Online for SQL Server SP3 at
http://www.microsoft.com/sql/techinf...2000/books.asp

Jul 20 '05 #7

Similar topics

1387

Aggregate functions with GROUP BY Clauses on DataTable?

by: Job Lot | last post by:

Is it possible to use Aggregate functions with GROUP BY Clauses on DataTable. I have a DataTable with following values: Date Amount Int Balance 1/1/2004 5000.00 50.00 5050.00...

.NET Framework

7034

Writing your own aggregate functions

by: jc | last post by:

Hi. Just as we have AVG(COLUMN_NAME) and MAX(COLUMN_NAME) how can I write my own variation of a such a function. If I can appreciate how to do this, then I hopefully I can write a...

Microsoft SQL Server

11905

Clue for writing user-defined aggregate functions

by: neb | last post by:

Dear member of the forum, Ms access has built-in aggregate function like: -Sum, Max, First, Avg, ... Is it possible to build user-defined aggregate? (if you have any clue, do not hesitate to...

Microsoft Access / VBA

3695

More aggregate functions?

by: David Garamond | last post by:

What do people think of adding some more aggregate functions. These are the ones that MySQL has and PG doesn't: - STD/STDDEV - VARIANCE - BIT_OR - BIT_AND - GROUP_CONCAT (for strings, added...

PostgreSQL Database

3591

multiple inheritance

by: Mark | last post by:

why doesn't .NET support multiple inheritance? I think it's so silly! Cheers, Mark

C# / C Sharp

4867

Strategic Functional Migration and Multiple Inheritance

by: Shawnk | last post by:

Some Sr. colleges and I have had an on going discussion relative to when and if C# will ever support 'true' multiple inheritance. Relevant to this, I wanted to query the C# community (the...

C# / C Sharp

3113

Aggregate UDFs

by: jefftyzzer | last post by:

The current issue of "Oracle Magazine" has an article on creating custom aggregate functions, which naturally got me thinking about how to do this in DB2. I found some articles on creating...

DB2 Database

8456

Alternative to domain aggregate functions

by: Dean | last post by:

Has anyone toiled with creating/using alternate domain aggregate functions? I have been messing with that a little. The one recordsource I have been working indicates I get 20 to 40% savings in...

Microsoft Access / VBA

3953

Multiple base classes in .NET

by: Larry Smith | last post by:

I just read a blurb in MSDN under the C++ "ref" keyword which states that: "Under the CLR object model, only public single inheritance is supported". Does this mean that no .NET class can ever...

C# / C Sharp

7202

What is ONU?

by: marktang | last post by:

ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...

General

7084

Changing the language in Windows 10

by: Hystou | last post by:

Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...

Windows Server

7278

Problem With Comparison Operator <=> in G++

by: Oralloy | last post by:

Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...

C / C++

7328

Maximizing Business Potential: The Nexus of Website Design and Digital Marketing

by: jinu1996 | last post by:

In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...

Online Marketing

5578

AI Job Threat for Devs

by: agi2029 | last post by:

Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing,...

Career Advice

5013

Access Europe - Using VBA to create a class based on a table - Wed 1 May

by: isladogs | last post by:

The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new...

Microsoft Access / VBA

4672

Couldn’t get equations in html when convert word .docx file to html file in C#.

by: conductexam | last post by:

I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and...

C# / C Sharp

3167

Trying to create a lan-to-lan vpn between two differents networks

by: TSSRALBI | last post by:

Hello I'm a network technician in training and I need your help. I am currently learning how to create and manage the different types of VPNs and I have a question about LAN-to-LAN VPNs. The...

Networking - Hardware / Configuration

736

How to add payments to a PHP MySQL app.

by: muto222 | last post by:

How can i add a mobile payment intergratation into php mysql website.

PHP