Virtuel-Counting-Row

walexand

I use the database mysql v.4.

My problem is...
I have a select like:
select * from user where language = "de";

the result are then:
id name
===================
1 max
5 tim
99 otto

so and what i need is a virtuell row (pos) with a counting index of the result:
id name pos
=================== ===
1 max 1
5 tim 2
99 otto 3

how can i make this with mysql?
My first idea was to made that with views but mysql can't do that!

MFG Alexander W.

Jul 20 '05 #1

Subscribe Post Reply

1692

Siemel Naran

<wa******@web.de> wrote in message

I have a select like:
select * from user where language = "de";

the result are then:
id name
===================
1 max
5 tim
99 otto

so and what i need is a virtuell row (pos) with a counting index of the result: id name pos
=================== ===
1 max 1
5 tim 2
99 otto 3

how can i make this with mysql?
My first idea was to made that with views but mysql can't do that!

There might be some special function, I don't know for sure. But in 5.0
they will have cursors, and in the present version you can always do it in
C++ and other languages.

I can think of trying an extra table, but it doesn't seem like the best
solution.

create table temp_product (
id int(1) unsigned auto_increment,
product_id int(1) unsigned not null references product(id),
primary key (id)
);

....

delete from temp_product;

insert into temp_product (product_id)
select id from product
order by name desc;

select temp_product.id, product.*
from product inner join temp_product on product.id =
temp_product.product_id;

Jul 20 '05 #2

Siemel Naran

<wa******@web.de> wrote in message

I have a select like:
select * from user where language = "de";

the result are then:
id name
===================
1 max
5 tim
99 otto

so and what i need is a virtuell row (pos) with a counting index of the result: id name pos
=================== ===
1 max 1
5 tim 2
99 otto 3

how can i make this with mysql?
My first idea was to made that with views but mysql can't do that!

Jul 20 '05 #3

Mike Chirico

<wa******@web.de> wrote in message
news:b1**************************@posting.google.c om...

I use the database mysql v.4.

My problem is...
I have a select like:
select * from user where language = "de";

the result are then:
id name
===================
1 max
5 tim
99 otto

so and what i need is a virtuell row (pos) with a counting index of the result: id name pos
=================== ===
1 max 1
5 tim 2
99 otto 3

Assume the following table:

CREATE TABLE student (
s_id int(11) NOT NULL auto_increment,
name varchar(15),
PRIMARY KEY (s_id)
);

Add some rows and delete (just to convice you this works in all cases
independent of the primary key)

insert into student (name) values ('tom');
insert into student (name) values ('bob');
insert into student (name) values ('sue');
insert into student (name) values ('mike');
insert into student (name) values ('joe');
insert into student (name) values ('zoe');
insert into student (name) values ('harpo');

delete from student where name = 'bob';

Now, note mc is the row count...independent of s_id;
select a.name,sum(1) as mc
from student a, student b
where a.s_id <= b.s_id
group by a.s_id, a.name order by mc;
+-------+------+

| name | mc |
+-------+------+
| harpo | 1 |
| zoe | 2 |
| joe | 3 |
| mike | 4 |
| sue | 5 |
| tom | 6 |
+-------+------+
6 rows in set (0.00 sec)

Reference (TIP 22)
http://osdn.dl.sourceforge.net/sourc...ADME_mysql.txt

Hope this helps,

Mike Chirico

Jul 20 '05 #4

Mike Chirico

<wa******@web.de> wrote in message
news:b1**************************@posting.google.c om...

I use the database mysql v.4.

My problem is...
I have a select like:
select * from user where language = "de";

the result are then:
id name
===================
1 max
5 tim
99 otto

so and what i need is a virtuell row (pos) with a counting index of the result: id name pos
=================== ===
1 max 1
5 tim 2
99 otto 3

Jul 20 '05 #5

Siemel Naran

"Mike Chirico" <mc******@comcast.net> wrote in message
news:L66dncvnK4TEdAfdRVn-

select a.name,sum(1) as mc
from student a, student b
where a.s_id <= b.s_id
group by a.s_id, a.name order by mc;
That's pretty cool!
Reference (TIP 22)
http://osdn.dl.sourceforge.net/sourc...ADME_mysql.txt

Jul 20 '05 #6

Siemel Naran

"Mike Chirico" <mc******@comcast.net> wrote in message
news:L66dncvnK4TEdAfdRVn-

select a.name,sum(1) as mc
from student a, student b
where a.s_id <= b.s_id
group by a.s_id, a.name order by mc;
That's pretty cool!
Reference (TIP 22)
http://osdn.dl.sourceforge.net/sourc...ADME_mysql.txt

Jul 20 '05 #7

Mike Chirico

"Siemel Naran" <Si*********@REMOVE.att.net> wrote in message
news:Vw********************@bgtnsc05-news.ops.worldnet.att.net...

"Mike Chirico" <mc******@comcast.net> wrote in message
news:L66dncvnK4TEdAfdRVn-
select a.name,sum(1) as mc
from student a, student b
where a.s_id <= b.s_id
group by a.s_id, a.name order by mc;

That's pretty cool!
Reference (TIP 22)
http://osdn.dl.sourceforge.net/sourc...ADME_mysql.txt

It's also possible to compute running aggregates. Assume the following
table
and data:

CREATE TABLE dice (
d_id int(11) NOT NULL auto_increment,
roll int,
PRIMARY KEY (d_id)
);

insert into dice (roll) values (1);
insert into dice (roll) values (2);
insert into dice (roll) values (3);
insert into dice (roll) values (4);
insert into dice (roll) values (5);
insert into dice (roll) values (6);
The following gives a running, cumulative, sum:

mysql> select a.roll, sum(b.roll) from dice a, dice b
where b.d_id <= a.d_id group by a.d_id, a.roll;

+------+-------------+

| roll | sum(b.roll) |
+------+-------------+
| 1 | 1 |
| 2 | 3 |
| 3 | 6 |
| 4 | 10 |
| 5 | 15 |
| 6 | 21 |
+------+-------------+
6 rows in set (0.00 sec)
To get a running average change the
sum to avg as follows:
mysql> select a.roll, avg(b.roll) from dice a, dice b
where b.d_id <= a.d_id group by a.d_id, a.roll;

+------+-------------+
| roll | avg(b.roll) |
+------+-------------+
| 1 | 1.0000 |
| 2 | 1.5000 |
| 3 | 2.0000 |

| 4 | 2.5000 |
| 5 | 3.0000 |
| 6 | 3.5000 |
+------+-------------+
6 rows in set (0.00 sec)
Regards,

Mike Chirico

Reference (TIP 23)
http://osdn.dl.sourceforge.net/sourc...ADME_mysql.txt

Jul 20 '05 #8

Mike Chirico

"Siemel Naran" <Si*********@REMOVE.att.net> wrote in message
news:Vw********************@bgtnsc05-news.ops.worldnet.att.net...

"Mike Chirico" <mc******@comcast.net> wrote in message
news:L66dncvnK4TEdAfdRVn-
select a.name,sum(1) as mc
from student a, student b
where a.s_id <= b.s_id
group by a.s_id, a.name order by mc;

That's pretty cool!
Reference (TIP 22)
http://osdn.dl.sourceforge.net/sourc...ADME_mysql.txt

Jul 20 '05 #9

Siemel Naran

"Siemel Naran" <Si*********@REMOVE.att.net> wrote in message
news:VwCmc.27167

"Mike Chirico" <mc******@comcast.net> wrote in message

select a.name,sum(1) as mc
from student a, student b
where a.s_id <= b.s_id
group by a.s_id, a.name order by mc;

Reference (TIP 22)
http://osdn.dl.sourceforge.net/sourc...ADME_mysql.txt

That's pretty cool!

Instead of sum(1) you could use count(*).

What's the running time of the algorithm? Because it selects from 2 tables
it seems a O(N^2). If the system uses the index on s_id it can eliminate
the inner loop and immediately find the records of interest -- for example,
if driving by alias 'a', it can immediately find the records in alias 'b'
meeting the condition in the where clause. But it still has to loop through
these records in order to determine the count(*) or sum(1), because this
informations is (probably) not available in the index. Which makes it
O(N^2) still. Of course, if N is small as in the examples in this thread,
then it doesn't matter.

Jul 20 '05 #10

Siemel Naran

"Siemel Naran" <Si*********@REMOVE.att.net> wrote in message
news:VwCmc.27167

"Mike Chirico" <mc******@comcast.net> wrote in message

select a.name,sum(1) as mc
from student a, student b
where a.s_id <= b.s_id
group by a.s_id, a.name order by mc;

Reference (TIP 22)
http://osdn.dl.sourceforge.net/sourc...ADME_mysql.txt

That's pretty cool!

Jul 20 '05 #11

Mike Chirico

"Siemel Naran" <Si*********@REMOVE.att.net> wrote in message
news:o1********************@bgtnsc04-news.ops.worldnet.att.net...

"Siemel Naran" <Si*********@REMOVE.att.net> wrote in message
news:VwCmc.27167
"Mike Chirico" <mc******@comcast.net> wrote in message
select a.name,sum(1) as mc
from student a, student b
where a.s_id <= b.s_id
group by a.s_id, a.name order by mc;

Reference (TIP 22)
http://osdn.dl.sourceforge.net/sourc...ADME_mysql.txt

That's pretty cool!

Instead of sum(1) you could use count(*).

What's the running time of the algorithm? Because it selects from 2

tables it seems a O(N^2). If the system uses the index on s_id it can eliminate
the inner loop and immediately find the records of interest -- for example, if driving by alias 'a', it can immediately find the records in alias 'b'
meeting the condition in the where clause. But it still has to loop through these records in order to determine the count(*) or sum(1), because this
informations is (probably) not available in the index. Which makes it
O(N^2) still. Of course, if N is small as in the examples in this thread,
then it doesn't matter.

If I add 2,000,000 records into the dice table and take the sum of the last
10 records, which appears to be the worst case, it takes apx 47 seconds.
That's without indexes, and I agree it might not make a difference. For
some reason it doesn't seem to grow O(N^2); but, more like O(n) Log(n). I'm
not sure why.

mysql> select a.roll, sum(b.roll)
from dice a, dice b
where b.d_id <= a.d_id and a.roll > 1999990
and a.roll <= 2000000 group by a.d_id, a.roll;
+---------+---------------+
| roll | sum(b.roll) |
+---------+---------------+
| 1999991 | 1999983000036 |
| 1999992 | 1999985000028 |
| 1999993 | 1999987000021 |
| 1999994 | 1999989000015 |
| 1999995 | 1999991000010 |
| 1999996 | 1999993000006 |
| 1999997 | 1999995000003 |
| 1999998 | 1999997000001 |
| 1999999 | 1999999000000 |
| 2000000 | 2000001000000 |
+---------+---------------+
10 rows in set (46.93 sec)

Or, with count it's about the same.

select a.roll, count(b.roll) from dice a, dice b
where b.d_id <= a.d_id
and a.roll > 1999990 and a.roll <= 2000000 group by a.d_id, a.roll;

+---------+---------------+
| roll | count(b.roll) |
+---------+---------------+
| 1999991 | 1999991 |
| 1999992 | 1999992 |
| 1999993 | 1999993 |
| 1999994 | 1999994 |
| 1999995 | 1999995 |
| 1999996 | 1999996 |
| 1999997 | 1999997 |
| 1999998 | 1999998 |
| 1999999 | 1999999 |
| 2000000 | 2000000 |
+---------+---------------+
10 rows in set (47.51 sec)
Still with 2 million records in the table; but, only selecting between 500
and 510 it takes < 1 sec.

select a.roll, count(b.roll) from dice a, dice b
where b.d_id <= a.d_id
and a.roll > 500 and a.roll <= 510 group by a.d_id, a.roll;
+------+---------------+
| roll | count(b.roll) |
+------+---------------+
| 501 | 501 |
| 502 | 502 |
| 503 | 503 |
| 504 | 504 |
| 505 | 505 |
| 506 | 506 |
| 507 | 507 |
| 508 | 508 |
| 509 | 509 |
| 510 | 510 |
+------+---------------+
10 rows in set (0.80 sec)

Interesting note...creating the text file of insert statements took too long
with a "for loop in a bash script".. I had to use a C program, which took
only 3 seconds

#include <stdio.h>
#include <unistd.h>
#include <sys/types.h>
#include <sys/stat.h>
#include <fcntl.h>
#include <stdlib.h>

#include <string.h>/* for strerror(int errno) */
#include <errno.h>
int main()
{
long int i;
char buf[50+1];
int fd;

if ((fd = open ("data.sql", O_RDWR | O_CREAT, 0600)) == -1)
{
fprintf (stderr, "Can't open data: %s\n", strerror (errno));
return 1;
}
for(i=1; i <= 2000000; ++i)
{
snprintf(buf,50,"insert into dice (roll) values (%ld);\n",i);
write (fd, buf, strlen (buf));
}

close(fd);
return 0;
}
Regards,

Mike Chirico

Jul 20 '05 #12

Mike Chirico

"Siemel Naran" <Si*********@REMOVE.att.net> wrote in message
news:o1********************@bgtnsc04-news.ops.worldnet.att.net...

"Siemel Naran" <Si*********@REMOVE.att.net> wrote in message
news:VwCmc.27167
"Mike Chirico" <mc******@comcast.net> wrote in message
select a.name,sum(1) as mc
from student a, student b
where a.s_id <= b.s_id
group by a.s_id, a.name order by mc;

Reference (TIP 22)
http://osdn.dl.sourceforge.net/sourc...ADME_mysql.txt

That's pretty cool!

Instead of sum(1) you could use count(*).

What's the running time of the algorithm? Because it selects from 2

tables it seems a O(N^2). If the system uses the index on s_id it can eliminate
the inner loop and immediately find the records of interest -- for example, if driving by alias 'a', it can immediately find the records in alias 'b'
meeting the condition in the where clause. But it still has to loop through these records in order to determine the count(*) or sum(1), because this
informations is (probably) not available in the index. Which makes it
O(N^2) still. Of course, if N is small as in the examples in this thread,
then it doesn't matter.

Jul 20 '05 #13

Siemel Naran

"Mike Chirico" <mc******@comcast.net> wrote in message
news:vr2dnefdF_XNUQbdRVn-

If I add 2,000,000 records into the dice table and take the sum of the last 10 records, which appears to be the worst case, it takes apx 47 seconds.
That's without indexes, and I agree it might not make a difference. For
some reason it doesn't seem to grow O(N^2); but, more like O(n) Log(n). I'm not sure why.
Maybe the running time is A*N^2+B*N*lg(N), and only for really large N, like
2 trillion it might be N^2. Just guessing.

Anyway, thanks for the long reply. I am curious as to how you got these SQL
timings. Is there some kind of clock function in MySql?
mysql> select a.roll, sum(b.roll)
from dice a, dice b
where b.d_id <= a.d_id and a.roll > 1999990
and a.roll <= 2000000 group by a.d_id, a.roll;
We should compare the running time to the O(N) algorithm, which probably
means SQL cursors or the temporary table idea, but SQL cursors are only in
in MySql 5.0.

Anyway, it's clear that for most reasonable values, the method is fast.

#include <fcntl.h>

[OT] Is the above UNIX specific, and defines the open function?

Jul 20 '05 #14

Siemel Naran

"Mike Chirico" <mc******@comcast.net> wrote in message
news:vr2dnefdF_XNUQbdRVn-

If I add 2,000,000 records into the dice table and take the sum of the last 10 records, which appears to be the worst case, it takes apx 47 seconds.
That's without indexes, and I agree it might not make a difference. For
some reason it doesn't seem to grow O(N^2); but, more like O(n) Log(n). I'm not sure why.
Maybe the running time is A*N^2+B*N*lg(N), and only for really large N, like
2 trillion it might be N^2. Just guessing.

Anyway, thanks for the long reply. I am curious as to how you got these SQL
timings. Is there some kind of clock function in MySql?
mysql> select a.roll, sum(b.roll)
from dice a, dice b
where b.d_id <= a.d_id and a.roll > 1999990
and a.roll <= 2000000 group by a.d_id, a.roll;
We should compare the running time to the O(N) algorithm, which probably
means SQL cursors or the temporary table idea, but SQL cursors are only in
in MySql 5.0.

Anyway, it's clear that for most reasonable values, the method is fast.

#include <fcntl.h>

[OT] Is the above UNIX specific, and defines the open function?

Jul 20 '05 #15

nospam1001

MySQL has a function you might find helpful:
try
SELECT @m:=0;
then
select @m:=@m+1 AS 'pos', * from user where language = "de";

best and I hope this helps

wa******@web.de wrote:

I use the database mysql v.4.

My problem is...
I have a select like:
select * from user where language = "de";

the result are then:
id name
===================
1 max
5 tim
99 otto

so and what i need is a virtuell row (pos) with a counting index of the result:
id name pos
=================== ===
1 max 1
5 tim 2
99 otto 3

how can i make this with mysql?
My first idea was to made that with views but mysql can't do that!

MFG Alexander W.

Jul 20 '05 #16

Reply-to-group Thx

MySQL has a function you might find helpful:
try
SELECT @m:=0;
then
select @m:=@m+1 AS 'pos', * from user where language = "de";

best and I hope this helps

wa******@web.de wrote:

I use the database mysql v.4.

My problem is...
I have a select like:
select * from user where language = "de";

the result are then:
id name
===================
1 max
5 tim
99 otto

so and what i need is a virtuell row (pos) with a counting index of the result: id name pos
=================== ===
1 max 1
5 tim 2
99 otto 3

how can i make this with mysql?
My first idea was to made that with views but mysql can't do that!

MFG Alexander W.

Jul 20 '05 #17

Similar topics