473,327 Members | 2,081 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,327 software developers and data experts.

EEE partitions process performance are different

Hi All,
I have DB2 EEE 7.2 12 db partitions running on AIX 5.1ML5 P690( one
server). The AIX workload manager is on. I started the same peoplesoft
job on all 12 nodes at the same time. But some node(partition)
processing speed are much faster than the others. The fastest node
took 15 hours. But the slowest node took 17 hours.

According to DBA, the data are evenly disributed on 12 partitions. The
configuration of the partitions are same. All the job have same
priority on the OS. The jobs used all the CPU resource (reached WLM
CPU hard limit). I think maybe the partition were compete the CPU
resource which cause the partition speed were different.

Could anyone give me some advice why the DB2 nodes cannot share the
system resource evenly? Why some nodes faster?

Thanks.

John
Nov 12 '05 #1
6 2280
Jack Li wrote:
Hi All,
I have DB2 EEE 7.2 12 db partitions running on AIX 5.1ML5 P690( one
server). The AIX workload manager is on. I started the same peoplesoft
job on all 12 nodes at the same time. But some node(partition)
processing speed are much faster than the others. The fastest node
took 15 hours. But the slowest node took 17 hours.

According to DBA, the data are evenly disributed on 12 partitions. The
configuration of the partitions are same. All the job have same
priority on the OS. The jobs used all the CPU resource (reached WLM
CPU hard limit). I think maybe the partition were compete the CPU
resource which cause the partition speed were different.

Could anyone give me some advice why the DB2 nodes cannot share the
system resource evenly? Why some nodes faster?

Thanks.

John


Assume that the average duration is 16 hours. Maximum dispersion is 1/16
or <7%. You're complaining about +|- 7% duration of "identical" tasks in
a multi-tasking environment? I wouldn't be surprised to see that much
dispersion of data that's "evenly" distributed. Even if the data is
exactly divided between the partitions; I/O effects could account for at
least that much difference between the fastest and slowest node.

Phil Sherman

Nov 12 '05 #2
>
Assume that the average duration is 16 hours. Maximum dispersion is 1/16
or <7%. You're complaining about +|- 7% duration of "identical" tasks in
a multi-tasking environment? I wouldn't be surprised to see that much
dispersion of data that's "evenly" distributed. Even if the data is
exactly divided between the partitions; I/O effects could account for at
least that much difference between the fastest and slowest node.

Phil Sherman


I also suspect the I/O effects the nodes performance. For the first
time run, I found the containers of last finished 4 partitions are
located at the same LSS of Shark ( That means they are in same SSA
loop). This may impact the performance. Looks like I almost get the
anwser. But the secound time run, I found the job finish sequence is
different. The slowest 2 nodes on the first test are move to top 3
fastest nodes on the second time test. And the winner of the first
test become to last one on the second test. Looks like the nodes
process speed are random.

Thanks.
Nov 12 '05 #3
What is the number of CPUs ?
What do you mean - I started the same peoplesoft job on all 12 nodes at the
same time? Where do you kick off the peoplesoft job? On the same P690 or on
another application server? (Sorry, I have no ideal about peoplesoft
software.) If it is the same physical node, would you tell us more detailed
information? For example, you setup DB2NODE variable in order to kick off
the job on the node you specified.


"Jack Li" <li******@yahoo.com> wrote in message
news:d5**************************@posting.google.c om...

Assume that the average duration is 16 hours. Maximum dispersion is 1/16
or <7%. You're complaining about +|- 7% duration of "identical" tasks in
a multi-tasking environment? I wouldn't be surprised to see that much
dispersion of data that's "evenly" distributed. Even if the data is
exactly divided between the partitions; I/O effects could account for at
least that much difference between the fastest and slowest node.

Phil Sherman


I also suspect the I/O effects the nodes performance. For the first
time run, I found the containers of last finished 4 partitions are
located at the same LSS of Shark ( That means they are in same SSA
loop). This may impact the performance. Looks like I almost get the
anwser. But the secound time run, I found the job finish sequence is
different. The slowest 2 nodes on the first test are move to top 3
fastest nodes on the second time test. And the winner of the first
test become to last one on the second test. Looks like the nodes
process speed are random.

Thanks.

Nov 12 '05 #4
"Fan Ruo Xin" <fa*****@sbcglobal.net> wrote in message news:<1h*****************@newssvr28.news.prodigy.c om>...
What is the number of CPUs ?
What do you mean - I started the same peoplesoft job on all 12 nodes at the
same time? Where do you kick off the peoplesoft job? On the same P690 or on
another application server? (Sorry, I have no ideal about peoplesoft
software.) If it is the same physical node, would you tell us more detailed
information? For example, you setup DB2NODE variable in order to kick off
the job on the node you specified.


I ran the peoplesoft jobs on one physical P690 server 32 CPUs, 100GB
memory. The Workload manager is turn on and setup the CPU hard limit
to 44% for this application class. We setup 12 db2nodes on this
server. The peoplesoft application engine jobs are called by shell
scirpt. The 12 jobs are exactly same just running on different db2
node to process data seperately.
Nov 12 '05 #5

"Jack Li" <li******@yahoo.com> wrote in message
news:d5*************************@posting.google.co m...
"Fan Ruo Xin" <fa*****@sbcglobal.net> wrote in message

news:<1h*****************@newssvr28.news.prodigy.c om>...
What is the number of CPUs ?
What do you mean - I started the same peoplesoft job on all 12 nodes at the same time? Where do you kick off the peoplesoft job? On the same P690 or on another application server? (Sorry, I have no ideal about peoplesoft
software.) If it is the same physical node, would you tell us more detailed information? For example, you setup DB2NODE variable in order to kick off the job on the node you specified.


I ran the peoplesoft jobs on one physical P690 server 32 CPUs, 100GB
memory. The Workload manager is turn on and setup the CPU hard limit
to 44% for this application class. We setup 12 db2nodes on this
server. The peoplesoft application engine jobs are called by shell
scirpt. The 12 jobs are exactly same just running on different db2
node to process data seperately.


I see. Then do you know why the DBA decide to use 12 db partitions? Do you
also use LPAR, separate the db2 system and peoplesoft system to use its own
dedicated system resources? Is peoplesoft process single thread or
multi-thread?
Nov 12 '05 #6
>
I see. Then do you know why the DBA decide to use 12 db partitions? Do you
also use LPAR, separate the db2 system and peoplesoft system to use its own
dedicated system resources? Is peoplesoft process single thread or
multi-thread?

I know the db partition should base on CPU numbers( 1 or 2 CPU per
partition ). But this environment is used to simulate production
server configuration. So the partition number didn't match the CPU
number. We don't use lpar on this P690. The db2 and peoplesoft system
are on same server. I don't know too much about peoplesoft. I guess
the peoplesoft application engine is multi-thread.
Folloing the description of peoplesoft application engine."In
PeopleSoft Application Engine, a program is a set of SQL statements,
PeopleCode, and program control actions that enable looping and
conditional logic."

Thanks.
Nov 12 '05 #7

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

7
by: Steve | last post by:
This post has two parts. First is my feedback on sets. (Hello? Last summer called, they want their discussion thread back...) Second is some questions about my implementation of a partition...
5
by: Nick J Chackowsky | last post by:
Wrote a python script to find the partitions of an integer (a list of all the ways you can express n as a sum of integers). For example, the partitions of 5 are 5 4+1 3+2 2+2+1 3+1+1 2+1+1+1...
77
by: Charles Law | last post by:
Hi guys I have a time critical process, running on a worker thread. By "time critical", I mean that certain parts of the process must be completed in a specific time frame. The time when the...
3
by: jcgeorge | last post by:
I have a Windows DPF (v8.2.2) environment. 2 Nodes 5 Partitions Server1 - Cat (0) Data (1) Data (2) Server2 - Data (3) Data (4) I want to use block-based IO, but I do not want the same size...
4
by: Chad Crowder | last post by:
I've taken a look at this article http://msdn.microsoft.com/library/default.asp?url=/library/en-us/dnaspnet/html/asp12282000.asp which someone posted a month or so ago regarding setting up SQL...
8
by: arunrocks | last post by:
Hi I am having a requirement to create a db in 2 out of 8 partitiones. I have the following doubts. 1. should I create a new instance in 2 partitions alone (the present instance spans 8 nodes)...
0
isladogs
by: isladogs | last post by:
The next Access Europe meeting will be on Wednesday 6 Mar 2024 starting at 18:00 UK time (6PM UTC) and finishing at about 19:15 (7.15PM). In this month's session, we are pleased to welcome back...
1
isladogs
by: isladogs | last post by:
The next Access Europe meeting will be on Wednesday 6 Mar 2024 starting at 18:00 UK time (6PM UTC) and finishing at about 19:15 (7.15PM). In this month's session, we are pleased to welcome back...
0
by: Vimpel783 | last post by:
Hello! Guys, I found this code on the Internet, but I need to modify it a little. It works well, the problem is this: Data is sent from only one cell, in this case B5, but it is necessary that data...
0
by: ArrayDB | last post by:
The error message I've encountered is; ERROR:root:Error generating model response: exception: access violation writing 0x0000000000005140, which seems to be indicative of an access violation...
1
by: CloudSolutions | last post by:
Introduction: For many beginners and individual users, requiring a credit card and email registration may pose a barrier when starting to use cloud servers. However, some cloud server providers now...
1
by: Defcon1945 | last post by:
I'm trying to learn Python using Pycharm but import shutil doesn't work
1
by: Shællîpôpï 09 | last post by:
If u are using a keypad phone, how do u turn on JavaScript, to access features like WhatsApp, Facebook, Instagram....
0
by: Faith0G | last post by:
I am starting a new it consulting business and it's been a while since I setup a new website. Is wordpress still the best web based software for hosting a 5 page website? The webpages will be...
0
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 3 Apr 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome former...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.