Unable to start HADR reason code 7

I'm having trouble getting HADR to work with the sample databases on
two HS20 xSeries blades, Red Hat ES4 up3, DB2 8.2.4, getting the
following error.

SQL1768N Unable to start HADR. Reason code = "7" - The primary
database failed to establish a connection to its standby database
within the HADR timeout interval.

What should I check besides the remote host and remote service
parameters on the standby database, which seem to be correct? Each of
the servers can see the other via ping, etc. I have successfully set up
HADR on a single server.

thanks
dub

May 31 '06 #1

I assume you have already started HADR on the standby database first, before
you started HADR on the primary. If so, try logging on to the standby server
and activating the standby database:

db2 activate database sample

Then retry starting HADR on primary.

May 31 '06 #2
Yes, I think the control center runs the following commands anyway. And
if I activate the standby it says it is already activated. Here are the
final commands run.

-- Start HADR on standby database
--
DEACTIVATE DATABASE SAMPLE
START HADR ON DATABASE SAMPLE AS STANDBY
--
-- Start HADR on primary database
--
DEACTIVATE DATABASE SAMPLE
START HADR ON DATABASE SAMPLE AS PRIMARY

Just to clarify, I have successfully set up HADR between 2 different
databases on the same server using the control center GUI. My problem
is between databases on two different servers. I have tried the manual
command method and the control center, both with the same results.

Using the control center commands

Standby diag file ends with

2006-05-31-16.24.26.101725-240 E476637G362 LEVEL: Event
PID : 27068 TID : 3086558912 PROC : db2hadrs
(SAMPLE) 0
INSTANCE: sample NODE : 000 DB : SAMPLE
FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrSetHdrState,
probe:10000
CHANGE : HADR state set to S-RemoteCatchupPending (was S-LocalCatchup)

2006-05-31-16.24.25.999932-240 I477000G398 LEVEL: Warning
PID : 27057 TID : 3086558912 PROC : db2agent
(SAMPLE) 0
INSTANCE: sample NODE : 000 DB : SAMPLE
APPHDL : 0-14 APPID: *LOCAL.sample.060531202426
FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrEduStartup,
probe:21152
MESSAGE : Info: HADR Startup has completed.

Primary diag file ends with

2006-05-31-16.24.32.714718 +600 E128512G336 LEVEL: Event
PID : 9575 TID : 3085870784 PROC : db2hadrp
(SAMPLE) 0
INSTANCE: sample NODE : 000 DB : SAMPLE
FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrSetHdrState,
probe:10000
CHANGE : HADR state set to P-Boot (was None)

2006-05-31-16.24.32.719416 +600 I128849G318 LEVEL: Warning
PID : 9575 TID : 3085870784 PROC : db2hadrp
(SAMPLE) 0
INSTANCE: sample NODE : 000 DB : SAMPLE
FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrEduP,
probe:20301
MESSAGE : Info: Primary Started.

2006-05-31-16.26.18.769577 +600 I129489G321 LEVEL: Event
PID : 5376 TID : 2947414960 PROC : db2hmon
INSTANCE: sample NODE : 000
FUNCTION: DB2 UDB, Automatic Table Maintenance, db2HmonEvalStats,
probe:900
STOP : Automatic Runstats: evaluation has finished on database
SAMPLE

2006-05-31-16.26.33.712145 +600 I129811G571 LEVEL: Error
PID : 9575 TID : 3085870784 PROC : db2hadrp
(SAMPLE) 0
INSTANCE: sample NODE : 000 DB : SAMPLE
FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrEduP,
probe:20390
MESSAGE : HADR primary did not establish connection with standby within timeout
          and will shut down. BY FORCE option required to start primary without
          standby. Timeout seconds =
DATA #1 : Hexdump, 4 bytes
0x12C13A3C : 7800 0000 x...

2006-05-31-16.26.33.712399 +600 I130383G418 LEVEL: Error
PID : 9575 TID : 3085870784 PROC : db2hadrp
(SAMPLE) 0
INSTANCE: sample NODE : 000 DB : SAMPLE
FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrEduP,
probe:20390
RETCODE : ZRC=0x8280001A=-2105540582=HDR_ZRC_NO_STANDBY
"Comm time-out in unforced HADR primary start, to avoid
split-brain"

2006-05-31-16.26.33.712573 +600 I130802G319 LEVEL: Warning
PID : 9575 TID : 3085870784 PROC : db2hadrp
(SAMPLE) 0
INSTANCE: sample NODE : 000 DB : SAMPLE
FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrEduP,
probe:20302
MESSAGE : Info: Primary Finished.

2006-05-31-16.26.33.712704 +600 I131122G422 LEVEL: Error
PID : 9575 TID : 3085870784 PROC : db2hadrp
(SAMPLE) 0
INSTANCE: sample NODE : 000 DB : SAMPLE
FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrEduEntry,
probe:21100
RETCODE : ZRC=0x8280001A=-2105540582=HDR_ZRC_NO_STANDBY
"Comm time-out in unforced HADR primary start, to avoid
split-brain"
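
A side note on the hexdump in the primary's log above: the four bytes printed after "Timeout seconds =" can be read as a 32-bit integer. The sketch below assumes little-endian byte order (reasonable for x86 blades, but still an assumption), and the helper name is made up for illustration.

```python
import struct


def decode_hexdump_u32(dump: str) -> int:
    """Decode a 4-byte db2diag hexdump such as '7800 0000' as a
    little-endian unsigned 32-bit integer (assumes an x86 host)."""
    raw = bytes.fromhex(dump.replace(" ", ""))
    if len(raw) != 4:
        raise ValueError("expected exactly 4 bytes")
    return struct.unpack("<I", raw)[0]


print(decode_hexdump_u32("7800 0000"))  # 120
```

Read this way, the value is 120 seconds. That fits the log itself: "Primary Started" is stamped 16.24.32 and the timeout error 16.26.33, about two minutes later. So the primary appears to have waited the full timeout without ever hearing from the standby.
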
Any assistance greatly appreciated

cheers
dub

May 31 '06 #3

Can you post your db config parms (HADR section only) on both primary and
standby databases?

Also, post output from "db2level" and the OS you are using.
May 31 '06 #4
From the snippets of diag log shown, it appears that the standby was
not able to establish a socket connection with the primary (primary
listens, standby connects). It seems fairly common that this is not
correctly configured on the first attempt. We've seen issues with
incorrect HADR parameters, DNS problems, failure to properly set up
service names, and inability to correctly map across a NAT.

Double check that your HADR comms parameters mesh up correctly (each
side properly refers to itself in LOCAL params and to the other in
REMOTE params).

HADR_LOCAL_HOST
HADR_LOCAL_SVC
HADR_REMOTE_HOST
HADR_REMOTE_SVC

Ensure that your service names are registered and/or use IP addresses.
Try using fully-specified network naming (a.b.c.d) for host names if
you haven't already.
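
The "mesh up" check described above can be sketched in a few lines of Python. This is only an illustration: the function name is hypothetical, the dictionaries are typed in by hand from `db2 get db cfg` output, and the comparison is a plain string match, so it cannot tell whether a short name and a fully qualified name point at the same machine.

```python
def hadr_mismatches(primary, standby):
    """Compare the HADR comms parameters of two databases.

    Each argument is a dict of db cfg values. Returns a list of
    human-readable problems; an empty list means each side's REMOTE
    parameters match the other side's LOCAL parameters.
    """
    problems = []
    for local_key, remote_key in (
        ("HADR_LOCAL_HOST", "HADR_REMOTE_HOST"),
        ("HADR_LOCAL_SVC", "HADR_REMOTE_SVC"),
    ):
        for a_name, a, b_name, b in (
            ("primary", primary, "standby", standby),
            ("standby", standby, "primary", primary),
        ):
            if a[local_key] != b[remote_key]:
                problems.append(
                    f"{a_name} {local_key}={a[local_key]!r} but "
                    f"{b_name} {remote_key}={b[remote_key]!r}"
                )
    return problems


# Values as posted later in this thread for the two blades.
dozer = {"HADR_LOCAL_HOST": "dozer", "HADR_LOCAL_SVC": "DB2_HADR_1",
         "HADR_REMOTE_HOST": "tank", "HADR_REMOTE_SVC": "DB2_HADR_2"}
tank = {"HADR_LOCAL_HOST": "tank", "HADR_LOCAL_SVC": "DB2_HADR_2",
        "HADR_REMOTE_HOST": "dozer", "HADR_REMOTE_SVC": "DB2_HADR_1"}

print(hadr_mismatches(dozer, tank))  # []
```

An empty list only says the names line up as text. Name resolution, service-name lookup and firewalls still have to be checked separately.
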

HTH.

Regards,
- Steve P.
--
Steve Pearson, IBM DB2 UDB for LUW Development, IBM Software Group
DB2 "Portland" Team, IBM Beaverton Lab, Beaverton, OR, USA

May 31 '06 #5
Your ability to get this working on a single system indicates that you
have the knowledge to do this from the database perspective.

A common cause of problems when going from one system to two systems,
especially with Linux, is the requirement to pass through the firewalls.
Make sure they are configured to allow the HADR ports to pass traffic.
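
Ping succeeding says nothing about whether a TCP port gets through, so a direct connection test is more telling. A minimal Python sketch follows; the function name and the three result labels are my own, not anything DB2 provides. Note that nothing listens on the HADR port until HADR has been started on that side, so run it while the standby (or primary) is waiting.

```python
import socket


def probe_tcp(host, port, timeout=3.0):
    """Try a TCP connection and classify the outcome.

    'open'    : something accepted the connection
    'refused' : host reachable, but nothing is listening on the port
    'timeout' : no answer at all, which often points at a firewall
                silently dropping the packets
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "open"
    except ConnectionRefusedError:
        return "refused"
    except socket.timeout:
        return "timeout"
    except OSError as exc:  # e.g. the host name does not resolve
        return f"error: {exc}"


# Example call (host and port taken from later posts in this thread):
#   probe_tcp("dozer", 55001)
```

A "timeout" from one blade to the other, with "open" when run locally on the listening blade, would be consistent with the firewall explanation, though it is not proof.
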

Phil Sherman

May 31 '06 #6

He is using the GUI interface. I was able to configure HADR on a local
Windows box with the GUI, but not with remote Linux boxes. Using command
line configuration scripts on remote Linux boxes worked fine.
Jun 1 '06 #7
I'm running Red Hat ES4 up3

[sample@tank ~]$ uname -r
2.6.9-34.ELsmp

[sample@tank ~]$ db2level
DB21085I Instance "sample" uses "32" bits and DB2 code release "SQL08024" with
level identifier "03050106".
Informational tokens are "DB2 v8.1.2.104", "s060120", "MI00152", and FixPak
"11".
Product is installed at "/opt/IBM/db2/V8.1".

STANDBY - tank
HADR database role                                      = STANDARD
HADR local host name                  (HADR_LOCAL_HOST) = tank
HADR local service name                (HADR_LOCAL_SVC) = DB2_HADR_2
HADR remote host name                (HADR_REMOTE_HOST) = dozer
HADR remote service name              (HADR_REMOTE_SVC) = DB2_HADR_1
HADR instance name of remote server  (HADR_REMOTE_INST) = sample
HADR timeout value                       (HADR_TIMEOUT) = 120
HADR log write synchronization mode     (HADR_SYNCMODE) = NEARSYNC

PRIMARY - dozer
HADR database role                                      = STANDARD
HADR local host name                  (HADR_LOCAL_HOST) = dozer
HADR local service name                (HADR_LOCAL_SVC) = DB2_HADR_1
HADR remote host name                (HADR_REMOTE_HOST) = tank
HADR remote service name              (HADR_REMOTE_SVC) = DB2_HADR_2
HADR instance name of remote server  (HADR_REMOTE_INST) = sample
HADR timeout value                       (HADR_TIMEOUT) = 120
HADR log write synchronization mode     (HADR_SYNCMODE) = NEARSYNC

/etc/services
# Local services
DB2_sample 60000/tcp
DB2_sample_1 60001/tcp
DB2_sample_2 60002/tcp
DB2_sample_END 60003/tcp
DB2_HADR_1 55001/tcp
DB2_HADR_2 55002/tcp
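
Since the db cfg refers to the HADR ports by service name, both names need to resolve on both blades. As far as I understand it, each side looks up its REMOTE service name in its own services file. Here is a small Python sketch that parses services-style text into a lookup table; the function name is hypothetical and the sample text is just the listing above.

```python
def parse_services(text):
    """Map service name -> (port, protocol) from /etc/services-style text.

    Comments (after '#') and malformed lines are ignored.
    """
    table = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()
        if not line:
            continue
        fields = line.split()
        if len(fields) < 2 or "/" not in fields[1]:
            continue
        port, proto = fields[1].split("/", 1)
        if port.isdigit():
            table[fields[0]] = (int(port), proto)
    return table


SAMPLE = """\
# Local services
DB2_sample 60000/tcp
DB2_sample_1 60001/tcp
DB2_sample_2 60002/tcp
DB2_sample_END 60003/tcp
DB2_HADR_1 55001/tcp
DB2_HADR_2 55002/tcp
"""

services = parse_services(SAMPLE)
print(services["DB2_HADR_1"], services["DB2_HADR_2"])
```

On a live system, `socket.getservbyname("DB2_HADR_1", "tcp")` asks the resolver directly. Running either check on tank and on dozer and comparing the results would show whether one blade is missing an entry.
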

Currently doing some more tests with and without the GUI on the Linux
boxes.

Jun 1 '06 #8
Are there any requirements for the servers to be cataloged? I mean, on
the primary, should there be a catalog/node entry (not sure of the
correct terms) pointing to the standby? And likewise, should there be an
entry on the standby pointing to the primary?

Should those entries show up when running the command db2 list node
directory?

Jun 1 '06 #9

This may not help, but I would use the port number in your db config, and
not the service name (but leave the service names in the /etc/services).

I assume the database names are the same on primary and standby (not
specified in your post above).

When the HADR database role is STANDARD, then that means that HADR has not
been started. So manually "start HADR on db xxxxxxx as standby" (on tank),
and then (if successful) "start HADR on db xxxxxxx as primary" (on dozer).
You must start HADR on the standby first.

If the above does not work, then you should check the ports (at the OS
level) to make sure no one else is using 55001 and 55002. The recommended
HADR ports start with 18819 (although I have no reason why, and don't know
if this matters).

A useful monitoring tool of the current HADR status without the GUI is to
take a database snapshot (refer to HADR section):
db2 get snapshot for database on xxxxxxxx

As I said previously, I was not able to get the GUI to work for HADR on
Linux, but there are very few commands needed to get it working, so it is
easy to script from the command line.
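
To illustrate how little there is to script, here is a Python sketch that only builds the ordered list of commands, standby first. It does not run anything; the function name and the (host, command) layout are my own invention, and the commands themselves are the ones already quoted in this thread.

```python
def hadr_start_plan(db, standby_host, primary_host):
    """Return (host, command) pairs in the order they should be run.

    The standby must be started before the primary; otherwise the
    primary times out waiting for its partner.
    """
    return [
        (standby_host, f"db2 deactivate database {db}"),
        (standby_host, f"db2 start hadr on database {db} as standby"),
        (primary_host, f"db2 deactivate database {db}"),
        (primary_host, f"db2 start hadr on database {db} as primary"),
        # Check the HADR section of the snapshot afterwards.
        (primary_host, f"db2 get snapshot for database on {db}"),
    ]


for host, command in hadr_start_plan("SAMPLE", "tank", "dozer"):
    print(f"[{host}] {command}")
```

How the commands actually get executed on each host (ssh, a job scheduler, typing them in) is left open on purpose.
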
Jun 1 '06 #10
