473,569 Members | 2,676 Online
Bytes | Software Development & Data Engineering Community
+ Post

Home Posts Topics Members FAQ

failover cluster using 64 bit windows 2k3 on single HBAs and DS400 with dual controller

We are trying to setup a system to system failover cluster using two
nodes (x346) which each have a single hba running to seperate
controllers on the DS400.

For full redundnancy, IBM recommends dual path from each node but we
dont need that. The current setup has two completly seperate paths. hba
on node 1 to controller A on DS400 and hba on node 2 to controller B.
If i take a controller offline, failover works fine to jumo to other
controller and throw all resources to it's node but if i shutdown a
node- the cluster loses all attached storage and DS400 is unaware to
switch ownership to other controller.

Is there a way to us mscs without dual path from each node?
anotherwords... if either node or controller fails on a single path, we
want the other path to become active.

our main goal is to use sql server 2005 clustering on the cluster.
everything checks out perfect if i only use one controller on the DS400
for both nodes but this brings us back to another single point of
failure.

I saw that Qlogic has MPIO drivers on thir website for the DS400 but it
seems as though they are for 32bit systems and the install errors out
with:

C:\Drivers\mpio \1.0.8.4 (w32)>install.e xe -i
Pre-Installing the Multi-Path Adapter Filter...
Success
Installing the Multi-Path Bus Driver...
Failure. Error code (0xe0000235)

configuration:
2 X IBM x346 w/ single QLogic 2340 HBAs running win2k3 64bit Enterprise
DS400 w/ dual controllers

Apr 12 '06 #1
5 3096
<us******@gmail .com> wrote in message
news:11******** **************@ e56g2000cwe.goo glegroups.com.. .
We are trying to setup a system to system failover cluster using two
nodes (x346) which each have a single hba running to seperate
controllers on the DS400.

For full redundnancy, IBM recommends dual path from each node but we
dont need that. The current setup has two completly seperate paths. hba
on node 1 to controller A on DS400 and hba on node 2 to controller B.
If i take a controller offline, failover works fine to jumo to other
controller and throw all resources to it's node but if i shutdown a
node- the cluster loses all attached storage and DS400 is unaware to
switch ownership to other controller.

Is there a way to us mscs without dual path from each node?
anotherwords... if either node or controller fails on a single path, we
want the other path to become active.


It sounds like you're trying to persuade the DS400 to control your failover
action. You're making a LUN available to one node, and when a failure occurs
you're expecting the DS400 to switch ownership of that LUN to the other node
so it can proceed. That's not what you want. You want both nodes to see and
share the LUN(s) on the DS400 at all times. Mscs will then figure out
between the two nodes which one will access the LUN.

Rob
Apr 12 '06 #2
logically that would make sense that MSCS would be responsible for
everything.
however.... both nodes are able to see the storage but can only read
the drives when the respective controller is the active one.

both initiators have access to all the LUNs on the storage. both HBAs
have access to all LUNs

It sounds like you're trying to persuade the DS400 to control your failover
action. You're making a LUN available to one node, and when a failure occurs
you're expecting the DS400 to switch ownership of that LUN to the other node
so it can proceed. That's not what you want. You want both nodes to see and
share the LUN(s) on the DS400 at all times. Mscs will then figure out
between the two nodes which one will access the LUN.

Rob


Apr 12 '06 #3
<us******@gmail .com> wrote in message
news:11******** *************@i 40g2000cwc.goog legroups.com...
logically that would make sense that MSCS would be responsible for
everything.
however.... both nodes are able to see the storage but can only read
the drives when the respective controller is the active one.

both initiators have access to all the LUNs on the storage. both HBAs
have access to all LUNs


The DS400 wasn't certified for MSCS when it was initially introduced. If you
have a model from before mid-2005 then you may need to update firmware or
contact IBM about the exact features required to make it work with MSCS. The
latest firmware is available from Adaptec's website:
http://www.adaptec.com/ibm/downloads...ems_index.html

Rob
Apr 12 '06 #4
logically that would make sense that MSCS would be responsible for
everything.
however.... both nodes are able to see the storage but can only read
the drives when the respective controller is the active one.

both initiators have access to all the LUNs on the storage. both HBAs
have access to all LUNs

It sounds like you're trying to persuade the DS400 to control your failover
action. You're making a LUN available to one node, and when a failure occurs
you're expecting the DS400 to switch ownership of that LUN to the other node
so it can proceed. That's not what you want. You want both nodes to see and
share the LUN(s) on the DS400 at all times. Mscs will then figure out
between the two nodes which one will access the LUN.

Rob


Apr 12 '06 #5
OK,

we have now added to the configuration to provide multipaths to both
nodes from both controllers.

Each node now has two HBAs with with connections to both controllers.
It seems as though everything is working as expected with failover
occuring system to system if the node fails and also controller to
controller if the controller fails.

When I do a failover from system to system, it works flawlessly.
When i Do a failover from controller to controller however, the active
node seems to kick in fine when the resources are bak up and available
but shows an error in taskbar and event log saying:

windowsDelayed Write Failed: Windows was unable to save all the data
for the file M:\ The data has been lost. This error may be caused by a
failure of your computer hardware or network connection. Please try to
save this file elsewhere.

Since this cluster is being used for a SQL Server 2005 cluster, losing
data is not something we would like to do. The controllers have 256
battery backup memory on them. Since this is the case, are the
controllers taking care of this issue and windows is just not aware of
it or do we actually have an issue where we might lose data?

Apr 18 '06 #6

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

1
5870
by: Nico de Groot | last post by:
I have a 2 node Microsoft 2000 cluster with a shared storage device. I want to create automatic failover for MS SQL 2000 server. I can do that wit the following options: 1. Active/Pasive (one instance installed) 2. Active/Active (More then one instance/virtual sql servers installed) I have a question about option 2.
1
4276
by: kumar | last post by:
Dear Friends, I wanted to configure Failover cluster for SQL Server 2000 on Windows 2000 advanced servers. I have only 2 no.s of windows 2000 advanced server m/cs. I dont have any shared disks and SCSI drives. Can I still configure the Failover clustering? I would like to know more about the need for the shared disks in the
1
2469
by: pthawley | last post by:
All, I am doing some preliminary research on HA failover capabilities on DB2/UDB 8.1. The docs are a little light on details so I thought I'd ask this forum ...
1
2840
by: WildHare | last post by:
I am looking for documentation on state server failover. There is a microsoft article which sort of talks about protecting against a failed state server by using SQL server...but it about as clear as mud. http://msdn.microsoft.com/library/default.asp?url=/library/en-us/cpguide/htm l/cpconsessionstate.asp which suggests that you can...
1
3029
by: news.microsoft.com | last post by:
Is there an elegant way to clear out the ADO connection pool used by the SQL Server State Service following a cluster failover? We are using a SQL Cluster to store both the ASP Session State database and our application database. When the SQL Server cluster is failed-over, any "pointers" to the live connections stored in the connection...
8
4131
by: arandhawa1 | last post by:
Does anyone know how to correctly setup the Administration DB for fail over clustering on a MSCS server running DB2. Problem I have when I follow IBM's docs is that I install the Tools catalog on the first node but when I failover the cluster to the second node - DB2's Task Center no longer works since the Tools Catalog still points to...
1
2420
by: Mark D Powell | last post by:
I have a production failover cluster running SQL Server 2000 at SP3 that I want to upgrade to SP4. I do not have a test failover cluster to test with so I need the install on the primary server to work the first time. Per the information I have I just install the patch on the primary server and it will install both on the primary and on the...
2
2666
by: Jay Balapa | last post by:
Hello, We have two Windows 2003 servers one running IIS and another for SQL Server. We are at a stage where we need to load balance and provide a failover. Can you show us a technical document which can help us do that? Thanks. -jay
1
1385
by: astrohorse26 | last post by:
We are running on SQL 2000 on Windows 2003. We have active-passive clustering set up. We have 16 GB of RAM on each box. This past weekend, we failed over. I noticed that the Total Server Memory went from about 15.5 GB to 8 GB. I have been trying to figure out why this has occurred. Any help would be greatly appreciated.
0
7697
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However, people are often confused as to whether an ONU can Work As a Router. In this blog post, we’ll explore What is ONU, What Is Router, ONU & Router’s main...
0
7924
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers, it seems that the internal comparison operator "<=>" tries to promote arguments from unsigned to signed. This is as boiled down as I can make it. ...
0
8120
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven tapestry of website design and digital marketing. It's not merely about having a website; it's about crafting an immersive digital experience that...
0
7968
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each protocol has its own unique characteristics and advantages, but as a user who is planning to build a smart home system, I am a bit confused by the...
0
6283
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing, and deployment—without human intervention. Imagine an AI that can take a project description, break it down, write the code, debug it, and then...
1
5512
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new presenter, Adolph Dupré who will be discussing some powerful techniques for using class modules. He will explain when you may want to use classes...
0
5219
by: conductexam | last post by:
I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and then checking html paragraph one by one. At the time of converting from word file to html my equations which are in the word document file was convert...
0
3653
by: TSSRALBI | last post by:
Hello I'm a network technician in training and I need your help. I am currently learning how to create and manage the different types of VPNs and I have a question about LAN-to-LAN VPNs. The last exercise I practiced was to create a LAN-to-LAN VPN between two Pfsense firewalls, by using IPSEC protocols. I succeeded, with both firewalls in...
1
2113
by: 6302768590 | last post by:
Hai team i want code for transfer the data from one system to another through IP address by using C# our system has to for every 5mins then we have to update the data what the data is updated we have to send another system

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.