472,952 Members | 2,205 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 472,952 software developers and data experts.

failover cluster using 64 bit windows 2k3 on single HBAs and DS400 with dual controller

We are trying to setup a system to system failover cluster using two
nodes (x346) which each have a single hba running to seperate
controllers on the DS400.

For full redundnancy, IBM recommends dual path from each node but we
dont need that. The current setup has two completly seperate paths. hba
on node 1 to controller A on DS400 and hba on node 2 to controller B.
If i take a controller offline, failover works fine to jumo to other
controller and throw all resources to it's node but if i shutdown a
node- the cluster loses all attached storage and DS400 is unaware to
switch ownership to other controller.

Is there a way to us mscs without dual path from each node?
anotherwords... if either node or controller fails on a single path, we
want the other path to become active.

our main goal is to use sql server 2005 clustering on the cluster.
everything checks out perfect if i only use one controller on the DS400
for both nodes but this brings us back to another single point of
failure.

I saw that Qlogic has MPIO drivers on thir website for the DS400 but it
seems as though they are for 32bit systems and the install errors out
with:

C:\Drivers\mpio\1.0.8.4 (w32)>install.exe -i
Pre-Installing the Multi-Path Adapter Filter...
Success
Installing the Multi-Path Bus Driver...
Failure. Error code (0xe0000235)

configuration:
2 X IBM x346 w/ single QLogic 2340 HBAs running win2k3 64bit Enterprise
DS400 w/ dual controllers

Apr 12 '06 #1
5 3066
<us******@gmail.com> wrote in message
news:11**********************@e56g2000cwe.googlegr oups.com...
We are trying to setup a system to system failover cluster using two
nodes (x346) which each have a single hba running to seperate
controllers on the DS400.

For full redundnancy, IBM recommends dual path from each node but we
dont need that. The current setup has two completly seperate paths. hba
on node 1 to controller A on DS400 and hba on node 2 to controller B.
If i take a controller offline, failover works fine to jumo to other
controller and throw all resources to it's node but if i shutdown a
node- the cluster loses all attached storage and DS400 is unaware to
switch ownership to other controller.

Is there a way to us mscs without dual path from each node?
anotherwords... if either node or controller fails on a single path, we
want the other path to become active.


It sounds like you're trying to persuade the DS400 to control your failover
action. You're making a LUN available to one node, and when a failure occurs
you're expecting the DS400 to switch ownership of that LUN to the other node
so it can proceed. That's not what you want. You want both nodes to see and
share the LUN(s) on the DS400 at all times. Mscs will then figure out
between the two nodes which one will access the LUN.

Rob
Apr 12 '06 #2
logically that would make sense that MSCS would be responsible for
everything.
however.... both nodes are able to see the storage but can only read
the drives when the respective controller is the active one.

both initiators have access to all the LUNs on the storage. both HBAs
have access to all LUNs

It sounds like you're trying to persuade the DS400 to control your failover
action. You're making a LUN available to one node, and when a failure occurs
you're expecting the DS400 to switch ownership of that LUN to the other node
so it can proceed. That's not what you want. You want both nodes to see and
share the LUN(s) on the DS400 at all times. Mscs will then figure out
between the two nodes which one will access the LUN.

Rob


Apr 12 '06 #3
<us******@gmail.com> wrote in message
news:11*********************@i40g2000cwc.googlegro ups.com...
logically that would make sense that MSCS would be responsible for
everything.
however.... both nodes are able to see the storage but can only read
the drives when the respective controller is the active one.

both initiators have access to all the LUNs on the storage. both HBAs
have access to all LUNs


The DS400 wasn't certified for MSCS when it was initially introduced. If you
have a model from before mid-2005 then you may need to update firmware or
contact IBM about the exact features required to make it work with MSCS. The
latest firmware is available from Adaptec's website:
http://www.adaptec.com/ibm/downloads...ems_index.html

Rob
Apr 12 '06 #4
logically that would make sense that MSCS would be responsible for
everything.
however.... both nodes are able to see the storage but can only read
the drives when the respective controller is the active one.

both initiators have access to all the LUNs on the storage. both HBAs
have access to all LUNs

It sounds like you're trying to persuade the DS400 to control your failover
action. You're making a LUN available to one node, and when a failure occurs
you're expecting the DS400 to switch ownership of that LUN to the other node
so it can proceed. That's not what you want. You want both nodes to see and
share the LUN(s) on the DS400 at all times. Mscs will then figure out
between the two nodes which one will access the LUN.

Rob


Apr 12 '06 #5
OK,

we have now added to the configuration to provide multipaths to both
nodes from both controllers.

Each node now has two HBAs with with connections to both controllers.
It seems as though everything is working as expected with failover
occuring system to system if the node fails and also controller to
controller if the controller fails.

When I do a failover from system to system, it works flawlessly.
When i Do a failover from controller to controller however, the active
node seems to kick in fine when the resources are bak up and available
but shows an error in taskbar and event log saying:

windowsDelayed Write Failed: Windows was unable to save all the data
for the file M:\ The data has been lost. This error may be caused by a
failure of your computer hardware or network connection. Please try to
save this file elsewhere.

Since this cluster is being used for a SQL Server 2005 cluster, losing
data is not something we would like to do. The controllers have 256
battery backup memory on them. Since this is the case, are the
controllers taking care of this issue and windows is just not aware of
it or do we actually have an issue where we might lose data?

Apr 18 '06 #6

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

1
by: Nico de Groot | last post by:
I have a 2 node Microsoft 2000 cluster with a shared storage device. I want to create automatic failover for MS SQL 2000 server. I can do that wit the following options: 1. Active/Pasive (one...
1
by: kumar | last post by:
Dear Friends, I wanted to configure Failover cluster for SQL Server 2000 on Windows 2000 advanced servers. I have only 2 no.s of windows 2000 advanced server m/cs. I dont have any shared...
1
by: pthawley | last post by:
All, I am doing some preliminary research on HA failover capabilities on DB2/UDB 8.1. The docs are a little light on details so I thought I'd ask this forum ...
1
by: WildHare | last post by:
I am looking for documentation on state server failover. There is a microsoft article which sort of talks about protecting against a failed state server by using SQL server...but it about as clear...
1
by: news.microsoft.com | last post by:
Is there an elegant way to clear out the ADO connection pool used by the SQL Server State Service following a cluster failover? We are using a SQL Cluster to store both the ASP Session State...
8
by: arandhawa1 | last post by:
Does anyone know how to correctly setup the Administration DB for fail over clustering on a MSCS server running DB2. Problem I have when I follow IBM's docs is that I install the Tools catalog...
1
by: Mark D Powell | last post by:
I have a production failover cluster running SQL Server 2000 at SP3 that I want to upgrade to SP4. I do not have a test failover cluster to test with so I need the install on the primary server to...
2
by: Jay Balapa | last post by:
Hello, We have two Windows 2003 servers one running IIS and another for SQL Server. We are at a stage where we need to load balance and provide a failover. Can you show us a technical...
1
by: astrohorse26 | last post by:
We are running on SQL 2000 on Windows 2003. We have active-passive clustering set up. We have 16 GB of RAM on each box. This past weekend, we failed over. I noticed that the Total Server Memory...
0
by: Mushico | last post by:
How to calculate date of retirement from date of birth
2
by: DJRhino | last post by:
Was curious if anyone else was having this same issue or not.... I was just Up/Down graded to windows 11 and now my access combo boxes are not acting right. With win 10 I could start typing...
0
by: Aliciasmith | last post by:
In an age dominated by smartphones, having a mobile app for your business is no longer an option; it's a necessity. Whether you're a startup or an established enterprise, finding the right mobile app...
0
tracyyun
by: tracyyun | last post by:
Hello everyone, I have a question and would like some advice on network connectivity. I have one computer connected to my router via WiFi, but I have two other computers that I want to be able to...
4
NeoPa
by: NeoPa | last post by:
Hello everyone. I find myself stuck trying to find the VBA way to get Access to create a PDF of the currently-selected (and open) object (Form or Report). I know it can be done by selecting :...
3
NeoPa
by: NeoPa | last post by:
Introduction For this article I'll be using a very simple database which has Form (clsForm) & Report (clsReport) classes that simply handle making the calling Form invisible until the Form, or all...
1
by: Teri B | last post by:
Hi, I have created a sub-form Roles. In my course form the user selects the roles assigned to the course. 0ne-to-many. One course many roles. Then I created a report based on the Course form and...
0
isladogs
by: isladogs | last post by:
The next Access Europe meeting will be on Wednesday 1 Nov 2023 starting at 18:00 UK time (6PM UTC) and finishing at about 19:15 (7.15PM) Please note that the UK and Europe revert to winter time on...
0
NeoPa
by: NeoPa | last post by:
Introduction For this article I'll be focusing on the Report (clsReport) class. This simply handles making the calling Form invisible until all of the Reports opened by it have been closed, when it...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.