HP Integrity Superdome SX2000 Cluster Installation and Configuration Guide - W - Page 29

• The NIC and switch redundancy layer is transparent to the IP layer.

• It may use standby, redundant team members to load balance your network traffic and improve performance for transmitted and received packets on the individual cluster node.

• It may use advanced redundancy mechanisms to improve the detection of failures in your network infrastructure and to respond to them proactively. For example, cluster nodes continuously test their connectivity with each other, but they cannot detect path failures when there is an external switch upstream. Active Path Failover is an advanced teaming feature that detects such failures and fails over to a NIC that has a path to an Echo Node device (an external switch upstream).
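The Active Path Failover behavior described above can be illustrated with a minimal Python sketch. This is not the teaming driver's implementation or API. The `TeamMember` class, its fields, and `select_active_member` are hypothetical names introduced only for illustration, and the sketch assumes that some probe has already recorded whether each NIC can reach the Echo Node.

```python
from dataclasses import dataclass


@dataclass
class TeamMember:
    """Hypothetical model of one NIC in a team (not a real driver structure)."""
    name: str
    link_up: bool
    echo_node_reachable: bool  # assumed result of a probe toward the Echo Node


def select_active_member(members, current):
    """Return the name of the NIC that should carry traffic, or None.

    Keep the current NIC while it still has a path to the Echo Node.
    Otherwise fail over to the first team member that has such a path.
    """
    by_name = {m.name: m for m in members}
    active = by_name.get(current)
    if active is not None and active.link_up and active.echo_node_reachable:
        return current
    for member in members:
        if member.link_up and member.echo_node_reachable:
            return member.name
    return None  # no team member currently has a path to the Echo Node


team = [
    TeamMember("nic0", link_up=True, echo_node_reachable=False),
    TeamMember("nic1", link_up=True, echo_node_reachable=True),
]
print(select_active_member(team, current="nic0"))  # nic1
```

Note that `nic0` still has link in this example. A link-state check alone would not trigger a failover, which is the gap the Echo Node test is meant to close.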

To implement NIC teaming in your cluster networks, complete the following steps:

1. Plan your network infrastructure according to the cluster demands, taking into account NIC teaming configuration, redundant switches, routers, and so on.

2. Create the teams planned in the previous step for every cluster node.

3. Validate your cluster configuration.

4. Create your cluster.

For more information about NIC teaming issues in clustered environments, see the following document:

http://support.microsoft.com/kb/254101

Troubleshooting the Cluster

What to Do if Validation Tests Fail

In most cases, if any test in the cluster validation wizard fails, Microsoft does not consider the solution to be supported. There are exceptions to this rule, such as multi-site (geographically dispersed) clusters where there is no shared storage. In this scenario, the storage tests are expected to fail. This is still a supported solution if the remainder of the tests complete successfully.

The type of test that fails indicates the corrective action to take. For example, if the storage test "List all disks" fails, and subsequent storage tests do not run (because these would also fail), contact the storage vendor to troubleshoot. Similarly, if a network test related to IP addresses fails, consult your network infrastructure team. Most warnings or errors are resolved by working with internal teams or with a specific hardware vendor.
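As a rough illustration of this guideline, the following Python sketch maps the category of a failed validation test to the party named in the paragraph above. The function name `suggest_contact` and the category strings are assumptions made for this example. They are not output of the validation wizard.

```python
def suggest_contact(failed_test_category):
    """Map a failed test category to whom to consult (illustrative only)."""
    contacts = {
        "storage": "storage vendor",
        "network": "network infrastructure team",
    }
    # Any other category falls back to the general advice in the text.
    return contacts.get(
        failed_test_category.strip().lower(),
        "internal teams or the specific hardware vendor",
    )


print(suggest_contact("Storage"))  # storage vendor
```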

After the issues have been resolved, rerun the cluster validation wizard. For the configuration to be considered supported, all tests must run and complete without failures.

Validation Issues for Multi-site or Geographically Dispersed Failover Clusters

Failover cluster solutions that do not have a common shared disk and instead leverage data replication between nodes might not pass the cluster validation "storage" tests. This is a common configuration in cluster solutions where nodes are stretched across geographic regions. If a cluster solution does not require external storage to fail over from one node to another, it does not need to pass the "storage" tests to be a fully supported solution.
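The support rule and its multi-site exception can be summarized in a short Python sketch. This is a reading of the rule as stated in this section, not a Microsoft tool. The function `is_supported`, the `uses_shared_storage` flag, and the `(category, test name, passed)` tuples are hypothetical conventions chosen for the example.

```python
def is_supported(results, uses_shared_storage=True):
    """Apply the support rule from this section to validation results.

    results: iterable of (category, test_name, passed) tuples.
    A failed test makes the configuration unsupported, except that
    failed storage tests are tolerated when the cluster does not rely
    on shared storage (the multi-site exception).
    """
    for category, _test_name, passed in results:
        if passed:
            continue
        if category == "storage" and not uses_shared_storage:
            continue  # expected failure for a multi-site cluster
        return False
    return True


multi_site_results = [
    ("network", "Validate IP Configuration", True),
    ("storage", "List all disks", False),
]
print(is_supported(multi_site_results, uses_shared_storage=False))  # True
print(is_supported(multi_site_results, uses_shared_storage=True))   # False
```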

For more information on multi-site or geographically dispersed clusters, see the following white paper:

http://go.microsoft.com/fwlink/?LinkId=112125

Troubleshooting

See the following documents for more information about troubleshooting errors and interpreting system event descriptions in clusters:


