Compaq ProLiant 5500 Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator
Compaq ProLiant 5500 Manual
View all Compaq ProLiant 5500 manuals
Add to My Manuals
Save this manual to your list of manuals |
Compaq ProLiant 5500 manual content summary:
- Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 1
HA/F100 and HA/F200 Administrator Guide Second Edition (September 1999) Part Number 380362-002 Compaq Computer Corporation Compaq Confidential - Need to Know Required Writer: Linda Arnold Project: Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Comments: Part Number: 380362-002 File - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 2
Clusters HA/F100 and HA/F200 Administrator Guide Second Edition (September 1999) Part Number 380362-002 Compaq Confidential - Need to Know Required Writer: Linda Arnold Project: Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Comments: Part Number: 380362-002 File Name: a-frnt.doc - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 3
HA/F100 and HA/F200 Overview of Compaq ProLiant Clusters HA/F100 and HA/F200 Components ...... 1-1 Compaq ProLiant Cluster HA/F100 1-3 Compaq ProLiant Cluster HA/F200 1-5 Compaq ProLiant Servers 1-7 Compaq StorageWorks RAID Array 4000 Storage System 1-7 Compaq StorageWorks RAID Array 4000 - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 4
Administrator Guide Architecture of the Compaq ProLiant Clusters HA/F100 and HA/F200 continued Compaq Software ...1-14 Compaq SmartStart and Support Software CD 1-15 Compaq Redundancy Manager (Fibre Channel 1-16 Compaq Cluster Verification Utility 1-16 Compaq Insight Manager 1-17 Compaq Insight - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 5
Manager 5-15 Cluster-Specific Features of Compaq Insight Manager 5-16 Compaq Insight Manager XE 5-17 Cluster Monitor 5-18 Compaq Confidential - Need to Know Required Writer: Linda Arnold Project: Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Comments: Part Number: 380362-002 - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 6
Capacity B-8 Other Functions B-9 Troubleshooting Redundancy Manager B-9 Overview...B-10 Informational Messages B-10 Compaq Confidential - Need to Know Required Writer: Linda Arnold Project: Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Comments: Part Number: 380362-002 File - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 7
Redundancy Manager B-16 Troubleshooting Potential Problems B-16 Appendix C Software and Firmware Versions Glossary Index Compaq Confidential - Need to Know Required Writer: Linda Arnold Project: Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Comments: Part Number: 380362-002 - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 8
to be used as step-by-step instructions for installation and as a reference for operation, troubleshooting, and future upgrades of the cluster server. This guide provides information about the installation, configuration, and implementation of the Compaq ProLiant Cluster Models HA/F100 and HA/F200 - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 9
and HA/F200," contains high-level troubleshooting information for the Compaq ProLiant Clusters HA/F100 and HA/F200. Compaq Confidential - Need to Know Required Writer: Linda Arnold Project: Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Comments: Part Number: 380362-002 File Name - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 10
Enter key. Enter When you are instructed to enter information, type the information and then press the Enter key. Compaq Confidential - Need to Know Required Writer: Linda Arnold Project: Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Comments: Part Number: 380362-002 File Name - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 11
or specific instructions. NOTE: Text set off in this manner presents commentary, sidelights, or interesting points of information. Getting Help If you have a problem and have exhausted the information in this guide, you can get further information and other help in the following locations. Compaq - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 12
to specific hardware and software components of the Compaq ProLiant Clusters HA/F100 and HA/F200, including, but not limited to, the following: s Documentation related to the ProLiant servers you are clustering (for example, manuals, posters, and performance and tuning guides) s Compaq RA4000 - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 13
1 Chapter Architecture of the Compaq ProLiant Clusters HA/F100 and HA/F200 Overview of Compaq ProLiant Clusters HA/F100 and HA/F200 Components A cluster is a loosely coupled collection of servers and storage that acts as a single system, presents a single-system image to clients, provides protection - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 14
1-2 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Compaq ProLiant Clusters HA/F100 and HA/F200 platforms are composed of the following. Hardware: s Compaq ProLiant servers s Compaq StorageWorks RAID Array 4000 Storage System (formerly Compaq Fibre Channel Storage System) q Compaq - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 15
Channel cable q Ethernet crossover cable q Network (LAN) cable The Compaq ProLiant Cluster HA/F100 includes these software solution components: s Microsoft Windows NT Server 4.0 Enterprise Edition s Compaq SmartStart and Support Software CD s Compaq Support Software Diskette for Windows NT (NT SSD - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 16
components of the Compaq ProLiant Cluster HA/F100 The Compaq ProLiant Cluster HA/F100 configuration is a cluster with a Compaq StorageWorks RAID Array 4000, a single Compaq StorageWorks Fibre Channel Storage Hub (7- or 12-port), two Compaq ProLiant servers (nodes), a single Compaq StorageWorks Fibre - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 17
Channel cable q Ethernet crossover cable q Network (LAN) cable The Compaq ProLiant Cluster HA/F200 includes these software solution components: s Microsoft Windows NT Server 4.0 Enterprise Edition s Compaq SmartStart and Support Software CD s Compaq Support Software Diskette for Windows NT (NT SSD - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 18
1-6 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide NOTE: See Appendix C, "Software and Firmware Versions," for the necessary software version levels for your cluster. The following illustration depicts the basic HA/F200 configuration. RA4000 storage hub storage hub Node 1 - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 19
fans, redundant processor power modules, redundant Network Interface Controller (NIC) support, dual-ported hot-pluggable 10/100 NICs and redundant hot-pluggable power supplies (on most high-end models). Many of these features are available at the low end and mid range of the Compaq ProLiant server - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 20
the same hot-pluggable drives as Compaq Servers and Compaq ProLiant Storage Systems, online capacity expansion, online spares, and RAID fault tolerance of SMART-2 Array Controller technology. The RA4000 also supports hot-pluggable, redundant power supplies and fans, hot-pluggable hard drives, and - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 21
Architecture of the Compaq ProLiant Clusters HA/F100 and HA/F200 1-9 Compaq StorageWorks Fibre Channel Storage Hubs The servers in a Compaq ProLiant Cluster HA/F100 and HA/F200 are connected to one or more Compaq StorageWorks Raid Array 4000 shared external storage systems using industry-standard - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 22
1-10 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Compaq StorageWorks Fibre Channel Host Adapter Compaq StorageWorks Fibre Channel Host Adapters (host bus adapters) are the interface between the server and the RA4000 storage system. At least two host bus adapters (PCI or EISA), - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 23
Architecture of the Compaq ProLiant Clusters HA/F100 and HA/F200 1-11 Cluster Interconnect Two types there are three options: Dedicated Interconnect Using an Ethernet Crossover Cable: An Ethernet crossover cable (supplied in both the HA/F100 and HA/F200 kits) can be used to connect the NICs directly - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 24
1-12 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Cluster Interconnect The cluster interconnect is a data stand-alone server configuration. Because clients desiring the full advantage of the cluster will now connect to the cluster rather than to a specific server, configuring - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 25
a Windows NT Cluster," available from the Compaq High Availability website (http://www.compaq.com/highavailability). Interconnect Adapters Ethernet adapters, or Compaq ServerNet adapters, can be used for the interconnect between the servers in a Compaq ProLiant Cluster. Either 10Mb/sec, or 100Mb/sec - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 26
Clusters HA/F100 and HA/F200 Administrator Guide Microsoft Software Microsoft Windows NT Server 4.0/Enterprise Edition (Windows NTS/E) is the operating system for the Compaq ProLiant Clusters HA/F100 and HA/F200. Microsoft Cluster Server (MSCS) is part of Windows NTS/E. As the core component - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 27
Support Software CD Compaq SmartStart is located on the SmartStart and Support Software CD shipped with ProLiant servers. SmartStart is the recommended way to configure the Compaq ProLiant Cluster HA/F100 or HA/F200. SmartStart uses a step-by-step process to configure the cluster and load the system - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 28
1-16 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Fibre Channel Fault Isolation Utility (FFIU) The SmartStart and Support Software CD also contains the Fibre Channel Fault Isolation Utility (FFIU). The FFIU verifies the integrity of a new or existing FC-AL installation. This - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 29
Management Desktop (IMD), Remote Insight (optional controller), and SmartStart. In Compaq servers, each hardware subsystem, such as disk storage, system memory, and system processor, has a robust set of management capabilities. Compaq Full Spectrum Fault Management notifies of impending fault - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 30
1-18 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Compaq Insight Manager XE Compaq Insight Manager XE is a Web-based management system. It can be used in conjunction with Compaq Insight Manager agents as well as its own Web-enabled agents. This browser-based utility provides - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 31
Installation The client/server software applications are among the key components of any cluster. Compaq is working with its key software partners to ensure that cluster-aware applications are available and that the applications work seamlessly on Compaq ProLiant clusters. Compaq provides a number - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 32
2 Chapter Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 Before connecting any cables or powering on any machines, it is important to understand how all of the various cluster components and concepts fit together to meet your information system needs. The major topics discussed in this - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 33
help you design your Compaq ProLiant Cluster so that it addresses your specific availability needs. s system is not continuously available and therefore may have single points of failure. NOTE: The discussion in this chapter relating to single points of failure applies only to the Compaq ProLiant - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 34
Compaq ProLiant Clusters HA/F100 and HA/F200 2-3 An active/active configuration has two primary designs: s The first design uses Microsoft Cluster Server over to each other, ensure that each server has enough capacity, memory, and processor power to run all applications (all applications running on - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 35
2-4 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Example 1: File & Print/File & Print An example business scenario involves two file and print servers. The Human Resources (HR) department uses one server, and the Marketing department uses the other. Both servers actively run - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 36
Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 2-5 Example 2: Database/Database Another resources fail over to their secondary node, the HR database server. The Marketing clients experience a slight disruption of service while the database resources are failed over, the database - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 37
2-6 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide If the node running the order entry database encounters a failure, the database fails over to its secondary node. The order entry clients experience a slight disruption of service while the database resources are failed over, the - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 38
Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 2-7 Example - Database/Standby Server An example business scenario uses a single server to perform queries and calculations on order entry information, translating sales orders into packaging and distribution instructions for the warehouse. - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 39
2-8 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Cluster Groups Understanding the relationship between your company's business functions and cluster groups is essential to getting the most from your cluster. Business functions rely on computer systems to support activities such - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 40
Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 2-9 Resource Dependency Tree Order business function, which consists of two cluster groups: a database server (a Windows NT application) and a Web server (a Windows NT service). NOTE: For this example, it is assumed that each cluster group - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 41
2-10 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide 2. List each application or service required for each business function. Web Sales Order Business Function Web Server Service (Cluster Group #1) Database Server Application (Cluster Group #2) Resource Resource Resource #1 #2 - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 42
Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 2-11 3. List the immediate dependencies for each application (or service). Web Sales Order Business Function Web Server Service (Cluster Group #1) Database Server Application (Cluster Group #2) Network Name Web Server Service Physical - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 43
2-12 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Figure 2-7 illustrates the worksheet for the Web Sales Order Sub Resource 1 Sub Resource 2 Sub Resource 3 Sub Resource 4 Resource #3 Web Server Service Sub Resource 1 Sub Resource 2 Sub Resource 3 Sub Resource 4 Resource #4 - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 44
Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 2-13 Use the resource dependency tree concept to review your company's failure. NOTE: Although not specifically covered in this section, redundant server components (such as power supplies and processor modules) should be used wherever - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 45
2-14 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide MSCS Configuration MSCS allows you to configure a on your hardware. The Compaq Redundant NIC Utility (originally called Advanced Network Fault Detection and Correction Feature) is supported on all Compaq TI-based Ethernet and - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 46
Compaq ProLiant Clusters HA/F100 and HA/F200 2-15 Because the purpose of the redundant interconnect is to increase the availability of the cluster, it is important to monitor the status of your redundant NICs. Compaq Insight Manager and Compaq Clients connected to a virtual server on the cluster (via - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 47
2-16 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Recommended Cluster Communication Strategy The past two sections discussed the redundancy of intracluster and cluster-to-LAN communication. However, to obtain the most benefit while - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 48
Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 2-17 Example 1 A Compaq dual-port NIC and a single-port port NIC is configured as the primary network path for cluster-to-LAN communication. The Compaq Advanced Network Control Utility is used to configure the second port on the dual-port - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 49
2-18 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Example 2 The second example configuration consists of three single-port NICs. One NIC is dedicated to intracluster communication. The other two NICs are used for cluster-to-LAN communication. The Compaq Advanced Network Control - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 50
RAID Array 4000 storage system (formerly Compaq Fibre Channel storage system) is the mechanism with which ProLiant Clusters implement shared storage. Generally, the storage system consists of Compaq StorageWorks Host Adapters (host bus adapters) in each server, a Compaq StorageWorks Fibre Channel - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 51
2-20 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide If the host bus adapter-to-storage hub path fails, it results in a failover of all applications. For instance, if one server can no longer access the storage hub (and by extension the shared storage), all of the cluster groups - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 52
resolve this failure, since neither server can read from a failed drive. NOTE: Windows NT software RAID is not available for shared drives when using MSCS. Hardware RAID is the only available RAID option for shared storage. As with other system failures, Compaq Insight Manager monitors the health - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 53
Loop configuration. The Compaq ProLiant Cluster HA/F200 further enhances high availability through the use of additional, redundant, components in the server-to-storage connection and in the shared storage system itself. In the event of a failure, processing is switched to an alternate path - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 54
Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 2-23 The following illustration depicts the HA/F200 configuration components. RA4000 storage hub storage hub Node 1 Dedicated Interconnect Node 2 L AN Figure 2-12. HA/F200 configuration - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 55
F200 Administrator Guide HA/F200 Fibre Channel Data Paths The Compaq StorageWorks RAID Array 4000 storage system is the mechanism with which the HA/F200 cluster implements shared storage. The Compaq ProLiant Cluster HA/F200 minimum configuration consists of two host bus adapters in each server, two - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 56
the Compaq ProLiant Clusters HA/F100 and HA/F200 2-25 The active data paths run from the active host bus adapters in the servers to the active storage hub. If this path fails, the applications can seamlessly fail over to the standby host bus adapter-to-storage hub data paths. A A S S Server - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 57
Operating System Clustered Applications & Services Non-Clustered Applications & Services Operating System Clustered Applications & Services Non-Clustered Applications & Services Node1 Node2 Figure 2-15. File locations in a Compaq ProLiant Cluster For each server, determine the processor, memory - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 58
the Compaq ProLiant Clusters HA/F100 and HA/F200 2-27 Determine the processor and memory requirements needed to support the clustered applications and services that will run on each node while the cluster is in a normal operating state. If the program files of a clustered application and/or service - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 59
2-28 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide The following table details the capacity requirements that can be applied to either active/active design. Table 2-1 Server Capacity* Requirements for Active/Active Configuration Node1 Node2 Operating system (Windows NT and - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 60
Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 2-29 Shared Storage Capacity Each server is connected to shared storage (the Compaq StorageWorks RAID Array 4000 storage system), which mainly stores data files of clustered applications and services. Follow the guidelines below to determine - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 61
be found in the Compaq TechNote, Planning Considerations for Compaq ProLiant Clusters Using Microsoft Cluster Server, located on the Compaq website (http://www.compaq.com). This capability provides a high level of flexibility in configuring your RA4000 storage system. However, minimize the number - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 62
Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 2-31 Shared Storage Capacity Worksheet Worksheet Disk Resource 1 Disk Resource 2 Description Web files and Web scripts for Web Service Group Log file(s) for Database Required Application Capacity 12 GB 4.3 GB Desired Level of - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 63
of load balancing. One way balances a system's workload across the cluster. The other balances a server's workload across multiple data paths. The dual redundant loop of the Compaq ProLiant Cluster HA/F200 and an added RA4000 storage system spread a system's applications and data across the data - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 64
different storage systems. RA4000 A S RA4000 A S storage hub storage hub A Server S A S Server Figure 2-18. Compaq ProLiant Cluster HA attach to the cluster. If Node1 encounters a failure and its applications and services fail over to Node2, then Node2 needs to handle access from its own - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 65
2-34 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Network Considerations This section addresses clustering items that affect the corporate LAN. MSCS has specific requirements regarding which protocol can be used and how IP address and network name resolution occurs. Additionally, - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 66
Compaq ProLiant Clusters HA/F100 and HA/F200 2-35 DHCP Only use DHCP for the clients; it should not be used for the cluster node IP addresses or cluster resource IP addresses. DHCP cannot be used to assign IP addresses for virtual servers Services The main consideration for file and print services - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 67
2-36 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Connecting to Shared Resources In the traditional, command-driven connection to a shared resource, the user needs to know the server name and the share name. In a clustered environment, the command is changed to reflect the - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 68
move to the other cluster node. Performance monitoring of server loads after a failure should be investigated prior to a full clustered system implementation. You may need additional hardware, such as memory or system processors, to support the additional workload incurred after a failover. It is - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 69
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide You can use the Windows NT Performance Monitor to observe and track system that is part of a problem, set the restart threshold to 0 (zero). If the group will experience severe performance limitations if failed over to a secondary server - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 70
Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 2-39 Failover Threshold and , if a server is providing print services to users, and the printer is directly connected to the parallel port of the server, there is no way to switch the physical connection to the other server, even though - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 71
2-40 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Another example of a direct-connect device is a directly connected mainframe interface. If the first server is directly connected to the mainframe, as through an SDLC (Synchronous Data Link Control) card in the server, there is no - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 72
the Compaq ProLiant Clusters failback. This setting allows the administrator to fail back a group manually. Allow automatic failback. This setting allows MSCS to fail back specific hours of the day during which automatic failback can occur. Refer to the Microsoft Cluster Server Administrator's Guide - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 73
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Group Failover/Failback Policy Use the Group Failover/Failback Policy worksheet to define the failover and failback policies for each cluster group. Figure 2-19 illustrates the failover/failback parameters for the Web Server Service - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 74
for the clustered Compaq ProLiant Servers s Compaq StorageWorks RAID Array 4000 User Guide s Compaq StorageWorks Fibre Channel Host Adapter Installation Guide s Installation guide for the interconnect card of your choice s Compaq SmartStart for Servers Setup Poster s Compaq Insight Manager - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 75
ProLiant Cluster can be described in the following phases: s Preinstallation guidelines s Installing the hardware, including: q Cluster nodes q Compaq StorageWorks RAID Array 4000 storage system q Cluster Interconnect s Installing the software, including: q Compaq SmartStart for Servers q Compaq - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 76
Compaq ProLiant Clusters HA/F100 and HA/F200 3-3 Preinstallation Guidelines When setting up the cluster, you will need to answer each of the following questions. Using the Preinstallation Worksheet in Appendix A, write down the answers to these questions before installing Microsoft Cluster Server - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 77
both be member servers, they can make up their own domain by assigning one as Primary Domain Controller (PDC) and one as Backup Domain Controller (BDC), or they can both be a BDC in an existing Windows NT domain. s One of the utilities the SmartStart CD runs is the Compaq Array Configuration Utility - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 78
Setting Up the Compaq ProLiant Clusters HA/F100 and HA/F200 3-5 s MSCS requires drive letters to remain constant throughout the life of the cluster; therefore, you must assign permanent drive letters to your shared drives. If you are performing manual software installation, use Windows NT Disk - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 79
Channel Host Adapter Follow the installation instructions in your Compaq StorageWorks Fibre Channel Host Adapter Installation Guide and your Compaq ProLiant Server documentation to install the host bus adapter in your servers. Install one adapter per server for the HA/F100 configuration. Install - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 80
the Logical-Physical Slot Numbering Problem," available from the Compaq website (http://www.compaq.com). For specific instructions on how to install an adapter card, refer to the documentation for the interconnect card you are installing or the Compaq ProLiant Server you are using. The cabling - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 81
3-8 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Setting Up the Compaq StorageWorks Raid Array 4000 Storage System Follow the instructions in the Compaq StorageWorks RAID Array 4000 User Guide to set up the RA4000s, the Compaq StorageWorks Fibre Channel Storage Hub 7 or 12, the - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 82
so they can be identified and configured. The cluster must be powered up in the following order: 1. Storage hub (Power is applied when the AC power cord is plugged in) 2. Storage system 3. Servers Configuring Shared Storage The Compaq Array Configuration Utility sets up the hardware aspects of any - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 83
3-10 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Setting Up a Private Interconnect There are four ways to set up a private interconnect. s Ethernet direct connect s Ethernet direct connect using a private hub s ServerNet direct connect s ServerNet direct connect using a switch - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 84
Setting Up the Compaq ProLiant Clusters HA/F100 and HA/F200 3-11 Ethernet Direct Connect Using a Private Hub An Ethernet hub requires standard Ethernet cables; Ethernet crossover cables will not work with a hub. Follow these steps to cable the server interconnect using an Ethernet hub: 1. Connect - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 85
3-12 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Setting Up a Public Interconnect It is possible-but not Recommended Cluster Communication Strategy" section in Chapter 2 of this guide for more information about setting up redundancy for intracluster and cluster-to-LAN - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 86
, storage system, and one server powered up. You need the following during installation: IMPORTANT: Refer to Appendix C for the software and firmware version levels your cluster requires. s Compaq SmartStart for the Servers CD s Compaq SmartStart for Servers Setup Poster s Server Profile Diskette - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 87
Clusters HA/F100 and HA/F200 Administrator Guide Cluster-Specific SmartStart Installation The SmartStart setup poster describes the general flow of configuring and installing software on a single server. The installation for a Compaq ProLiant Cluster HA/F100 and HA/F200 will be very similar. The - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 88
system will reboot and SmartStart will automatically create your system partition. 7. Next, you will be guided through steps to install addition Compaq software and utilities including choosing the NT boot partition size and installing the Compaq Support Software Disk (SSD). Follow the instructions - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 89
3-16 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide 16. Power on storage and wait for drives to spin, then power on the server. 17. If setting up an HA/F200, install Redundancy Manager. To automatically install Redundancy Manager Redundancy Manager: a. Place the Compaq Redundancy - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 90
Setting Up the Compaq ProLiant Clusters HA/F100 and HA/F200 3-17 IMPORTANT: Node2 Exception: Repeat SmartStart Assisted Integration steps 1-11 for Node2. Then proceed to step 17. IMPORTANT: Node1 Exception: Execute step 18 only after Node2 is set up. 18. Run the Compaq Cluster Verification Utility ( - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 91
completes, install Windows NT Service Pack 5. For the latest information on Service Packs, please refer to the release notes. 22. Run the Compaq Support Software Disk (SSD) through the diskettes you created or from the SmartStart CD and verify that all installed drivers are current. 23. Install - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 92
Setting Up the Compaq ProLiant Clusters HA/F100 and HA/F200 3-19 Manual Installation Using SmartStart To perform a manual installation perform the following steps: IMPORTANT: Power off Node2 when setting up Node1 1. Power up the shared storage. Place the SmartStart CD in the CD-ROM drive of the - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 93
3-20 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide 9. Power down the server, insert Options ROMPaq diskette in Node1, and restart the system. IMPORTANT: When updating the firmware on the array controllers, make sure that one server is powered off. IMPORTANT: Node2 Exception: Do - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 94
Pack 5. See the latest release notes for the latest service pack information. 22. Run Compaq Support Software Diskette (SSD) through the diskettes you created or from the SmartStart CD and verify that all installed drivers are current. 23. Install your applications and managing and monitoring - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 95
service to the Windows NT domain administrator user account. 8. Repeat these steps to install the software on the other cluster node. For more specific instructions about using Compaq Intelligent Cluster Administrator, refer to the Compaq Intelligent Cluster Administrator Quick Setup Guide - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 96
in a Compaq ProLiant Cluster environment. Visit to the Compaq High Availability website (http://www.compaq.com/ servers in a fresh state, verify creation of the cluster using the following steps. 1. Shut down and power off both servers. 2. Power off and then power on the RA4000. 3. Power both servers - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 97
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide When Windows NTS/E finishes booting up on both servers, follow these steps to use Microsoft Cluster Administrator to verify creation of the cluster: 1. From the Windows NTS/E desktop on either cluster server troubleshooting - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 98
the Compaq ProLiant Clusters HA/F100 and HA/F200 3-25 3. Make sure all predefined resources and groups are online. Verify that some of the resources and groups are owned by the server you will be powering off, so that a failure event will result in failover of resources and/or groups. 4. Power off - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 99
3-26 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide 5. Use Microsoft Cluster Administrator to perform a manual failover of the cluster group that contains the IP address. 6. After the manual failover completes, execute the ping command again. 7. As soon as the other node brings the - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 100
StorageWorks Fibre Channel Storage Hub s One additional Compaq StorageWorks Fibre Channel Host Adapter (host bus adapter) per server s Compaq Redundancy Manager (Fibre Channel) software s Appropriate firmware and drivers If you already have a Compaq ProLiant Cluster HA/F100 up and running, you do - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 101
(host bus adapter) per server s Compaq SmartStart for Servers CD s Compaq SmartStart for Servers Setup Poster s Server Profile Diskette (included with SmartStart) s Microsoft Windows NT Server 4.0/Enterprise Edition software and documentation s Compaq Redundancy Manager (Fibre Channel) software - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 102
website (http://www.compaq.com/highavailability). 3. Create Support Software Disk (SSD) from the Diskette Builder utility. Run the Support Software Disk (SSD) either from the diskettes or directly from the SmartStart CD. Install the latest Fibre Channel drivers and other server components. Do not - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 103
on service packs, refer to the HA/F200 release notes. 15. Run the Software Support Disk (SSD) through the diskettes you created or from the SmartStart CD, and verify that all installed drivers are current. 16. Install your applications and managing and monitoring software. a. Refer to the Compaq - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 104
details the utilities and programs used in the ongoing management of Compaq ProLiant Clusters HA/F100 and HA/F200. The topics addressed in this chapter include: s Managing a Cluster Without Interrupting Cluster Services s Managing a Cluster in a Degraded Condition s Managing Hardware Components of - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 105
5-2 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide The chapter also details the utilities and programs used in the ongoing management of Compaq ProLiant Clusters HA/F100 and HA/F200. The tools addressed in this chapter include: s Compaq Redundancy Manager (Fibre Channel) s Compaq - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 106
Compaq Insight Manager has been enhanced to operate with the Compaq ProLiant Clusters HA/F100 and HA/F200. Compaq Insight Manager XE allows you to view and manage servers experience some disruption of service and, possibly, a have on the users' information systems needs. When a failover or failback - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 107
5-4 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Managing a Cluster's Shared Storage Compaq Insight Manager and Compaq Insight Manager XE monitors the RAID Array 4000 storage system from both a physical and a logical perspective: s The physical drives and Fibre Channel hardware - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 108
of adding shared storage assumes: s Service Pack 5 for Windows NT 4.0 is applied. s Compaq SmartStart CD 4.10 or later is used to apply system drivers. 1. Power down one of the cluster servers (Node2). 2. Insert the Compaq SmartStart and Support Software CD into the CD-ROM drive of the other cluster - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 109
using a Compaq ProLiant Cluster HA/F200 with redundant paths, be sure to attach both array controllers, one to each hub. 4. Add the additional RA4000 to the storage subsystem. Follow the hardware installation steps detailed in Chapter 3 of this guide. 5. Power on the newly added RA4000. 6. Power on - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 110
the Compaq ProLiant the data on the failed drive. Refer to the Compaq StorageWorks RAID Array 4000 User Guide for instructions on replacing a failed drive. Adding Drives to Increase : s Service Pack 5 for Windows NT 4.0 is applied. s Compaq SmartStart CD 4.10 or later is used to apply system drivers. - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 111
Guide 1. Power down one of the cluster servers (Node2). 2. Insert the Compaq SmartStart and Support Software CD the other cluster server (Node1). Power down Node1. 3. Insert new drives in the RA4000 storage array. IMPORTANT: If using a Compaq ProLiant ACU. Remove SmartStart CD. 6. Boot Windows NT on - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 112
into the new node. 6. If the new node is part of a rack system, place the server in the rack. Attach the interconnect, LAN, Fibre Channel cables, and power cables. If you are using the Windows NT boot drives from the replaced node in the new node, power on the new node and follow the steps described - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 113
5-10 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Installing a New Windows NT Boot Drive New Windows NT boot drives require installation of Windows NT, configuration of the networking components of the new node, and installation of MSCS. Follow the SmartStart Assisted Path - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 114
adapters in a Compaq ProLiant Cluster HA/F200 configuration. This functionality is only available when accessing two separate RAID arrays. The timing of manual load balancing depends on the type of group to be moved and how many clients are using the group. File and print services are normally not - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 115
5-12 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Compaq Redundancy Manager Compaq Redundancy Manager (Fibre Channel) increases the availability of single-server or clustered systems using Compaq StorageWorks RAID Array 4000. Redundancy Manager can detect failures of the Compaq - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 116
Managing the Compaq ProLiant Clusters HA/F100 and HA/F200 5-13 Changing Paths Redundancy Manager allows you to change the active and standby paths for your cluster. The following provide instructions for changing paths. NOTE: Redundancy Manager will not change the configuration until you close the - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 117
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide has happened in the system. Refresh does not affect any processing or interrupt any of the system's functions. Rescan Rescan run on each machine in a cluster. NOTE: Reboot each server to clear the SCSI port after seven hot replaces. 1. - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 118
monitor the ProLiant Cluster and perform cluster administration functions such as starting and stopping an MSCS service, starting or stopping an MSCS node, and starting or stopping the MSCS cluster. Compaq Insight Manager consists of two components: a Windows-based console application and server- or - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 119
5-16 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Cluster-Specific Features of Compaq Insight Manager The following is an overview of the cluster-specific features found in Compaq Insight Manager. NOTE: The term cluster group used in this section refers to Compaq Insight Manager, - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 120
SNMP and DMI V2 systems. Compaq management agents provide health status to either Compaq Insight Manager or Compaq Insight Manager XE. The agents translate data supplied by the device drivers into useful information that assists the user in correctly diagnosing the problem. Compaq Insight Manager or - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 121
5-18 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Compaq Insight Manager XE offers a simple, specific operational performance thresholds that will alert you when these thresholds have been met or exceeded on your application systems. Cluster Monitor relies heavily on the Compaq - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 122
, including MSCS health, processor, bus, disk, or network usage and performance thresholds s A detailed problem definition based on monitored conditions and a proposed resolution to the problem, if one can be determined, with drill down ability to the specific device or system causing a negative or - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 123
5-20 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Cluster Monitor supports these attributes: s Disk space s CPU utilization s MSCS cluster status s Node Environment (Compaq Management Agent) status. Cluster Monitor uses pop-up notifications, alerts in the alert list, colored - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 124
Managing the Compaq ProLiant Clusters HA/F100 and HA/F200 5-21 s Assign resources to groups and nodes s Establish resource dependencies s Assign failover policies for cluster resources s Fail over resources and nodes s Stop and start cluster services Managing Cluster History Using the Cluster - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 125
5-22 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Microsoft Cluster Administrator Microsoft Cluster Administrator manages groups, resources, and the operating state of the cluster. Cluster Administrator gives you the ability to: s View the - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 126
HA/F100 and HA/F200 This chapter addresses problems encountered while installing, configuring, testing, and operating the Compaq ProLiant Clusters HA/F100 and HA/F200. These problems are described in the following troubleshooting categories: s Installation s Node-to-Node s Shared Storage s Client - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 127
6-2 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Installation This section addresses problems encountered during installation. Table 6-1 Solving Installation Problems Problem The error message "RPC Server is Unavailable" is displayed. Possible Cause Name resolution issue. - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 128
Troubleshooting the Compaq ProLiant Clusters HA/F100 and HA/F200 6-3 Table 6-1 Solving Installation Problems continued Problem Possible Cause MSCS installation will not complete on the first node. Insufficient space on nonshared drives for MSCS. Operating system is incorrect or deficient. - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 129
6-4 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Table 6-1 Solving Installation Problems continued Problem Possible Cause Clients do not see the cluster. Clients can only view the virtual servers. Added logical drives are not recognized Windows NT and Redundancy Manager do - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 130
Troubleshooting the Compaq ProLiant Clusters HA/F100 and HA/F200 6-5 Troubleshooting Node-to-Node Problems This section describes problems that may be encountered during server-to-server communication. Table 6-2 Solving Node-to-Node Problems Problem The resources failed over but the nodes do not - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 131
6-6 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Table 6-2 Solving Node-to-Node Problems continued Problem Possible Cause The second node cannot join the cluster. Improper name resolution. Cluster Service is not running. No network connectivity exists. TCP/IP is not - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 132
Troubleshooting the Compaq ProLiant Clusters HA/F100 and HA/F200 6-7 Shared Storage This section addresses problems encountered using the Compaq StorageWorks RAID Array 4000 storage system as a shared storage device. This section does not address RA4000 storage system problems specific to the - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 133
configuration problems. Action 1. Run the Compaq Array Configuration Utility online. This utility can be run online if at least one logical drive is configured and recognized. 2. Run the Compaq Array Configuration Utility offline. Shut down the server(s) and reboot with either the SmartStart CD or - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 134
Troubleshooting the Compaq ProLiant Clusters HA/F100 and HA/F200 6-9 Table 6-3 Solving Shared Storage Problems continued Problem Possible Cause Data on shared storage appears to be overwritten. MSCS may not be loaded and therefore cannot manage access to drive volumes in the - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 135
6-10 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Table 6-3 Solving Shared Storage Problems continued Problem Possible Cause Compaq Redundancy Manager shows cluster in nonredundant mode. You are using ACU to expand capacity. ICL has failed. Mismatched firmware on the array - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 136
Troubleshooting the Compaq ProLiant Clusters HA/F100 and HA/F200 6-11 Table 6-3 Solving Shared Storage Problems continued Problem Possible Cause Windows NT Event Log states: The Host Bus Adapter in slot %1 has averaged more than %2 Soft PCI Errors over the last five seconds. - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 137
6-12 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Client-to-Cluster Connectivity This section addresses problems that may be encountered in cluster-to-LAN communication. NOTE: The cluster is assigned one or more Net BIOS names associated with an IP address. Network clients - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 138
Troubleshooting the Compaq ProLiant Clusters HA/F100 and HA/F200 6-13 Table 6-4 Solving Client-to-Cluster Connectivity Problems continued Problem Possible Cause Clients do not see virtual servers. Virtual servers cache (Nbtstat.exe on the Windows NT CD) to determine whether the name had been - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 139
14 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Table 6-4 Solving Client-to-Cluster Connectivity Problems continued Problem related. resources. Action 1. Manually fail over each of the applications from the primary server to the secondary server. Make sure automatic failback - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 140
Troubleshooting the Compaq ProLiant Clusters HA/F100 and HA/F200 6-15 Table 6-4 Solving Client-to-Cluster Connectivity Problems continued Problem Possible Cause Clients cannot access a group that has failed over. Networking problem over group is a virtual server (that is, the group contains - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 141
6-16 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Cluster Groups and Cluster Resource Microsoft Cluster Administrator solves many group and cluster resource problems. For troubleshooting tips on this topic, refer to the Microsoft Cluster Server Administrator's Guide and Cluster - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 142
Troubleshooting the Compaq ProLiant Clusters HA/F100 and HA/F200 6-17 Troubleshooting Compaq Redundancy Manager The following section addresses system. When a message is displayed, click on Help to receive more details about that particular message. The Event Viewer for Microsoft Windows NT Server - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 143
6-18 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Table 6-6 Compaq Redundancy Manager Informational Messages continued Message Description Action The loop has been locked by another application. Another application has issued a lock management command to an - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 144
Troubleshooting the Compaq ProLiant Clusters HA/F100 and HA/F200 6-19 Table 6-6 Compaq Redundancy Manager Informational Messages assign the path for this logical disk; Or, click Cancel to assign the path manually. You have not selected any Paths A logical disk is claimed but no to - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 145
6-20 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Warning Message This section provides a warning message and actions to take using Redundancy Manager. Message overriding the lock and taking control of the array controller. Reload Redundancy Manager from the CD. continued - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 146
Troubleshooting the Compaq ProLiant Clusters HA/F100 and HA/F200 6-21 Table 6-8 Error Messages . A version of the Compaq Fibre Channel Host Adapter SCSI Miniport Driver (cpqfcalm.sys) is being used that does not support redundancy. The minimum version for redundancy support is VX.X. The current - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 147
6-22 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Other Potential Problems Redundancy Manager displays text messages warning of possible changes and events to the system. When a message is displayed, click on Help to receive more details about that particular message. Table 6-9 - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 148
Overview This appendix contains blank worksheets you can use to design, configure, and install your Compaq ProLiant Cluster HA/F100 or HA/F200. Completed worksheets are illustrated in chapters 2 and 3 of this guide. Copy these worksheets and use as many as necessary to assist you in planning and - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 149
A-2 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Cluster Group Definition Worksheet Complete the Cluster Group Definition worksheet for each business function requiring clustering. Cluster Function Group #1 Group #2 Resource #1 Cluster Group Definition Worksheet - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 150
Cluster Configuration Worksheets A-3 Shared Storage Capacity Worksheet Use the Shared Storage Capacity worksheet to outline your shared storage capacity requirements. Description Shared Storage Capacity Worksheet Disk Resource 1 Disk Resource 2 Required Capacity without RAID Level of - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 151
A-4 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Group Failover/Failback Policy Worksheet Use the Group Failover/Failback Policy worksheet to define failover and failback settings for each cluster group. Group Failover/Failback - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 152
to gather information necessary for the installation of Compaq ProLiant Clusters HA/F100 or HA/F200. Are you: Cluster Name: Preinstallation Worksheet r Forming a cluster or r Joining a cluster Domain account Microsoft Cluster Server will run under: User Name Password Domain Network - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 153
adapters, and it may be connected to dual RA4000s containing redundant storage array controllers and two Compaq StorageWorks Fibre Channel Storage Hubs. IMPORTANT: Cable your Compaq ProLiant single-server system according to Compaq-recommended guidelines. Redundancy Manager may appear to work if the - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 154
B-2 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Figure B-1 shows a single-server setup with an RA4000. This setup provides redundant paths to the RA4000. RA4000 storage hub storage hub server L AN Figure B-1. Single-server setup with a single RA4000 - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 155
sophisticated system configuration and control of each defined path. Redundancy Manager is supported on all Compaq ProLiant servers in single-server configurations. The following sections provide information about: s Installing Redundancy Manager s Managing Redundancy Manager s Troubleshooting - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 156
B-4 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Installing Redundancy Manager The following requirements must be met to install Redundancy Manager on a server running Microsoft Windows NT Server 4.0/Enterprise Edition or Microsoft Windows NT Server 4.0: s 32 MB of RAM required, - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 157
Single-Server Environment B-5 Manually Installing Redundancy Manager If the server is not set up to automatically load when the CD is placed in the CD-ROM drive, follow these steps to manually install Compaq Redundancy Manager: 1. Place the Compaq Redundancy Manager (Fibre Channel) CD in the CD-ROM - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 158
B-6 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Managing Redundancy Manager Redundancy Manager increases the availability of single-server or clustered systems using the RA4000 storage system. Redundancy Manager can detect failures of the host bus adapters, array controllers, - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 159
Using Compaq Redundancy Manager in a Single-Server Environment B-7 Changing Paths The following information describes how to change paths using Redundancy Manager. NOTE: Redundancy Manager will not change the configuration until you close - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 160
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide close Redundancy Manager. Expanding Capacity Redundancy Manager does not support the hot-add of logical drives. To add drives Reboot Windows NT on the server to see the new drives. Shut down and reboot the server. 3. Run the Array - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 161
if a failure has happened in the system. Refresh does not affect any processing or interrupt any of the system's functions. Rescan Rescan is used to check menu. NOTE: Reboot each server to clear the SCSI port after seven hot replaces. Troubleshooting Redundancy Manager This section provides - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 162
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Overview Redundancy Manager displays messages to inform you of possible changes and events in the system error message. The Event Viewer for Microsoft Windows NT Server displays additional information. For further information on using - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 163
Using Compaq Redundancy Manager in a Single-Server Environment B-11 Table B-1 Informational Messages continued Message Description The loop has been locked by another application. This message indicates that another application has issued a lock - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 164
12 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Warning Message This section provides a list of the warning messages that Redundancy Manager may display and the action you can take. A warning message informs you of possible noncritical changes and events in the system. Message - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 165
Using Compaq Redundancy Manager in a Single-Server Environment B-13 Error Messages This section provides a list of the error messages that Redundancy Manager may display and the action you should take. An error message informs you of possible critical changes and events in the system. Table B-3 - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 166
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide GBIC laser has malfunctioned. Action Reload Redundancy Manager from the CD. Check the RA4000 and all connections. No action needed Compaq StorageWorks RAID Array 4000 User Guide's "Replacing GBICs" chapter for instructions about how - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 167
cpqfcalm.sys doesn't support redundancy. GBIC laser has malfunctioned. GBIC laser has malfunctioned. Action You must make sure you have the correct version of cpqfcalm.sys. Refer to the Compaq StorageWorks RAID Array 4000 User Guide's "Replacing GBICs" chapter for instructions about how to replace - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 168
B-16 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Troubleshooting Redundancy Manager Troubleshooting Potential Problems This section provides help for troubleshooting potential problems with the Redundancy Manager. Table B-4 Troubleshooting Potential Problems Message Could - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 169
.com/highavailability) for information about software and firmware updates recommended or required for your Compaq ProLiant Cluster. Table C-1 Supported Software/Firmware Versions Software/Firmware Title Compaq SmartStart and Support Software CD Compaq Support Software Diskette for Windows NT (NT - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 170
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Table C-1 Supported Software/Firmware Versions continued Software/Firmware Title Compaq Redundancy Manager (Fibre Channel) Microsoft Windows NT Server 4.0 Service Pack Compaq Cluster Verification Utility Compaq Insight Manager Compaq - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 171
one host bus adapter is processing data and the other is in a booted, but inactive, state when the cluster is operating normally. The standby node RA4000 controller. A measure of how well a computer system can continuously deliver services to its clients. Availability is typically expressed as a - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 172
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Cluster group Compaq StorageWorks RA4000 Controller Compaq StorageWorks Fibre Channel Host Bus Adapter Compaq device that provides an interface between a host system (server) and storage system or other devices connected on a fibre - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 173
host bus adapter. This can be accomplished manually using Compaq Redundancy Manager or automatically upon failure of one of the adapters. Computer components that can be removed and replaced without powering down the system. A computer component that is powered on, not actively processing data, and - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 174
Glossary-4 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide IP address Load balancing Logical disks Mission-critical Network interface controller NIC Node NTFS Paging file POST Power-On Self-Test Preferred node Proprietary clustering system Internet Protocol Address. A number that - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 175
well as for arbitrating ownership of cluster resources. See Compaq StorageWorks RAID Array 4000 See Redundant Array of Inexpensive Disks other members include associated parity information. The continuous integrity of a system (server, storage, network, or cluster). The ability to check for new - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 176
Glossary-6 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Resource Scalability SCSI ServerNet Service Shared resource Static IP address assignment System UPS Virtual server A software or hardware entity upon which a client/server application or service is dependent. As it - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 177
Array Configuration Utility See Compaq Array Configuration Utility Automatic Server Recovery 3-6 B backup cluster 5-10 data 5-10 server IP address 2-36 ServerNet description 1-10 troubleshooting 6-4, 6-5 types 1-10 capacity planning 2-26 network 2-33 client/server applications 2-36 reconfiguration - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 178
-2 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide client-to-cluster connectivity troubleshooting 6-12 cluster address 5-17 administrator 1-18, 5-2 availability 1-1 backup 5-10. See also backup backup solutions 5-10 limitations 5-10 cables 1-2 communication strategy 2-16 Compaq - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 179
5-22 screen example 5-12, B-6 troubleshooting 6-17, 6-22, B-9 using and configuring 5-12, B-6 Redundant NIC Utility 2-14, 2-15 SmartStart assisted integration 3-14 description 1-15 manual configuration 3-19 recommended installation 3-13 SmartStart and Support Software CD 1-15 software tools features - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 180
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide website xiii www.compaq.com xii white papers 1-13, 2-16 D data backup 5-10 dedicated interconnect 1-12 DHCP 2-35 disk resource troubleshooting 6-3 DNS See Domain Name Service Domain Name Service 40 defined 2-40 manual 2-40 policy 2-41 - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 181
redundancy 2-18 file and print services connection considerations 2-35 filter Compaq ProLiant Cluster HA/F100 2-26 Compaq ProLiant Cluster HA/F100 hardware components 1-4 Compaq ProLiant Cluster HA/F200 hardware components 1-6 Compaq ProLiant cluster HA/F200 with dual RA4000s 2-33 Compaq ProLiant - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 182
10 Microsoft Cluster Server 6-3 redundant interconnect 3-12 requirements Compaq Redundancy Manager B-4 ServerNet interconnect 3-11 servers 3-6 SmartStart 3-13, 4-2 troubleshooting 6-2 installing Compaq Redundancy Manager automatically 3-16, B-4 Compaq Redundancy Manager manually 3-16, B-5 Integrated - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 183
See Microsoft Cluster Server N net use command 2-36 network capacity 2-33 clients 5-3 migrating 2-35 troubleshooting 6-4, 6-12 configurations 2-34 considerations 2-34 clients 2-34 corporate LAN 2-34 protocols 2-34 interface card 2-40 protocols 2-34 DNS 2-34 not supported 2-34 supported 2-34 TCP/IP - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 184
Index-8 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide restart period 2-38 threshold 2-38 RPC server is unavailable 6-2 S scheduling automatic failback 6-16 screen refresh how to 5-14, B-9 SDLC See synchronous data link control server capacity active/active configurations 2-28 - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 185
solving shared storage problems 6-7, 6-10 troubleshooting redundancy manager problems 6-22 warning messages 6-20 technical support xii TechNote Planning Considerations for Compaq ProLiant Clusters Using Microsoft Cluster Server 2-30 testing client failover 3-25 creation of the cluster 3-23 node - Compaq ProLiant 5500 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 186
Index-10 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide group failover/failback policy A-4 preinstallation A-5 shared storage capacity A-3 www.compaq.com 1-7, 1-8, 1-13, 1-18, 1-19, 2-29, 2-30, 3-6, 3-7, 3-23, 4-3
Compaq Confidential – Need to Know Required
Writer: Linda Arnold
Project: Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide
Comments:
Part Number: 380362-002
File Name: a-frnt.doc
Last Saved On: 8/11/99 3:55 PM
Compaq ProLiant Clusters
HA/F100 and HA/F200
Administrator Guide
Second Edition (September 1999)
Part Number 380362-002
Compaq Computer Corporation