Compaq ProLiant 1600 Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator
Compaq ProLiant 1600 Manual
View all Compaq ProLiant 1600 manuals
Add to My Manuals
Save this manual to your list of manuals |
Compaq ProLiant 1600 manual content summary:
- Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 1
HA/F100 and HA/F200 Administrator Guide Second Edition (September 1999) Part Number 380362-002 Compaq Computer Corporation Compaq Confidential - Need to Know Required Writer: Linda Arnold Project: Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Comments: Part Number: 380362-002 File - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 2
Clusters HA/F100 and HA/F200 Administrator Guide Second Edition (September 1999) Part Number 380362-002 Compaq Confidential - Need to Know Required Writer: Linda Arnold Project: Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Comments: Part Number: 380362-002 File Name: a-frnt.doc - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 3
1 Architecture of the Compaq ProLiant Clusters HA/F100 and HA/F200 Overview of Compaq ProLiant Clusters HA/F100 and HA/F200 Components ...... 1-1 Compaq ProLiant Cluster HA/F100 1-3 Compaq ProLiant Cluster HA/F200 1-5 Compaq ProLiant Servers 1-7 Compaq StorageWorks RAID Array 4000 Storage System - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 4
Administrator Guide Architecture of the Compaq ProLiant Clusters HA/F100 and HA/F200 continued Compaq Software ...1-14 Compaq SmartStart and Support Software CD 1-15 Compaq Redundancy Manager (Fibre Channel 1-16 Compaq Cluster Verification Utility 1-16 Compaq Insight Manager 1-17 Compaq Insight - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 5
Manager 5-15 Cluster-Specific Features of Compaq Insight Manager 5-16 Compaq Insight Manager XE 5-17 Cluster Monitor 5-18 Compaq Confidential - Need to Know Required Writer: Linda Arnold Project: Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Comments: Part Number: 380362-002 - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 6
5-20 Managing Cluster History 5-21 Importing and Exporting Cluster Configurations 5-21 Microsoft Cluster Administrator 5-22 Chapter 6 Troubleshooting the Compaq ProLiant Clusters HA/F100 and HA/F200 Installation ...6-2 Troubleshooting Node-to-Node Problems 6-5 Shared Storage ...6-7 Client - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 7
Redundancy Manager B-16 Troubleshooting Potential Problems B-16 Appendix C Software and Firmware Versions Glossary Index Compaq Confidential - Need to Know Required Writer: Linda Arnold Project: Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Comments: Part Number: 380362-002 - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 8
to be used as step-by-step instructions for installation and as a reference for operation, troubleshooting, and future upgrades of the cluster server. This guide provides information about the installation, configuration, and implementation of the Compaq ProLiant Cluster Models HA/F100 and HA/F200 - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 9
and HA/F200," contains high-level troubleshooting information for the Compaq ProLiant Clusters HA/F100 and HA/F200. Compaq Confidential - Need to Know Required Writer: Linda Arnold Project: Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Comments: Part Number: 380362-002 File Name - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 10
Enter key. Enter When you are instructed to enter information, type the information and then press the Enter key. Compaq Confidential - Need to Know Required Writer: Linda Arnold Project: Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Comments: Part Number: 380362-002 File Name - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 11
or specific instructions. NOTE: Text set off in this manner presents commentary, sidelights, or interesting points of information. Getting Help If you have a problem and have exhausted the information in this guide, you can get further information and other help in the following locations. Compaq - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 12
to specific hardware and software components of the Compaq ProLiant Clusters HA/F100 and HA/F200, including, but not limited to, the following: s Documentation related to the ProLiant servers you are clustering (for example, manuals, posters, and performance and tuning guides) s Compaq RA4000 Array - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 13
Clusters HA/F100 and HA/F200 Overview of Compaq ProLiant Clusters HA/F100 and HA/F200 Components A cluster is a loosely coupled collection of servers and storage that acts as a single system, presents a single-system image to clients, provides protection against system failures, and provides - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 14
1-2 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Compaq ProLiant Clusters HA/F100 and HA/F200 platforms are composed of the following. Hardware: s Compaq ProLiant servers s Compaq StorageWorks RAID Array 4000 Storage System (formerly Compaq Fibre Channel Storage System) q Compaq - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 15
Channel cable q Ethernet crossover cable q Network (LAN) cable The Compaq ProLiant Cluster HA/F100 includes these software solution components: s Microsoft Windows NT Server 4.0 Enterprise Edition s Compaq SmartStart and Support Software CD s Compaq Support Software Diskette for Windows NT (NT SSD - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 16
components of the Compaq ProLiant Cluster HA/F100 The Compaq ProLiant Cluster HA/F100 configuration is a cluster with a Compaq StorageWorks RAID Array 4000, a single Compaq StorageWorks Fibre Channel Storage Hub (7- or 12-port), two Compaq ProLiant servers (nodes), a single Compaq StorageWorks Fibre - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 17
Channel cable q Ethernet crossover cable q Network (LAN) cable The Compaq ProLiant Cluster HA/F200 includes these software solution components: s Microsoft Windows NT Server 4.0 Enterprise Edition s Compaq SmartStart and Support Software CD s Compaq Support Software Diskette for Windows NT (NT SSD - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 18
components of the Compaq ProLiant Cluster HA/F200 The Compaq ProLiant Cluster HA/F200 configuration is a cluster with one or more Compaq StorageWorks RAID Array 4000s, two Compaq StorageWorks Fibre Channel Storage Hubs (7- or 12-port), two Compaq ProLiant servers, two Compaq StorageWorks Fibre - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 19
Network Interface Controller (NIC) support, dual-ported hot-pluggable 10/100 NICs and redundant hot-pluggable power supplies (on most high-end models). Many of these features are available at the low end and mid range of the Compaq ProLiant server line, as well. Compaq has logged thousands of hours - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 20
storage cabinet that contains the disk drives, power supply, and array controller. The RA4000 can hold twelve 1-inch or eight 1.6-inch Wide-Ultra SCSI drives. The RA4000 supports the same hot-pluggable drives as Compaq Servers and Compaq ProLiant Storage Systems, online capacity expansion, online - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 21
Compaq ProLiant Clusters HA/F100 and HA/F200 1-9 Compaq StorageWorks Fibre Channel Storage Hubs The servers in a Compaq ProLiant Cluster HA/F100 and HA/F200 are connected to one or more Compaq StorageWorks Raid Array will be unused ports. Compaq does not currently support using these ports to connect - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 22
ease of expansion and 100 MB/s performance. GBIC-SW modules support distances up to 500 meters using multi-mode fibre optic cable. Cables Three general categories of cables are used for Compaq ProLiant HA/F100 and HA/F200 clusters: Server to Storage Shortwave (multi-mode) fiber optic cables are - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 23
Architecture of the Compaq ProLiant Clusters HA/F100 and HA/F200 1-11 Cluster Interconnect Two types there are three options: Dedicated Interconnect Using an Ethernet Crossover Cable: An Ethernet crossover cable (supplied in both the HA/F100 and HA/F200 kits) can be used to connect the NICs directly - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 24
1-12 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Cluster Interconnect The cluster interconnect is a data stand-alone server configuration. Because clients desiring the full advantage of the cluster will now connect to the cluster rather than to a specific server, configuring - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 25
a Windows NT Cluster," available from the Compaq High Availability website (http://www.compaq.com/highavailability). Interconnect Adapters Ethernet adapters, or Compaq ServerNet adapters, can be used for the interconnect between the servers in a Compaq ProLiant Cluster. Either 10Mb/sec, or 100Mb/sec - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 26
Clusters HA/F100 and HA/F200 Administrator Guide Microsoft Software Microsoft Windows NT Server 4.0/Enterprise Edition (Windows NTS/E) is the operating system for the Compaq ProLiant Clusters HA/F100 and HA/F200. Microsoft Cluster Server (MSCS) is part of Windows NTS/E. As the core component - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 27
to the Compaq Server Setup and Management pack. For information about using SmartStart to install the Compaq ProLiant Cluster HA/F100 and HA/F200, see chapters 3 and 4 of this guide. Compaq Array Configuration Utility The Compaq Array Configuration Utility, found on the Compaq SmartStart and Support - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 28
1-16 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Fibre Channel Fault Isolation Utility (FFIU) The SmartStart and Support Software CD both single-server and clustered systems that use the Compaq StorageWorks RAID Array 4000 Storage System and Compaq ProLiant servers. Redundancy - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 29
of the Compaq ProLiant Clusters HA/F100 and HA/F200 1-17 s Storage tests verify the presence and minimum configuration requirements of supported host bus adapters, array controllers, and external storage subsystem. s System software tests verify that Microsoft Windows NT Server 4.0/ Enterprise - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 30
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Compaq Insight Manager XE Compaq Insight Manager XE is a Web-based management system. It can be used in conjunction with Compaq designed specifically for monitoring cluster health. Cluster Monitor provides access to the Compaq Insight - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 31
these applications in a Compaq ProLiant Cluster environment. Visit the Compaq High Availability website (http://www.compaq.com/highavailability) to download current versions of these TechNotes and other technical documents. IMPORTANT: Your software applications may need to be updated to take full - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 32
2 Chapter Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 Before connecting any cables or powering on any machines, it is important to understand how all of /Failback Planning In addition to reading this chapter, read the planning chapter in Microsoft Cluster Server Administrator's Guide. - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 33
2-2 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Planning Considerations To correctly assess capacity, detailed in this section will help you design your Compaq ProLiant Cluster so that it addresses your specific availability needs. s Cluster configuration design is addressed in - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 34
Compaq ProLiant Clusters HA/F100 and HA/F200 2-3 An active/active configuration has two primary designs: s The first design uses Microsoft Cluster Server over to the other (Node2), ensure that Node2 has enough capacity, memory, and CPU power to execute not only its own applications, but to run the - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 35
2-4 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Example 1: File & Print/File & Print An example business scenario involves two file and print servers. The Human Resources (HR) department uses one server, and the Marketing department uses the other. Both servers actively run - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 36
Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 2-5 Example 2: Database/Database Another resources fail over to their secondary node, the HR database server. The Marketing clients experience a slight disruption of service while the database resources are failed over, the database - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 37
2-6 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide If the node running the order entry database encounters a failure, the database fails over to its secondary node. The order entry clients experience a slight disruption of service while the database resources are failed over, the - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 38
Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 2-7 Example - Database/Standby Server An example business scenario uses a single server to perform queries and calculations on order entry information, translating sales orders into packaging and distribution instructions for the warehouse. - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 39
2-8 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Cluster Groups Understanding the relationship between your company's business functions and cluster groups is essential to getting the most from your cluster. Business functions rely on computer systems to support activities such - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 40
Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 2-9 Resource Dependency Tree Order business function, which consists of two cluster groups: a database server (a Windows NT application) and a Web server (a Windows NT service). NOTE: For this example, it is assumed that each cluster group - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 41
2-10 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide 2. List each application or service required for each business function. Web Sales Order Business Function Web Server Service (Cluster Group #1) Database Server Application (Cluster Group #2) Resource Resource Resource #1 #2 - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 42
Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 2-11 3. List the immediate dependencies for each application (or service). Web Sales Order Business Function Web Server Service (Cluster Group #1) Database Server Application (Cluster Group #2) Network Name Web Server Service Physical - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 43
2-12 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Figure 2-7 illustrates the worksheet for the Web Sales Order Sub Resource 1 Sub Resource 2 Sub Resource 3 Sub Resource 4 Resource #3 Web Server Service Sub Resource 1 Sub Resource 2 Sub Resource 3 Sub Resource 4 Resource #4 - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 44
not specifically covered in this section, redundant server components (such as power supplies specific server model. The single points of failure described in this section are: s Cluster interconnect s Fibre Channel data paths s Non-shared disk drives s Shared disk drives NOTE: The Compaq ProLiant - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 45
2-14 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide MSCS Configuration MSCS allows you to configure a on your hardware. The Compaq Redundant NIC Utility (originally called Advanced Network Fault Detection and Correction Feature) is supported on all Compaq TI-based Ethernet and - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 46
Compaq ProLiant Clusters HA/F100 and HA/F200 2-15 Because the purpose of the redundant interconnect is to increase the availability of the cluster, it is important to monitor the status of your redundant NICs. Compaq Insight Manager and Compaq Clients connected to a virtual server on the cluster (via - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 47
2-16 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Recommended Cluster Communication Strategy The past two sections discussed the redundancy of intracluster and cluster-to-LAN communication. However, to obtain the most benefit while - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 48
Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 2-17 Example 1 A Compaq dual-port NIC and a single-port port NIC is configured as the primary network path for cluster-to-LAN communication. The Compaq Advanced Network Control Utility is used to configure the second port on the dual-port - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 49
2-18 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Example 2 The second example configuration consists of three single-port NICs. One NIC is dedicated to intracluster communication. The other two NICs are used for cluster-to-LAN communication. The Compaq Advanced Network Control - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 50
which ProLiant Clusters implement shared storage. Generally, the storage system consists of Compaq StorageWorks Host Adapters (host bus adapters) in each server, a Compaq StorageWorks Fibre Channel Storage Hub, a Compaq StorageWorks RA4000 Controller (array controller), and a Compaq StorageWorks - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 51
2-20 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide If the host bus adapter-to-storage hub path fails, it results in a failover of all applications. For instance, if one server can no longer access the storage hub (and by extension the shared storage), all of the cluster groups - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 52
the Compaq ProLiant Clusters HA Compaq Array Configuration Utility. If RAID 1 or 5 is not used, failure of a shared disk drive will disrupt service to all clustered applications and services that depend on the drive. Failover of a cluster node will not resolve this failure, since neither server - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 53
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Compaq ProLiant Cluster HA/F100 reduces the single points of failure that exist in a single-server environment by allowing two servers will occur. s An array controller failure will cause the redundant array controller to take over for - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 54
Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 2-23 The following illustration depicts the HA/F200 configuration components. RA4000 storage hub storage hub Node 1 Dedicated Interconnect Node 2 L AN Figure 2-12. HA/F200 configuration - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 55
F200 Administrator Guide HA/F200 Fibre Channel Data Paths The Compaq StorageWorks RAID Array 4000 storage system is the mechanism with which the HA/F200 cluster implements shared storage. The Compaq ProLiant Cluster HA/F200 minimum configuration consists of two host bus adapters in each server, two - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 56
the Compaq ProLiant Clusters HA/F100 and HA/F200 2-25 The active data paths run from the active host bus adapters in the servers to the active storage hub. If this path fails, the applications can seamlessly fail over to the standby host bus adapter-to-storage hub data paths. A A S S Server - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 57
Operating System Clustered Applications & Services Non-Clustered Applications & Services Node1 Node2 Figure 2-15. File locations in a Compaq ProLiant Cluster For each server, determine the processor, memory, and disk storage requirements needed to support its operating system and nonclustered - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 58
the Compaq ProLiant Clusters HA/F100 and HA/F200 2-27 Determine the processor and memory requirements needed to support the clustered applications and services that will run on each node while the cluster is in a normal operating state. If the program files of a clustered application and/or service - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 59
2-28 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide The following table details the capacity requirements that can be applied to either active/active design. Table 2-1 Server Capacity* Requirements for Active/Active Configuration Node1 Node2 Operating system (Windows NT and - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 60
for shared drives when using MSCS. Hardware RAID is the only available RAID option for shared storage. For more information about hardware RAID, see the following: s Compaq StorageWorks Fibre Channel RAID Array 4000 User Guide s Configuring Compaq RAID Technology for Database Servers (TechNote - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 61
while cluster Node2 controls drives F and H. More information regarding cluster disk configuration can be found in the Compaq TechNote, Planning Considerations for Compaq ProLiant Clusters Using Microsoft Cluster Server, located on the Compaq website (http://www.compaq.com). This capability provides - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 62
Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 2-31 Shared Storage Capacity Worksheet Worksheet Disk Resource 1 Disk Resource 2 Description Web files and Web scripts for Web Service Group Log file(s) for Database Required Application Capacity 12 GB 4.3 GB Desired Level of - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 63
2-32 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Load Balancing Load balancing helps to your HA/F200 configuration for host bus adapters in a single server to be active/active. Figure 2-17 shows a Compaq ProLiant Cluster HA/F200 configuration with only one RA4000. Because there - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 64
RA4000 A S storage hub storage hub A Server S A S Server Figure 2-18. Compaq ProLiant Cluster HA/F200 with dual RA4000s Networking Capacity attach to the cluster. If Node1 encounters a failure and its applications and services fail over to Node2, then Node2 needs to handle access from its own - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 65
2-34 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Network Considerations This section addresses clustering items that affect the corporate LAN. MSCS has specific requirements regarding which protocol can be used and how IP address and network name resolution occurs. Additionally, - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 66
Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 2-35 DHCP Only use DHCP for the clients; it should not be used for the cluster node IP addresses or cluster resource IP addresses. DHCP cannot be used to assign IP addresses for virtual servers. When configuring DHCP, exclude enough static - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 67
2-36 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Connecting to Shared Resources In the traditional, command-driven connection to a shared resource, the user needs to know the server name and the share name. In a clustered environment, the command is changed to reflect the - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 68
the Compaq ProLiant Clusters HA/F100 and HA/F200 2-37 In a clustered environment, IP addresses for the database are configured to fail over with the database application, making a backup IP address on the client unnecessary. When the database resources have failed over to the other server, the - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 69
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide you have a disk resource (Disk1) that is part of a cluster group (Group1). You set the problem, set the restart threshold to 0 (zero). If the group will experience severe performance limitations if failed over to a secondary server - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 70
Designing the Compaq ProLiant Clusters HA/F100 and HA/F200 2-39 Failover Threshold and Failover Period as bank card readers. For example, if a server is providing print services to users, and the printer is directly connected to the parallel port of the server, there is no way to switch the physical - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 71
2-40 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Another example of a direct-connect device is a directly connected mainframe interface. If the first server is directly connected to the mainframe, as through an SDLC (Synchronous Data Link Control) card in the server, there is no - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 72
the Compaq ProLiant Clusters failback. This setting allows the administrator to fail back a group manually. Allow automatic failback. This setting allows MSCS to fail back specific hours of the day during which automatic failback can occur. Refer to the Microsoft Cluster Server Administrator's Guide - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 73
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Group Failover/Failback Policy Use the Group Failover/Failback Policy worksheet to define the failover and failback policies for each cluster group. Figure 2-19 illustrates the failover/failback parameters for the Web Server Service - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 74
for the clustered Compaq ProLiant Servers s Compaq StorageWorks RAID Array 4000 User Guide s Compaq StorageWorks Fibre Channel Host Adapter Installation Guide s Installation guide for the interconnect card of your choice s Compaq SmartStart for Servers Setup Poster s Compaq Insight Manager - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 75
Administrator's Guide s Microsoft Cluster Server Administrator's Guide The installation and setup of your ProLiant Cluster can be described in the following phases: s Preinstallation guidelines s Installing the hardware, including: q Cluster nodes q Compaq StorageWorks RAID Array 4000 storage - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 76
Compaq ProLiant Clusters HA/F100 and HA/F200 3-3 Preinstallation Guidelines When setting up the cluster, you will need to answer each of the following questions. Using the Preinstallation Worksheet in Appendix A, write down the answers to these questions before installing Microsoft Cluster Server - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 77
s One of the utilities the SmartStart CD runs is the Compaq Array Configuration Utility, which configures the drives in the RA4000. The Array Configuration Utility stores the drive configuration information on the drives themselves. After you have configured the shared drives from one of the cluster - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 78
Setting Up the Compaq ProLiant Clusters HA/F100 and HA/F200 3-5 s MSCS requires drive letters to remain constant throughout the life of the cluster; therefore, you must assign permanent drive letters to your shared drives. If you are performing manual software installation, use Windows NT Disk - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 79
node until all the hardware has been installed in both cluster nodes. NOTE: Compaq recommends that Automatic Server Recovery (ASR) be left at the default values for clustered servers. Follow the installation instructions in your Compaq ProLiant Server documentation to set up the hardware. To install - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 80
the Logical-Physical Slot Numbering Problem," available from the Compaq website (http://www.compaq.com). For specific instructions on how to install an adapter card, refer to the documentation for the interconnect card you are installing or the Compaq ProLiant Server you are using. The cabling - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 81
3-8 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Setting Up the Compaq StorageWorks Raid Array 4000 Storage System Follow the instructions in the Compaq StorageWorks RAID Array 4000 User Guide to set up the RA4000s, the Compaq StorageWorks Fibre Channel Storage Hub 7 or 12, the - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 82
configure the drives from the other cluster node. For detailed information about configuring the drives, refer to "Running the Array Configuration Utility" in the Compaq StorageWorks RAID Array 4000 User Guide. The Array Configuration Utility runs automatically during an Automated SmartStart cluster - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 83
3-10 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Setting Up a Private Interconnect There are four ways to set switch Ethernet Direct Connect An Ethernet crossover cable is included with your Compaq ProLiant Cluster. This cable directly connects two Ethernet cards. Place one end - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 84
Setting Up the Compaq ProLiant Clusters HA/F100 and HA/F200 3-11 Ethernet Direct Connect Using a Private Hub An Ethernet hub requires standard Ethernet cables; Ethernet crossover cables will not work with a hub. Follow these steps to cable the server interconnect using an Ethernet hub: 1. Connect - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 85
3-12 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Setting Up a Public Interconnect It is possible-but not Recommended Cluster Communication Strategy" section in Chapter 2 of this guide for more information about setting up redundancy for intracluster and cluster-to-LAN - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 86
system, and one server powered up. You need the following during installation: IMPORTANT: Refer to Appendix C for the software and firmware version levels your cluster requires. s Compaq SmartStart for the Servers CD s Compaq SmartStart for Servers Setup Poster s Server Profile Diskette (included - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 87
Clusters HA/F100 and HA/F200 Administrator Guide Cluster-Specific SmartStart Installation The SmartStart setup poster describes the general flow of configuring and installing software on a single server. The installation for a Compaq ProLiant Cluster HA/F100 and HA/F200 will be very similar. The - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 88
system partition. 7. Next, you will be guided through steps to install addition Compaq software and utilities including choosing the NT boot partition size and installing the Compaq Support Software Disk (SSD). Follow the instructions in the SmartStart setup poster. IMPORTANT: Node2 Exception: When - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 89
3-16 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide 16. Power on storage and wait for drives to spin, then power on the server. 17. If setting up an HA/F200, install Redundancy Manager. To automatically install Redundancy Manager Redundancy Manager: a. Place the Compaq Redundancy - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 90
Setting Up the Compaq ProLiant Clusters HA/F100 and HA/F200 3-17 IMPORTANT: Node2 Exception: Repeat SmartStart Assisted Integration steps 1-11 for Node2. Then proceed to step 17. IMPORTANT: Node1 Exception: Execute step 18 only after Node2 is set up. 18. Run the Compaq Cluster Verification Utility ( - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 91
completes, install Windows NT Service Pack 5. For the latest information on Service Packs, please refer to the release notes. 22. Run the Compaq Support Software Disk (SSD) through the diskettes you created or from the SmartStart CD and verify that all installed drivers are current. 23. Install - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 92
Node1 setup. Accept these changes for Node2 by exiting. NOTE: Create a logical drive with 100MB of space to be used as the quorum disk. 6. Next, SmartStart will automatically run the Array Configuration Utility. Refer to the Compaq StorageWorks RAID Array 4000 User Guide for instructions about - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 93
3-20 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide 9. Power down the server, insert Options ROMPaq diskette in Node1, and restart the system. IMPORTANT: When updating the firmware on the array controllers, make sure that one server is powered off. IMPORTANT: Node2 Exception: Do - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 94
Pack 5. See the latest release notes for the latest service pack information. 22. Run Compaq Support Software Diskette (SSD) through the diskettes you created or from the SmartStart CD and verify that all installed drivers are current. 23. Install your applications and managing and monitoring - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 95
service to the Windows NT domain administrator user account. 8. Repeat these steps to install the software on the other cluster node. For more specific instructions about using Compaq Intelligent Cluster Administrator, refer to the Compaq Intelligent Cluster Administrator Quick Setup Guide - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 96
in a Compaq ProLiant Cluster environment. Visit to the Compaq High Availability website (http://www.compaq.com/ servers in a fresh state, verify creation of the cluster using the following steps. 1. Shut down and power off both servers. 2. Power off and then power on the RA4000. 3. Power both servers - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 97
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide When Windows NTS/E finishes booting up on both servers, follow these steps to use Microsoft Cluster Administrator to verify creation of the cluster: 1. From the Windows NTS/E desktop on either cluster server troubleshooting - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 98
the Compaq ProLiant Clusters HA/F100 and HA/F200 3-25 3. Make sure all predefined resources and groups are online. Verify that some of the resources and groups are owned by the server you will be powering off, so that a failure event will result in failover of resources and/or groups. 4. Power off - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 99
3-26 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide 5. Use Microsoft Cluster Administrator to perform a manual failover of the cluster group that contains the IP address. 6. After the manual failover completes, execute the ping command again. 7. As soon as the other node brings the - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 100
StorageWorks Fibre Channel Storage Hub s One additional Compaq StorageWorks Fibre Channel Host Adapter (host bus adapter) per server s Compaq Redundancy Manager (Fibre Channel) software s Appropriate firmware and drivers If you already have a Compaq ProLiant Cluster HA/F100 up and running, you do - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 101
(array controller) per storage subsystem One additional Compaq StorageWorks Fibre Channel Storage Hub One additional Compaq StorageWorks Fibre Channel Host Adapter (host bus adapter) per server s Compaq SmartStart for Servers CD s Compaq SmartStart for Servers Setup Poster s Server Profile - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 102
website (http://www.compaq.com/highavailability). 3. Create Support Software Disk (SSD) from the Diskette Builder utility. Run the Support Software Disk (SSD) either from the diskettes or directly from the SmartStart CD. Install the latest Fibre Channel drivers and other server components. Do not - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 103
4-4 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide 9. Insert the first Options ROMPaq diskette that you created in Diskette Builder. Run Options ROMPaq and choose to update the firmware on the array controllers. 10. Power down the storage and Node1 after the firmware update - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 104
details the utilities and programs used in the ongoing management of Compaq ProLiant Clusters HA/F100 and HA/F200. The topics addressed in this chapter include: s Managing a Cluster Without Interrupting Cluster Services s Managing a Cluster in a Degraded Condition s Managing Hardware Components of - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 105
5-2 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide The chapter also details the utilities and programs used in the ongoing management of Compaq ProLiant Clusters HA/F100 and HA/F200. The tools addressed in this chapter include: s Compaq Redundancy Manager (Fibre Channel) s Compaq - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 106
Compaq Insight Manager has been enhanced to operate with the Compaq ProLiant Clusters HA/F100 and HA/F200. Compaq Insight Manager XE allows you to view and manage servers a cluster. Since users will experience some disruption of service and, possibly, a performance degradation during failover, they - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 107
5-4 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Managing a Cluster's Shared Storage Compaq Insight Manager and Compaq Insight Manager XE monitors the RAID Array 4000 storage system from both a physical and a logical perspective: s The physical drives and Fibre Channel hardware - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 108
: s Service Pack 5 for Windows NT 4.0 is applied. s Compaq SmartStart CD 4.10 or later is used to apply system drivers. 1. Power down one of the cluster servers (Node2). 2. Insert the Compaq SmartStart and Support Software CD into the CD-ROM drive of the other cluster server (Node1). Power down - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 109
using a Compaq ProLiant Cluster HA/F200 with redundant paths, be sure to attach both array controllers, one to each hub. 4. Add the additional RA4000 to the storage subsystem. Follow the hardware installation steps detailed in Chapter 3 of this guide. 5. Power on the newly added RA4000. 6. Power on - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 110
some or all of the data on the failed drive. Refer to the Compaq StorageWorks RAID Array 4000 User Guide for instructions on replacing a failed drive. Adding Drives to Increase Storage Capacity The following steps describe how to add a drive to the Compaq RA4000 storage system and to allocate it to - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 111
and HA/F200 Administrator Guide 1. Power down one of the cluster servers (Node2). 2. Insert the Compaq SmartStart and Support Software CD the other cluster server (Node1). Power down Node1. 3. Insert new drives in the RA4000 storage array. IMPORTANT: If using a Compaq ProLiant Cluster HA/F200 with - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 112
into the new node. 6. If the new node is part of a rack system, place the server in the rack. Attach the interconnect, LAN, Fibre Channel cables, and power cables. If you are using the Windows NT boot drives from the replaced node in the new node, power on the new node and follow the steps described - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 113
5-10 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Installing a New Windows NT Boot Drive New Windows NT boot drives require installation of Windows NT, configuration of the networking components of the new node, and installation of MSCS. Follow the SmartStart Assisted Path - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 114
in a Compaq ProLiant Cluster HA/F200 configuration. This functionality is only available when accessing two separate RAID arrays. The timing of manual load should not be moved from one server to another during peak processing periods. To move a database from one server to another, the database must - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 115
5-12 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Compaq Redundancy Manager Compaq Redundancy Manager (Fibre Channel) increases the availability of single-server or clustered systems using Compaq StorageWorks RAID Array 4000. Redundancy Manager can detect failures of the Compaq - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 116
Managing the Compaq ProLiant Clusters HA/F100 and HA/F200 5-13 Changing Paths Redundancy Manager allows you to change the active and standby paths for your cluster. The following provide instructions for changing paths. NOTE: Redundancy Manager will not change the configuration until you close the - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 117
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Rescan. Refresh Refresh (F5) updates information on the GUI screen array controllers and after adding or removing physical drives. NOTE: For every hot replace, a rescan should be run on each machine in a cluster. NOTE: Reboot each server - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 118
monitor the ProLiant Cluster and perform cluster administration functions such as starting and stopping an MSCS service, starting or stopping an MSCS node, and starting or stopping the MSCS cluster. Compaq Insight Manager consists of two components: a Windows-based console application and server- or - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 119
5-16 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Cluster-Specific Features of Compaq Insight Manager The following is an overview of the cluster-specific features found in Compaq Insight Manager. NOTE: The term cluster group used in this section refers to Compaq Insight Manager, - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 120
agents provide health status to either Compaq Insight Manager or Compaq Insight Manager XE. The agents translate data supplied by the device drivers into useful information that assists the user in correctly diagnosing the problem. Compaq Insight Manager or Compaq Insight Manager XE then provide - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 121
5-18 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Compaq Insight Manager XE offers a simple, industry-standard approach to Compaq Insight Manager XE. With Cluster Monitor, you can view all clusters from a single browser and configure monitor points and specific operational - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 122
Managing the Compaq ProLiant Clusters HA/F100 and HA/F200 5-19 Cluster Monitor has three distinct informational areas to meet individual operational needs: s A problem window with a prioritized cluster event list sorted by severity for the clusters that are under the administrator's control s A - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 123
5-20 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Cluster Monitor supports these attributes: s Disk space s CPU utilization s MSCS cluster status s Node Environment (Compaq Management Agent) status. Cluster Monitor uses pop-up notifications, alerts in the alert list, colored - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 124
Managing the Compaq ProLiant Clusters HA/F100 and HA/F200 5-21 s Assign resources to groups and nodes s Establish resource dependencies s Assign failover policies for cluster resources s Fail over resources and nodes s Stop and start cluster services Managing Cluster History Using the Cluster - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 125
5-22 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Microsoft Cluster Administrator Microsoft Cluster Administrator manages groups, resources, and the operating state of the cluster. Cluster Administrator gives you the ability to: s View the - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 126
HA/F100 and HA/F200 This chapter addresses problems encountered while installing, configuring, testing, and operating the Compaq ProLiant Clusters HA/F100 and HA/F200. These problems are described in the following troubleshooting categories: s Installation s Node-to-Node s Shared Storage s Client - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 127
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Installation This section addresses problems encountered during installation. Table 6-1 Solving Installation Problems Problem The error message "RPC Server server is operational and that the Cluster Service and the RPC services are - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 128
Troubleshooting the Compaq ProLiant Clusters HA/F100 and HA/F200 6-3 Table 6-1 Solving Installation Problems continued Problem Possible Cause MSCS installation will not complete on the first node. Insufficient space on nonshared drives for MSCS. Operating system is incorrect or deficient. - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 129
6-4 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Table 6-1 Solving Installation Problems continued Problem Possible Cause Clients do not see the cluster. Clients can only view the virtual servers. Added logical drives are not recognized Windows NT and Redundancy Manager do - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 130
Troubleshooting the Compaq ProLiant Clusters HA/F100 and HA/F200 6-5 Troubleshooting Node-to-Node Problems This section describes problems that may be encountered during server-to-server communication. Table 6-2 Solving Node-to-Node Problems Problem The resources failed over but the nodes do not - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 131
6-6 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Table 6-2 Solving Node-to-Node Problems continued Problem Possible Cause The second node cannot join the cluster. Improper name resolution. Cluster Service is not running. No network connectivity exists. TCP/IP is not - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 132
problems. Compaq ProLiant Clusters do not support the physical SCSI disks. Action Wait a minute, then click Refresh (F5). Reboot cluster nodes after installing MSCS. Ensure the drives are recognized. Ensure the host bus adapter driver for Windows NT is installed and running on both servers - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 133
6-8 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Table 6-3 Solving Shared Storage Problems continued Problem Possible Cause Drive(s) in the Compaq StorageWorks RAID Array 4000 are not recognized. Possible drive configuration problems. Action 1. Run the Compaq Array - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 134
Troubleshooting the Compaq ProLiant Clusters HA/F100 and HA/F200 6-9 Table 6-3 Solving Shared Storage Problems continued Problem Possible Cause Data on shared storage appears to be overwritten. MSCS may not be loaded and therefore cannot manage access to drive volumes in the shared storage. - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 135
6-10 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Table 6-3 Solving Shared Storage Problems continued Problem Possible Cause Compaq Redundancy Manager shows cluster in nonredundant mode. You are using ACU to expand capacity. ICL has failed. Mismatched firmware on the array - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 136
resources. 2. Refer to the Compaq StorageWorks RAID Array 4000 User Guide's "Replacing GBICs" chapter for instructions on replacing a GBIC-SW. 3. Manually fail back resources. 1. Manually fail over resources. 2. Refer to the Compaq StorageWorks RAID Array 4000 User Guide's "Replacing GBICs" chapter - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 137
6-12 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Client-to-Cluster Connectivity This section addresses problems that may be encountered in cluster-to-LAN communication. NOTE: The cluster is assigned one or more Net BIOS names associated with an IP address. Network clients - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 138
Troubleshooting the Compaq ProLiant Clusters HA/F100 and HA/F200 6-13 Table 6-4 Solving Client-to-Cluster Connectivity Problems continued Problem Possible Cause Clients do not see virtual servers. Virtual servers may not have their own IP addresses or Network Name resources. Client protocol - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 139
14 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Table 6-4 Solving Client-to-Cluster Connectivity Problems continued Problem related. resources. Action 1. Manually fail over each of the applications from the primary server to the secondary server. Make sure automatic failback - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 140
Troubleshooting the Compaq ProLiant Clusters HA/F100 and HA/F200 6-15 Table 6-4 Solving Client-to-Cluster Connectivity Problems continued Problem Possible Cause Clients cannot access a group that has failed over. Networking problem over group is a virtual server (that is, the group contains - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 141
6-16 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Cluster Groups and Cluster Resource Microsoft Cluster Administrator solves many group and cluster resource problems. For troubleshooting tips on this topic, refer to the Microsoft Cluster Server Administrator's Guide and Cluster - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 142
Troubleshooting the Compaq ProLiant Clusters HA/F100 and HA/F200 6-17 Troubleshooting Compaq Event Viewer for Microsoft Windows NT Server displays additional information. For more information on to an array controller. The lock management command only allows viewing of the data. The array controller - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 143
6-18 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Table 6-6 Compaq Redundancy Manager Informational Messages continued Message Description Action The loop has been locked by another application. Another application has issued a lock management command to an array controller - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 144
Troubleshooting the Compaq ProLiant Clusters HA/F100 and HA/F200 6-19 Table 6-6 Compaq Redundancy Manager Informational Messages continued Message Description Action You have not selected all the Paths to the following logical disk(s): The logical disk or drives shown are claimed but not all - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 145
6-20 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Warning Message This section provides a warning message Error Messages Description Another application has issued a lock management command to an array controller. The lock management command only allows viewing of the data. The - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 146
of the Compaq Fibre Channel Host Adapter SCSI Miniport Driver (cpqfcalm.sys) is being used that does not support redundancy. The minimum version for redundancy support is VX.X. The current version is VX.Z. Redundancy has been disabled. Description Redundancy Manager found an array with no drives in - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 147
6-22 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Other Potential Problems Redundancy Troubleshooting Redundancy Manager Problems Message Could not find the resource DLL file. Intercontroller Link Failure. Illegal Drives. Array controller firmware versions don't match. Array - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 148
Overview This appendix contains blank worksheets you can use to design, configure, and install your Compaq ProLiant Cluster HA/F100 or HA/F200. Completed worksheets are illustrated in chapters 2 and 3 of this guide. Copy these worksheets and use as many as necessary to assist you in planning and - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 149
A-2 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Cluster Group Definition Worksheet Complete the Cluster Group Definition worksheet for each business function requiring clustering. Cluster Function Group #1 Group #2 Resource #1 Cluster Group Definition Worksheet - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 150
Cluster Configuration Worksheets A-3 Shared Storage Capacity Worksheet Use the Shared Storage Capacity worksheet to outline your shared storage capacity requirements. Description Shared Storage Capacity Worksheet Disk Resource 1 Disk Resource 2 Required Capacity without RAID Level of - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 151
A-4 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Group Failover/Failback Policy Worksheet Use the Group Failover/Failback Policy worksheet to define failover and failback settings for each cluster group. Group Failover/Failback - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 152
to gather information necessary for the installation of Compaq ProLiant Clusters HA/F100 or HA/F200. Are you: Cluster Name: Preinstallation Worksheet r Forming a cluster or r Joining a cluster Domain account Microsoft Cluster Server will run under: User Name Password Domain Network - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 153
, and it may be connected to dual RA4000s containing redundant storage array controllers and two Compaq StorageWorks Fibre Channel Storage Hubs. IMPORTANT: Cable your Compaq ProLiant single-server system according to Compaq-recommended guidelines. Redundancy Manager may appear to work if the system - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 154
B-2 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Figure B-1 shows a single-server setup with an RA4000. This setup provides redundant paths to the RA4000. RA4000 storage hub storage hub server L AN Figure B-1. Single-server setup with a single RA4000 - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 155
system configuration and control of each defined path. Redundancy Manager is supported on all Compaq ProLiant servers in single-server configurations. The following sections provide information about: s Installing Redundancy Manager s Managing Redundancy Manager s Troubleshooting Redundancy Manager - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 156
B-4 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Installing Redundancy Manager The following requirements must be met to install Redundancy Manager on a server running Microsoft Windows NT Server 4.0/Enterprise Edition or Microsoft Windows NT Server 4.0: s 32 MB of RAM required, - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 157
from the Add/Remove Programs page. The setup program begins. 10. Follow the instructions displayed on the Redundancy Manager installation screens. 11. Close the Control Panel. 12. Remove the Redundancy Manager CD from the CD-ROM drive. 13. Reboot the server. Redundancy Manager is now installed on - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 158
B-6 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Managing Redundancy Manager Redundancy Manager increases the availability of single-server or clustered systems using the RA4000 storage system. Redundancy Manager can detect failures of the host bus adapters, array controllers, - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 159
Using Compaq Redundancy Manager in a Single-Server Environment B-7 Changing Paths The following information describes how to change paths using Redundancy Manager. NOTE: Redundancy Manager will not change the configuration until you close - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 160
B-8 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Shortcut method support the hot-add of logical drives. To add drives, you must: 1. Physically add the drives to the RA4000. 2. Reboot Windows NT on the server to see the new drives. Shut down and reboot the server. 3. Run the Array - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 161
drives. Follow these steps to run Rescan: NOTE: For every hot replace, a rescan should be run on each machine. 1. Select Features from the Main screen. 2. Select Rescan from the Features menu. NOTE: Reboot each server to clear the SCSI port after seven hot replaces. Troubleshooting Redundancy - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 162
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide . The Event Viewer for Microsoft Windows NT Server displays additional information. For further information on you that the array controller board is in an unknown state that caused a failed connection to the array controller. Action No - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 163
Compaq Redundancy Manager in a Single-Server Environment B-11 Table B-1 Informational Messages continued Message Description The loop has been locked by another application. This message indicates that another application has issued a lock management command to an array disk or drives shown are - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 164
B-12 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Warning Message This section provides a list of the warning Description This message indicates that the previous lock on an array controller has expired. Action No action needed to view the data. Override to take control of - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 165
Using Compaq Redundancy Manager in a Single-Server Environment B-13 Error Messages This section provides a list or has exited improperly. It is recommended that Compaq Redundancy Manager not be run while another program has a lock on the Array Controller(s). To stop this instance from starting, - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 166
drives in it. A damaged RA4000 or a bad connection could cause this. This message informs you that the array array controller. Close all applications and shut down your computer immediately. Refer to the Compaq StorageWorks RAID Array 4000 User Guide's "Replacing GBICs" chapter for instructions - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 167
cpqfcalm.sys doesn't support redundancy. GBIC laser has malfunctioned. GBIC laser has malfunctioned. Action You must make sure you have the correct version of cpqfcalm.sys. Refer to the Compaq StorageWorks RAID Array 4000 User Guide's "Replacing GBICs" chapter for instructions about how to replace - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 168
B-16 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Troubleshooting Redundancy Manager Troubleshooting Potential Problems This section provides help for troubleshooting potential problems with the Redundancy Manager. Table B-4 Troubleshooting Potential Problems Message Could - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 169
about software and firmware updates recommended or required for your Compaq ProLiant Cluster. Table C-1 Supported Software/Firmware Versions Software/Firmware Title Compaq SmartStart and Support Software CD Compaq Support Software Diskette for Windows NT (NT SSD) Array controller firmware Options - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 170
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Table C-1 Supported Software/Firmware Versions continued Software/Firmware Title Compaq Redundancy Manager (Fibre Channel) Microsoft Windows NT Server 4.0 Service Pack Compaq Cluster Verification Utility Compaq Insight Manager Compaq - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 171
controller. A measure of how well a computer system can continuously deliver services to its clients. Availability is typically expressed as a percentage, with 100 percent being the best possible rating. The ability to light the drive tray LEDs on a particular RA4000. Applications that are key to - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 172
Glossary-2 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Cluster group Compaq StorageWorks RA4000 Controller Compaq StorageWorks Fibre Channel Host Bus Adapter Compaq StorageWorks RAID Array 4000 Conflict Dynamic IP address assignment Ethernet Failback Failover Fault tolerance A - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 173
Hot pluggable Hot spare Interconnect See RA4000 (Compaq StorageWorks RAID Array 4000) An IEEE standard for providing server. Also called host adapter. The process of moving the operation of all I/O from one host bus adapter to another host bus adapter. This can be accomplished manually using Compaq - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 174
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide IP address Load balancing Logical disks Mission-critical Network interface controller NIC Node NTFS Paging file POST Power Controller An individual server in a Windows NT paging file for virtual memory, called PAGEFILE.SYS. The paging - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 175
resources. See Compaq StorageWorks RAID Array 4000 See Redundant Array of Inexpensive Disks A method of using hard disk drives in an array to provide data information. The continuous integrity of a system (server, storage, network, or cluster). The ability to check for new or lost logical disks - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 176
Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide Resource Scalability SCSI ServerNet Service Shared resource Static IP address assignment System UPS Virtual server A software or hardware entity upon which a client/server application or service a server. Uninterruptible Power Supply. - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 177
drives 5-7 shared storage to existing cluster 5-5 application software cluster-aware 1-19 Compaq integration technotes 1-19 array creating 2-30 maximum volumes 2-30 optimizing performance 2-30 volume 2-30 Array Configuration Utility See Compaq Array Configuration Utility Automatic Server Recovery - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 178
-2 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide client-to-cluster connectivity troubleshooting 6-12 cluster address 5-17 administrator 1-18, 5-2 availability 1-1 backup 5-10. See also backup backup solutions 5-10 limitations 5-10 cables 1-2 communication strategy 2-16 Compaq - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 179
5-22 screen example 5-12, B-6 troubleshooting 6-17, 6-22, B-9 using and configuring 5-12, B-6 Redundant NIC Utility 2-14, 2-15 SmartStart assisted integration 3-14 description 1-15 manual configuration 3-19 recommended installation 3-13 SmartStart and Support Software CD 1-15 software tools features - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 180
Index-4 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide website xiii www.compaq.com xii white papers 1-13, 2-16 D data backup 5-10 dedicated interconnect 1-12 DHCP 2-35 disk resource troubleshooting 6-3 DNS See Domain Name Service Domain Name Service 2-34 drive letters 3-5 drive - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 181
redundancy 2-18 file and print services connection considerations 2-35 filter Compaq ProLiant Cluster HA/F100 2-26 Compaq ProLiant Cluster HA/F100 hardware components 1-4 Compaq ProLiant Cluster HA/F200 hardware components 1-6 Compaq ProLiant cluster HA/F200 with dual RA4000s 2-33 Compaq ProLiant - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 182
10 Microsoft Cluster Server 6-3 redundant interconnect 3-12 requirements Compaq Redundancy Manager B-4 ServerNet interconnect 3-11 servers 3-6 SmartStart 3-13, 4-2 troubleshooting 6-2 installing Compaq Redundancy Manager automatically 3-16, B-4 Compaq Redundancy Manager manually 3-16, B-5 Integrated - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 183
to active 5-13, B-7 managing 5-13, B-7 performance server 2-37 PING command 3-25 preinstallation worksheet 3-3, A-5 private interconnect 1-12 public interconnect 1-12 Q quorum disk 3-14, 0-5 drive 2-30 R RA4000 See Compaq, StorageWorks RAID Array 4000 RAID example configurations 2-30 shared storage - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 184
a failed drive 5-7 troubleshooting 6-7 single points of failure 1-13 cluster-to-LAN communication 2-15 Fibre Channel data paths 2-19 interconnect 2-13 reducing 2-13 redundancy 2-16 single-port NIC 2-17, 2-18 SmartStart See Compaq SmartStart software patches 1-19 SSD See Compaq Support Software - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 185
problems 6-7, 6-10 troubleshooting redundancy manager problems 6-22 warning messages 6-20 technical support xii TechNote Planning Considerations for Compaq ProLiant Clusters Using Microsoft Cluster Server 6-12 Compaq Redundancy Manager 6-17, 6-22, B-9 Compaq StorageWorks RAID Array 4000 6-7 - Compaq ProLiant 1600 | Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator - Page 186
Index-10 Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide group failover/failback policy A-4 preinstallation A-5 shared storage capacity A-3 www.compaq.com 1-7, 1-8, 1-13, 1-18, 1-19, 2-29, 2-30, 3-6, 3-7, 3-23, 4-3
Compaq Confidential – Need to Know Required
Writer: Linda Arnold
Project: Compaq ProLiant Clusters HA/F100 and HA/F200 Administrator Guide
Comments:
Part Number: 380362-002
File Name: a-frnt.doc
Last Saved On: 8/11/99 3:55 PM
Compaq ProLiant Clusters
HA/F100 and HA/F200
Administrator Guide
Second Edition (September 1999)
Part Number 380362-002
Compaq Computer Corporation