HP ProLiant 4500 Compaq ProLiant Cluster HA/F100 and HA/F200 Administrator Gui - Page 70

Failover of Directly Connected Devices, Failover Threshold and Failover Period



Designing the Compaq ProLiant Clusters HA/F100 and HA/F200
2-39
Failover Threshold and Failover Period
The failover threshold and failover period are similar to the restart values. The
failover threshold defines the maximum number of times per failover period
that MSCS attempts to fail over a cluster group. If the cluster group exceeds
the failover threshold in the allotted failover period, the group is left on its
current node, in its current state, whether that is online, offline, or partially
online.
The failover threshold and failover period prevent a cluster group from
bouncing back and forth between servers. If a cluster group is so unstable that
it cannot run properly on either cluster node, it will eventually be left in its
current state on one of the nodes. The failover threshold and period determine
the point at which the decision is made to leave the cluster group in its current
state.
The following example illustrates the relationship between the restart
threshold and period and the failover threshold and period.
Assume you have a cluster group (Group1) that is configured to have a
preferred server (Server1). If Group1 encounters an event that forces it offline,
MSCS first attempts to restart the failed resource. If the resource cannot be
restarted within the limits of the restart threshold and period, MSCS attempts
to fail over Group1 to the other cluster node. If the failover threshold for
Group1 is set to 10 and the failover period is set to 3 (hours), MSCS will fail
over Group1 as many as 10 times in a 3-hour period. If failures are still
forcing Group1 offline once that limit is reached, MSCS stops attempting to
fail over the group and leaves it in its current state.
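The threshold-and-period rule in this example can be sketched in a few lines of Python. This is an illustrative model only, not MSCS code: the class name FailoverPolicy, the method try_failover, and the use of a rolling time window are all assumptions made for the sketch, and the way MSCS actually measures the failover period may differ.

```python
from collections import deque


class FailoverPolicy:
    """Hypothetical model of a failover threshold and period (not MSCS code)."""

    def __init__(self, threshold, period_hours):
        self.threshold = threshold      # maximum failovers per period
        self.period = period_hours      # length of the period, in hours
        self._times = deque()           # times (in hours) of recent failovers

    def try_failover(self, now):
        """Return True if a failover is allowed at time `now` (hours)."""
        # Assumption: the period is a rolling window, so failovers older
        # than one full period no longer count against the threshold.
        while self._times and now - self._times[0] >= self.period:
            self._times.popleft()
        if len(self._times) >= self.threshold:
            # Threshold reached: the group stays where it is, as it is.
            return False
        self._times.append(now)
        return True


# Group1 from the example: threshold 10, period 3 hours,
# with a failure forcing the group offline every 15 minutes.
policy = FailoverPolicy(threshold=10, period_hours=3)
results = [policy.try_failover(now=i * 0.25) for i in range(12)]
print(results)
```

In this model the first ten failures each trigger a failover, and the eleventh and twelfth, which still fall inside the same 3-hour period, do not, so the group is left in its current state on whichever node holds it.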
Failover of Directly Connected Devices
Devices that are physically connected to a server cannot move to the other
cluster node. Therefore, any applications or resources dependent on these
devices may be unable to restart on the other cluster node. Examples of
direct-connect devices include printers, mainframe interfaces, modems, fax
interfaces, and customized input devices such as bank card readers.
For example, if a server is providing print services to users, and the printer is
directly connected to the parallel port of the server, there is no way to switch
the physical connection to the other server, even though the print queue and
spooler can be configured to fail over. Instead, the printer should be
configured as a true network printer and connected to a hub that is accessible
from either cluster node. Then, in the event of a server failure, the print
queue and spooler fail over to the other server and physical access to the
printer is maintained.