HP Server rp7420 HP 9000 rp7420 Server - User Service Guide, Fifth Edition - Page 80

Server Management Behavior, Thermal Monitoring, Fan Control



Server Management Behavior
This section describes how the system responds to over-temperature situations, how the firmware
controls and monitors fans, and how it controls power to the server.
Thermal Monitoring
The manageability firmware is responsible for monitoring the ambient temperature in the server
and taking appropriate action if this temperature becomes too high. To this end, the ambient
temperature of the server is broken into four ranges: normal, overtemp low (OTL), overtemp
medium (OTM), and overtemp high (OTH).
Figure 5-11 shows the actions taken at each range transition. Actions for increasing
temperatures are shown on the left; actions for decreasing temperatures are shown on the right.
Figure 5-11 Temperature States
On large temperature swings, the server transitions through all states in order. It might move
to the next state immediately, but each of the preceding actions still occurs. If the temperature
reaches the highest range, the manageability firmware shuts the server down immediately.
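The in-order traversal described above can be sketched as follows. This is a minimal illustration, not the firmware's implementation: the `ThermalState` enum, `transition_path`, and `must_shut_down` are hypothetical names, and the per-transition actions from Figure 5-11 are not modeled because the figure is not reproduced here.

```python
from enum import IntEnum


class ThermalState(IntEnum):
    """The four ambient temperature ranges named in the text."""
    NORMAL = 0
    OTL = 1  # overtemp low
    OTM = 2  # overtemp medium
    OTH = 3  # overtemp high


def transition_path(current, target):
    """Return every state entered, in order, when moving from current to target.

    Hypothetical helper: even on a large swing, no intermediate state is skipped.
    """
    if target == current:
        return []
    step = 1 if target > current else -1
    return [ThermalState(v) for v in range(current + step, target + step, step)]


def must_shut_down(path):
    """Reaching the highest range (OTH) means an immediate shutdown."""
    return ThermalState.OTH in path
```

For example, a swing from `NORMAL` straight to `OTH` yields the path `OTL`, `OTM`, `OTH`, so the actions tied to each intermediate transition would still be taken before the shutdown.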
Fan Control
There are three sets of fans in the system: those on the I/O bay, the front and rear fans that are
connected to the main backplane, and those on the cell boards. The front fans run on standby
power, and run any time AC input power is supplied to the server. All of the fans turn on when
48 V power is supplied to the system.
As shown in Figure 5-11, the fan behavior is related to the temperature state. The fans go to
high speed when the ambient temperature is anywhere above the normal operating range. The front
and rear fans also go to high speed any time a chassis intrusion switch is triggered by removing
a side cover.
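The two high-speed conditions above can be expressed as a small decision function. This is a sketch under stated assumptions: the group names, the `"normal"` speed label, and the function `fan_speeds` are hypothetical, and the code assumes that "the fans" in the over-temperature rule means every fan group.

```python
# Hypothetical names for the fan groups described in the text.
FAN_GROUPS = ("front", "rear", "io_bay", "cell_board")


def fan_speeds(ambient_above_normal, intrusion_switch_triggered):
    """Return a dict mapping each fan group to "high" or "normal".

    Assumption: an ambient temperature above the normal range drives
    every group to high speed; an intrusion switch affects only the
    front and rear fans.
    """
    base = "high" if ambient_above_normal else "normal"
    speeds = {group: base for group in FAN_GROUPS}
    if intrusion_switch_triggered:
        speeds["front"] = "high"
        speeds["rear"] = "high"
    return speeds
```

With a side cover removed at normal ambient temperature, only the `front` and `rear` entries are `"high"`; the other groups keep their current setting.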
Altimeter Circuit
The PCI-X backplane contains an altimeter circuit. This circuit is used to adjust the chassis fan
speeds for the operating altitude at power on and during MP initialization. The chassis fans
consist of the nine front fans, twelve rear fans, and the six PCI-X I/O assembly fans. If an altimeter
failure is detected, the information is logged as an event ID and then propagated to the OS level,
where monitoring diagnostics can pick it up.
The MP checks the altimeter circuit at power on. If the altimeter circuit returns an expected
value, the altimeter is considered good. The altimeter reading is then stored in
non-volatile random access memory (NVRAM) on the core I/O card. If the stored value is ever
lost, for example because of a core I/O replacement, the NVRAM is updated at the next boot,
provided the altimeter is functioning normally. If the altimeter has failed and the stored
value has been lost because of a core I/O failure or replacement, the MP adjusts the fan speeds
for sea-level operation.
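The fallback order described above can be sketched as follows. This is an illustration only: `select_altitude` and `SEA_LEVEL` are hypothetical names, `None` stands in for a failed altimeter or a lost NVRAM value, and the middle case (altimeter failed, stored value still present) is an inference from the text rather than something it states outright.

```python
SEA_LEVEL = 0  # hypothetical value representing sea-level operation


def select_altitude(altimeter_reading, nvram_value):
    """Pick the altitude used to set fan speeds and the value to keep in NVRAM.

    altimeter_reading: the reading, or None if the altimeter has failed.
    nvram_value: the stored reading, or None if it has been lost.
    Returns (altitude_for_fans, nvram_value_after_boot).
    """
    if altimeter_reading is not None:
        # Good altimeter: use the reading and (re)store it in NVRAM.
        return altimeter_reading, altimeter_reading
    if nvram_value is not None:
        # Assumed behavior: failed altimeter, fall back to the stored value.
        return nvram_value, nvram_value
    # Failed altimeter and no stored value: sea-level operation.
    return SEA_LEVEL, None
```

After a core I/O replacement with a working altimeter, the first branch applies, which matches the statement that the NVRAM is updated at the next boot.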