IBM 86655RY Hardware Maintenance Manual - Page 40

Table 1. Light path diagnostics., Cause, Action

Page 40 highlights

Table 1. Light path diagnostics. None LED SMI NMI Cause Action The system error log is 75% or more full; a PFA alert was logged; or a failure occurred on the I2C bus. Check the system error log and correct any problems. See "Choices available from the Configuration/Setup main menu" on page 44 for information about clearing the error log. Disconnecting the server from all power sources for at least 20 seconds will turn off the system error LED. A systems management event occurred. Restart the server. A nonmaskable interrupt occurred. The PCIA, PCIB, or PCIC LED will probably also 1. be on. If the PCIA, PCIB, or PCIC LED is not on, restart the server. SP PCIA PCIB PCIC DASD MEM CPU VRM FAN TEMP The service processor has failed. If the problem persists, try to determine the failing adapter by removing one adapter at a time and restarting the server after each adapter is removed. 1. Run service processor diagnostics. 2. Replace Legacy I/O board. An error occurred on PCI bus A. An adapter in PCI slot 1 or 2, or the processor board 1. caused the error. 2. Check the error log for additional information. If you cannot correct the problem from the information in the error log, try to determine the failing adapter by removing one adapter at a time from PCI bus A (PCI slots 1-2) and restarting the server after each adapter is removed. An error occurred on PCI bus B. An adapter in PCI slot 3, 4, 5, or 6 or the processor 1. board caused the error. 2. Check the error log for additional information. If you cannot correct the problem from the information in the error log, try to determine the failing adapter by removing one adapter at a time from PCI bus B (PCI slots 3-6) and restarting the server after each adapter is removed. An error occurred on PCI bus C. An error on the processor or I/O board caused the problem. Check the error log for additional information. If the error log indicates a problem with the integrated SCSI controller, the Ethernet controller or video controller, see "Starting the diagnostic programs" on page 25. A hot-swap hard disk drive has failed on SCSI channel B. 1. If the TEMP LED is also on, take the actions listed for that LED. A memory error occurred. 2. If the amber status LED on one of the hot-swap hard disk drives is on, replace the drive. 1. Check the DIMM error LEDs on the memory board. 2. Replace the DIMM indicated by the lit DIMM error LEDs. One of the microprocessors has failed or a microprocessor is installed in the wrong 1. Check the microprocessor error LEDs on the memory board. If a microprocessor error LED is on for a connector. microprocessor connector that has a terminator card installed instead of a microprocessor, the microprocessors are not installed in the correct order. See "Installing a microprocessor kit" on page 73 for information about the correct order for installing microprocessors and VRMs. Otherwise, continue with the next step. One of the voltage regulator modules on the processor board has failed. 2. Turn off the server, reseat the microprocessor indicated by the lit microprocessor error LED, and restart the server. 3. If the problem persists, replace the microprocessor. 1. Check the VRM error LEDs on the processor board. 2. Turn off the server, reseat the VRM indicated by the lit VRM error LED, and restart the server. One of the fan assemblies has failed or is operating too slowly. 3. If the problem persists, replace the VRM. The LED on the failing fan assembly will be lit. Replace the fan assembly. Note: A failing fan can also cause the TEMP and DASD LEDs to be on. The system temperature has exceeded the maximum rating. 1. Check to see if a fan has failed. If it has, replace the fan. 2. Make sure the room temperature is not too high. (See "Features and specifications" on page 3.) NON RED OVER SPEC PS1 PS2 PS3 Server drawing too much power to operate in a redundant power mode. The server is drawing more power than the power supplies are rated for. The first power supply has failed. The second power supply has failed. The third power supply has failed. If the problem persists, see "Temperature checkout" on page 31. System can continue to operate in a nonredundant power mode. To operate in a redundant mode, add a power supply or remove most recently installed options. Either add a power supply or remove a device from the server. Replace the first power supply. Replace the second power supply. Replace the third power supply. 30 Hardware Maintenance Manual: Netfinity 7600 - Type 8665 Models 1RY, 2RY

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130
  • 131
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • 143
  • 144
  • 145
  • 146
  • 147
  • 148
  • 149
  • 150
  • 151
  • 152
  • 153
  • 154
  • 155
  • 156
  • 157
  • 158
  • 159
  • 160
  • 161
  • 162
  • 163
  • 164
  • 165
  • 166
  • 167
  • 168
  • 169
  • 170
  • 171
  • 172
  • 173
  • 174
  • 175
  • 176
  • 177
  • 178
  • 179
  • 180
  • 181
  • 182
  • 183
  • 184
  • 185
  • 186
  • 187
  • 188
  • 189
  • 190
  • 191
  • 192
  • 193
  • 194
  • 195
  • 196
  • 197
  • 198
  • 199
  • 200
  • 201
  • 202
  • 203
  • 204
  • 205
  • 206
  • 207
  • 208
  • 209
  • 210
  • 211
  • 212
  • 213
  • 214
  • 215
  • 216
  • 217
  • 218
  • 219
  • 220
  • 221
  • 222
  • 223
  • 224
  • 225
  • 226
  • 227
  • 228
  • 229
  • 230
  • 231
  • 232
  • 233
  • 234
  • 235
  • 236
  • 237
  • 238
  • 239
  • 240
  • 241
  • 242
  • 243
  • 244
  • 245
  • 246
  • 247
  • 248
  • 249
  • 250
  • 251
  • 252
  • 253
  • 254
  • 255
  • 256
  • 257
  • 258
  • 259
  • 260
  • 261
  • 262
  • 263
  • 264
  • 265
  • 266
  • 267
  • 268
  • 269
  • 270
  • 271
  • 272
  • 273
  • 274
  • 275
  • 276
  • 277
  • 278
  • 279
  • 280
  • 281
  • 282
  • 283
  • 284
  • 285
  • 286
  • 287
  • 288
  • 289
  • 290
  • 291
  • 292
  • 293
  • 294

30
Hardware Maintenance Manual: Netfinity 7600
Type 8665 Models 1RY, 2RY
Table 1. Light path diagnostics.
LED
Cause
Action
None
The system error log is 75% or more full; a PFA alert was logged; or a failure occurred
on the I2C bus.
Check the system error log and correct any problems.
See
Choices available from the Configuration/Setup
main menu
on page 44 for information about clearing the error log.
Disconnecting the server from all power
sources for at least 20 seconds will turn off the system error LED.
SMI
A systems management event occurred.
Restart the server.
NMI
A nonmaskable interrupt occurred.
The PCIA, PCIB, or PCIC LED will probably also
be on.
1.
If the PCIA, PCIB, or PCIC LED is not on, restart the server.
If the problem persists, try to determine the failing adapter by removing one adapter at a time and restarting the
server after each adapter is removed.
SP
The service processor has failed.
1.
Run service processor diagnostics.
2.
Replace Legacy I/O board.
PCIA
An error occurred on PCI bus A.
An adapter in PCI slot 1 or 2, or the processor board
caused the error.
1.
Check the error log for additional information.
2.
If you cannot correct the problem from the information in the error log, try to determine the failing
adapter by removing one adapter at a time from PCI bus A (PCI slots 1
2) and restarting the server after
each adapter is removed.
PCIB
An error occurred on PCI bus B.
An adapter in PCI slot 3, 4, 5, or 6 or the processor
board caused the error.
1.
Check the error log for additional information.
2.
If you cannot correct the problem from the information in the error log, try to determine the failing
adapter by removing one adapter at a time from PCI bus B (PCI slots 3
6) and restarting the server after
each adapter is removed.
PCIC
An error occurred on PCI bus C.
An error on the processor or I/O board caused the
problem.
Check the error log for additional information.
If the error log indicates a problem with the integrated SCSI
controller, the Ethernet controller or video controller, see
Starting the diagnostic programs
on page 25.
DASD
A hot-swap hard disk drive has failed on SCSI channel B.
1.
If the TEMP LED is also on, take the actions listed for that LED.
2.
If the amber status LED on one of the hot-swap hard disk drives is on, replace the drive.
MEM
A memory error occurred.
1.
Check the DIMM error LEDs on the memory board.
2.
Replace the DIMM indicated by the lit DIMM error LEDs.
CPU
One of the microprocessors has failed or a microprocessor is installed in the wrong
connector.
1.
Check the microprocessor error LEDs on the memory board.
If a microprocessor error LED is on for a
microprocessor connector that has a terminator card installed instead of a microprocessor, the
microprocessors are not installed in the correct order.
See
Installing a microprocessor kit
on page 73
for information about the correct order for installing microprocessors and VRMs.
Otherwise, continue
with the next step.
2.
Turn off the server, reseat the microprocessor indicated by the lit microprocessor error LED, and restart
the server.
3.
If the problem persists, replace the microprocessor.
VRM
One of the voltage regulator modules on the processor board has failed.
1.
Check the VRM error LEDs on the processor board.
2.
Turn off the server, reseat the VRM indicated by the lit VRM error LED, and restart the server.
3.
If the problem persists, replace the VRM.
FAN
One of the fan assemblies has failed or is operating too slowly.
Note:
A failing fan can also cause the TEMP and DASD LEDs to be on.
The LED on the failing fan assembly will be lit.
Replace the fan assembly.
TEMP
The system temperature has exceeded the maximum rating.
1.
Check to see if a fan has failed.
If it has, replace the fan.
2.
Make sure the room temperature is not too high.
(See
Features and specifications
on page 3.)
If the problem persists, see
Temperature checkout
on page 31.
NON RED
Server drawing too much power to operate in a redundant power mode.
System can continue to operate in a nonredundant power mode.
To operate in a redundant mode, add a power
supply or remove most recently installed options.
OVER SPEC
The server is drawing more power than the power supplies are rated for.
Either add a power supply or remove a device from the server.
PS1
The first power supply has failed.
Replace the first power supply.
PS2
The second power supply has failed.
Replace the second power supply.
PS3
The third power supply has failed.
Replace the third power supply.