HP Integrity rx2800 i2 User Service Guide - Page 75

Troubleshooting tools, LEDs, Front panel, Health LED



Table 26 Advanced low end troubleshooting (continued)

Step 8a
Symptom/Condition: MCA occurs during server operation; the server reboots the OS. (NOTE: Server reboots OS, if enabled.)
Action: Front panel LEDs indicate that the server detected a fatal error that it cannot recover from through OS recovery routines (system health is flashing red, internal health is steady green, external health is steady green, and power is steady green).
1. Capture the MCA dump with the UEFI command errdump mca. If the system can boot the OS, you can capture binary MCA dump files online.
2. Examine the iLO 3 MP logs for entries related to CPUs, CPU power modules (PPMs), shared memory, and core I/O devices (see “Errors and reading error logs” (page 80) for more details).
3. See “Troubleshooting tools” (page 75) for instructions on running the MCA Analysis Tool, which uses MCAs to determine the most likely faulty or failed CRU.
The preceding problem is fixed when the MCA does not repeat, or the source of the MCA has been determined and dealt with.

Step 8b
Symptom/Condition: MCA occurs during server operation; server reboot of OS is prevented. NOTE: The troubleshooting actions for this step are identical to those in Step 8a, except that the server in this step must be hard reset to begin the booting process. You must hard reset the server to clear the fatal condition and boot the OS.
Action: Front panel LEDs indicate that the server detected a fatal front side bus error caused by DIMMs, or by any parity error in the I/O path between SBA, LBA, or HBA (system health is off, internal health is flashing amber, external health is steady green, power is steady green). System firmware is running to gather and log all error data for this MCA event.
1. Examine the iLO 3 MP logs for entries related to CPUs, CPU power modules (PPMs), shared memory, and core I/O devices (see “Errors and reading error logs” (page 80) for more details).
2. See “Troubleshooting tools” (page 75) for instructions on running the MCA Analysis Tool, which uses MCAs to determine the most likely faulty or failed CRU.
The preceding problem is fixed when the MCA does not repeat.

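The LED patterns that distinguish Steps 8a and 8b can be sketched as a small lookup, which may help when scripting a triage checklist. This is only an illustrative Python sketch: the names FrontPanelLeds, STEP_PATTERNS, match_step, and needs_hard_reset are hypothetical and not part of any HP tool, and the LED states are entered by hand from what is observed on the front panel.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class FrontPanelLeds:
    """Observed LED states, typed in by hand (hypothetical helper type)."""
    system_health: str
    internal_health: str
    external_health: str
    power: str


# LED patterns transcribed from Steps 8a and 8b of Table 26.
STEP_PATTERNS = {
    "8a": FrontPanelLeds("flashing red", "steady green", "steady green", "steady green"),
    "8b": FrontPanelLeds("off", "flashing amber", "steady green", "steady green"),
}


def match_step(leds: FrontPanelLeds) -> Optional[str]:
    """Return the Table 26 step whose LED pattern matches exactly, or None."""
    for step, pattern in STEP_PATTERNS.items():
        if leds == pattern:
            return step
    return None


def needs_hard_reset(step: str) -> bool:
    """Per the table, only Step 8b requires a hard reset before the OS can boot."""
    return step == "8b"
```

For example, passing FrontPanelLeds("off", "flashing amber", "steady green", "steady green") to match_step selects Step 8b, for which needs_hard_reset is true. Any pattern not listed in these two steps returns None and should be checked against the other steps of the table.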
Troubleshooting tools
Use the following tools to aid in troubleshooting the server.
LEDs
Front panel
The front panel of the system contains the power button/system power LED, health LED, System
Event Log LED, and locator switch/LED. The server uses flashing states (for example, flashing amber
or red) on these LEDs to indicate a warning or an error.
There are a total of three buttons, arranged horizontally, with the UID button and the power button
each having an integrated LED. In addition to the two integrated button/LEDs, there are health
LEDs.
Health LED
The front panel health LED indicates the status of the components that are externally serviceable.
Whenever the external health LED is lit, the LED on the corresponding CRU should also be lit for
the failed component.