IBM JS20 Hardware Maintenance Manual - Page 155

Light, diagnostics, Error, symptoms

Page 155 highlights

Light path diagnostics LEDs

Lit blade-error LED: None
CRU/action:
  • Check the event log or Linux Syslog (platform log) in the BladeCenter management module.

Lit blade-error LED: DIMM x error
  • DIMM 1 - CR40
  • DIMM 2 - CR45
  • DIMM 3 - CR46
  • DIMM 4 - CR53
CRU/action:
  • DIMM x
  Note: Multiple DIMM LEDs do not necessarily indicate multiple DIMM failures. If more than one DIMM LED is on, reseat or replace one DIMM at a time until the error goes away. Refer to the BladeCenter management module event log for further isolation.

Lit blade-error LED: Processor x error
  • CPU 0 - CR19
  • CPU 1 - CR58
CRU/action:
  • System board

Lit blade-error LED: System board x error
  • System board - CR20
CRU/action:
  • System board

Lit blade-error LED: Service processor
  Note: There are two service processor error LEDs:
  • CR27 - amber
  • CR38 - green
CRU/action:
  • Reinsert the blade server in the BladeCenter unit and restart the server; if the error reoccurs:
    1. Check the BladeCenter management module event log and the Linux Syslog (platform log) for more information.
    2. System board.

Lit blade-error LED: NMI error
  • CR17
CRU/action:
  • Reinsert the blade server in the BladeCenter unit and restart the server; if the error reoccurs:
    1. Check the BladeCenter management module event log and the Linux Syslog (platform log) for more information.
    2. System board.

Lit blade-error LED: Over temperature error
  • Temperature error - CR16
CRU/action:
  • Reinsert the blade server in the BladeCenter unit and restart the server; if the error reoccurs:
    1. Check the BladeCenter management module event log and the Linux Syslog (platform log) for more information.
    2. System board.

Error symptoms

You can use the error symptom tables to find solutions to problems that have definite symptoms. If you cannot find the problem in the error symptom charts, go to "General checkout" on page 37 to test the server.

If you have just added new software or a new option and the server is not working, do the following before using the error symptom charts:
  • Remove the software or device that you just added.
  • Go to "General checkout" on page 37 and run the firmware-based diagnostic tests to determine if the server is running correctly.
  • Reinstall the new software or new device.

In the following tables, if the entry in the FRU/action column is a suggested action, perform that action; if it is the name of a component, reseat the component and replace it if necessary. The most likely cause of the symptom is listed first.

Chapter 10. Symptom-to-FRU index 145
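The light path diagnostics table above maps each LED designator to a suggested CRU/action, so it can be sketched as a simple lookup. The following is an illustrative sketch only: the names LED_TABLE, REINSERT_STEPS, and describe_led are hypothetical and are not part of any IBM tool or firmware interface. The designators and actions are copied from the table, with the generic "DIMM x" entry filled in as the specific DIMM whose LED is lit.

```python
# Hypothetical lookup built from the light path diagnostics table.
# All names here are illustrative; this is not an IBM utility.

# Shared action sequence for service processor, NMI, and temperature errors.
REINSERT_STEPS = [
    "Reinsert the blade server in the BladeCenter unit and restart the server",
    "If the error reoccurs, check the BladeCenter management module event log "
    "and the Linux Syslog (platform log)",
    "System board",
]

# LED designator -> (error description, ordered list of CRU/actions).
LED_TABLE = {
    "CR40": ("DIMM 1 error", ["DIMM 1"]),
    "CR45": ("DIMM 2 error", ["DIMM 2"]),
    "CR46": ("DIMM 3 error", ["DIMM 3"]),
    "CR53": ("DIMM 4 error", ["DIMM 4"]),
    "CR19": ("Processor error (CPU 0)", ["System board"]),
    "CR58": ("Processor error (CPU 1)", ["System board"]),
    "CR20": ("System board error", ["System board"]),
    "CR27": ("Service processor error (amber)", REINSERT_STEPS),
    "CR38": ("Service processor error (green)", REINSERT_STEPS),
    "CR17": ("NMI error", REINSERT_STEPS),
    "CR16": ("Over temperature error", REINSERT_STEPS),
}


def describe_led(designator):
    """Return (description, actions) for a lit LED, or None if it is not listed."""
    return LED_TABLE.get(designator.strip().upper())


if __name__ == "__main__":
    for led in ("CR40", "CR17"):
        description, actions = describe_led(led)
        print(led, "->", description)
        for step_number, action in enumerate(actions, start=1):
            print("  ", step_number, action)
```

A component name in the action list follows the rule stated above: reseat the component and replace it if necessary. If more than one DIMM LED is lit, the note in the table still applies, so the DIMMs would be handled one at a time rather than all replaced together.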

