IBM HS40 Hardware Maintenance Manual - Page 97

Error, symptoms - specification

Page 97 highlights

Lit blade-error LED Cause Board The system temperature has exceeded a temperature error threshold level. Processor x error The microprocessor has failed. Processor mismatch The processors do not match. BMC fault NMI error The BMC self-test has detected a failure A non-maskable interrupt has occurred. Action 1. Check to see if a blower on the BladeCenter unit has failed. If it has, replace the blower. 2. Make sure the room temperature is not too high. (See "Features and specifications" on page 4 for temperature information.) 3. Check the management module for ThermTrip errors and replace microprocessors as needed. 1. Make sure that the microprocessor indicated by the lit LED is installed correctly. (See "Installing an additional microprocessor" on page 46 for installation instructions). 2. Replace the microprocessor. Make sure that all microprocessors have the same cache size and type and the same clock speed. Internal and external clock frequencies must be identical; also see "Error symptoms." 1. Reset and initialize the blade server and I/O board. 2. Replace the I/O board. 1. Replace the blade server cover, reinsert the blade server in the BladeCenter unit, and then restart the blade server. 2. Check the system error log for information about the error. 3. Replace the processor board and the I/O board. See "Processor board" on page 72 and "I/O board" on page 74 for instructions. Error symptoms You can use the error symptom table to find solutions to problems that have definite symptoms. If you cannot find the problem in the error symptom charts, go to "Starting the diagnostic programs" on page 26 to test the server. If you have just added new software or a new option and your server is not working, do the following before using the error symptom charts: v Remove the software or device that you just added. v Run the diagnostic tests to determine if your server is running correctly. v Reinstall the new software or new device. In the following table, if the entry in the FRU/action column is a suggested action, perform that action; if it is the name of a component, reseat the component and replace it if necessary. The most likely cause of the symptom is listed first. Chapter 6. Symptom-to-FRU index 87

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130
  • 131
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • 143
  • 144
  • 145
  • 146
  • 147
  • 148
  • 149
  • 150
  • 151
  • 152
  • 153
  • 154
  • 155
  • 156
  • 157
  • 158
  • 159
  • 160
  • 161
  • 162
  • 163
  • 164
  • 165
  • 166
  • 167
  • 168
  • 169
  • 170
  • 171
  • 172

Lit
blade-error
LED
Cause
Action
Board
temperature
error
The
system
temperature
has
exceeded
a
threshold
level.
1.
Check
to
see
if
a
blower
on
the
BladeCenter
unit
has
failed.
If
it
has,
replace
the
blower.
2.
Make
sure
the
room
temperature
is
not
too
high.
(See
“Features
and
specifications”
on
page
4
for
temperature
information.)
3.
Check
the
management
module
for
ThermTrip
errors
and
replace
microprocessors
as
needed.
Processor
x
error
The
microprocessor
has
failed.
1.
Make
sure
that
the
microprocessor
indicated
by
the
lit
LED
is
installed
correctly.
(See
“Installing
an
additional
microprocessor”
on
page
46
for
installation
instructions).
2.
Replace
the
microprocessor.
Processor
mismatch
The
processors
do
not
match.
Make
sure
that
all
microprocessors
have
the
same
cache
size
and
type
and
the
same
clock
speed.
Internal
and
external
clock
frequencies
must
be
identical;
also
see
“Error
symptoms.”
BMC
fault
The
BMC
self-test
has
detected
a
failure
1.
Reset
and
initialize
the
blade
server
and
I/O
board.
2.
Replace
the
I/O
board.
NMI
error
A
non-maskable
interrupt
has
occurred.
1.
Replace
the
blade
server
cover,
reinsert
the
blade
server
in
the
BladeCenter
unit,
and
then
restart
the
blade
server.
2.
Check
the
system
error
log
for
information
about
the
error.
3.
Replace
the
processor
board
and
the
I/O
board.
See
“Processor
board”
on
page
72
and
“I/O
board”
on
page
74
for
instructions.
Error
symptoms
You
can
use
the
error
symptom
table
to
find
solutions
to
problems
that
have
definite
symptoms.
If
you
cannot
find
the
problem
in
the
error
symptom
charts,
go
to
“Starting
the
diagnostic
programs”
on
page
26
to
test
the
server.
If
you
have
just
added
new
software
or
a
new
option
and
your
server
is
not
working,
do
the
following
before
using
the
error
symptom
charts:
v
Remove
the
software
or
device
that
you
just
added.
v
Run
the
diagnostic
tests
to
determine
if
your
server
is
running
correctly.
v
Reinstall
the
new
software
or
new
device.
In
the
following
table,
if
the
entry
in
the
FRU/action
column
is
a
suggested
action,
perform
that
action;
if
it
is
the
name
of
a
component,
reseat
the
component
and
replace
it
if
necessary.
The
most
likely
cause
of
the
symptom
is
listed
first.
Chapter
6.
Symptom-to-FRU
index
87