IBM QS21 Service Guide - Page 113

Solving undetermined problems


Note: When you are diagnosing a problem in the blade server, you must determine whether the problem is in the blade server or in the BladeCenter unit.

  • If all of the blade servers have the same symptom, the problem is probably elsewhere in the infrastructure. For more information, see the Hardware Maintenance Manual and Troubleshooting Guide or Problem Determination and Service Guide for your BladeCenter unit.
  • If the BladeCenter unit contains more than one blade server and only one of the blade servers has the problem, troubleshoot the blade server that has the problem.

If the blade server is inoperative, use the information in this section.

If you suspect that a software problem is causing failures (continuous or intermittent), see "Software problems" on page 58.

Check the LEDs on all the power supplies of the BladeCenter unit in which the blade server is installed. If the LEDs indicate that the power supplies are working correctly and reseating the blade server does not correct the problem, complete the following steps:

1. Turn off the blade server.
2. Remove the blade server from the BladeCenter unit and remove the cover.
3. Make sure that the control panel connector is correctly seated on the system board (see "Removing the blade-server front bezel assembly" on page 41 for the location of the connector).
4. If no LEDs on the control panel of the blade server are working, replace the bezel assembly. Try to turn on the blade server from the Advanced Management Module (see the documentation for the BladeCenter unit and Advanced Management Module for more information).
5. Reinstall the blade server and check whether it is working. If the blade server remains inactive, continue with step 6.
6. Turn off the blade server.
7. Remove the blade server from the BladeCenter unit and remove the cover.
8. Remove or disconnect the following devices one at a time, if installed, until you find the failure:
     • High Speed InfiniBand expansion card
     • SAS expansion card
     • I/O buffer DIMMx
   Reinstall, turn on, and reconfigure the blade server each time.

If the problem is solved when you remove the device from the blade server but the problem recurs when you reinstall the same device, suspect the device. If the problem recurs when you replace the device with a different one, suspect the system board. Have a trained service technician replace the system board assembly.

If you suspect a networking problem and the blade server passes all the system tests, suspect the network switch. However, the problem might be in the network itself and be external to the system.

Chapter 5. Diagnostics and troubleshooting 95
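
The device-isolation rule in step 8 can be read as a small decision table. The following is a minimal sketch of that reasoning only, not an IBM tool or API: the names `Suspect` and `classify` and the three boolean observations are hypothetical, chosen for illustration, and the observations are assumed to be recorded by hand by the person servicing the blade server.

```python
from enum import Enum
from typing import Optional


class Suspect(Enum):
    """Hypothetical labels for the outcome of testing one device."""
    DEVICE = "device"
    SYSTEM_BOARD = "system board"
    UNDETERMINED = "undetermined"


def classify(solved_when_removed: bool,
             recurs_on_reinstall: bool,
             recurs_with_replacement: Optional[bool] = None) -> Suspect:
    """Apply the rule from the text to observations about one device.

    solved_when_removed: the problem went away with the device removed.
    recurs_on_reinstall: the problem came back with the same device reinstalled.
    recurs_with_replacement: the problem came back with a different device
        of the same type installed (None if no replacement was tried).
    """
    if not solved_when_removed:
        # Removing this device changed nothing; test the next device in the list.
        return Suspect.UNDETERMINED
    if recurs_with_replacement:
        # A different device shows the same failure: suspect the system board.
        return Suspect.SYSTEM_BOARD
    if recurs_on_reinstall:
        # Only the original device triggers the failure: suspect the device.
        return Suspect.DEVICE
    return Suspect.UNDETERMINED


if __name__ == "__main__":
    # Example: the problem disappears without the card and returns with the same card.
    print(classify(solved_when_removed=True, recurs_on_reinstall=True).value)
```

In this sketch a failure that also occurs with a replacement device takes priority over the reinstall result, which matches the order of the two conditions in the text. A system board result still means that a trained service technician replaces the system board assembly.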

