IBM 8872 Service Guide - Page 50

Checkout, procedure, About, checkout

Page 50 highlights

Checkout procedure The checkout procedure is the sequence of tasks that you should follow to diagnose a problem in the server. About the checkout procedure Before performing the checkout procedure for diagnosing hardware problems, review the following information: v Read the safety information beginning on page vii. v The diagnostic programs provide the primary methods of testing the major components of the server, such as the I/O board, Ethernet controller, keyboard, mouse (pointing device), serial ports, and hard disk drives. You can also use them to test some external devices. If you are not sure whether a problem is caused by the hardware or by the software, you can use the diagnostic programs to confirm that the hardware is working correctly. v When you run the diagnostic programs, a single problem might cause more than one error message. When this happens, correct the cause of the first error message. The other error messages usually will not occur the next time you run the diagnostic programs. Exception: If there are multiple error codes or light path diagnostics LEDs that indicate a microprocessor error, the error might be in a microprocessor or in a microprocessor socket. See "Microprocessor problems" on page 42 for information about diagnosing microprocessor problems. v Before running the diagnostic programs, you must determine whether the failing server is part of a shared hard disk drive cluster (two or more servers sharing external storage devices). If it is part of a cluster, you can run all diagnostic programs except the ones that test the storage unit (that is, a hard disk drive in the storage unit) or the storage adapter that is attached to the storage unit. The failing server might be part of a cluster if any of the following conditions is true: - You have identified the failing server as part of a cluster (two or more servers sharing external storage devices). - One or more external storage units are attached to the failing server and at least one of the attached storage units is also attached to another server or unidentifiable device. - One or more servers are located near the failing server. Important: If the server is part of a shared hard disk drive cluster, run one test at a time. Do not run any suite of tests, such as "quick" or "normal" tests, because this might enable the hard disk drive diagnostic tests. v If the server is halted and a POST error code is displayed, see "Error logs" on page 18. If the server is halted and no error message is displayed, see "Troubleshooting tables" on page 36 and "Solving undetermined problems" on page 90. v For information about power-supply problems, see "Solving power problems" on page 88 and "Power-supply LEDs" on page 57. v For intermittent problems, check the error log; see "Error logs" on page 18 and "Diagnostic programs, messages, and error codes" on page 59. 34 IBM xSeries 460 Type 8872 and xSeries MXE 460 Type 8874: Problem Determination and Service Guide

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130
  • 131
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • 143
  • 144
  • 145
  • 146
  • 147
  • 148
  • 149
  • 150
  • 151
  • 152
  • 153
  • 154
  • 155
  • 156
  • 157
  • 158
  • 159
  • 160
  • 161
  • 162
  • 163
  • 164
  • 165
  • 166
  • 167
  • 168
  • 169
  • 170

Checkout
procedure
The
checkout
procedure
is
the
sequence
of
tasks
that
you
should
follow
to
diagnose
a
problem
in
the
server.
About
the
checkout
procedure
Before
performing
the
checkout
procedure
for
diagnosing
hardware
problems,
review
the
following
information:
v
Read
the
safety
information
beginning
on
page
vii.
v
The
diagnostic
programs
provide
the
primary
methods
of
testing
the
major
components
of
the
server,
such
as
the
I/O
board,
Ethernet
controller,
keyboard,
mouse
(pointing
device),
serial
ports,
and
hard
disk
drives.
You
can
also
use
them
to
test
some
external
devices.
If
you
are
not
sure
whether
a
problem
is
caused
by
the
hardware
or
by
the
software,
you
can
use
the
diagnostic
programs
to
confirm
that
the
hardware
is
working
correctly.
v
When
you
run
the
diagnostic
programs,
a
single
problem
might
cause
more
than
one
error
message.
When
this
happens,
correct
the
cause
of
the
first
error
message.
The
other
error
messages
usually
will
not
occur
the
next
time
you
run
the
diagnostic
programs.
Exception:
If
there
are
multiple
error
codes
or
light
path
diagnostics
LEDs
that
indicate
a
microprocessor
error,
the
error
might
be
in
a
microprocessor
or
in
a
microprocessor
socket.
See
“Microprocessor
problems”
on
page
42
for
information
about
diagnosing
microprocessor
problems.
v
Before
running
the
diagnostic
programs,
you
must
determine
whether
the
failing
server
is
part
of
a
shared
hard
disk
drive
cluster
(two
or
more
servers
sharing
external
storage
devices).
If
it
is
part
of
a
cluster,
you
can
run
all
diagnostic
programs
except
the
ones
that
test
the
storage
unit
(that
is,
a
hard
disk
drive
in
the
storage
unit)
or
the
storage
adapter
that
is
attached
to
the
storage
unit.
The
failing
server
might
be
part
of
a
cluster
if
any
of
the
following
conditions
is
true:
You
have
identified
the
failing
server
as
part
of
a
cluster
(two
or
more
servers
sharing
external
storage
devices).
One
or
more
external
storage
units
are
attached
to
the
failing
server
and
at
least
one
of
the
attached
storage
units
is
also
attached
to
another
server
or
unidentifiable
device.
One
or
more
servers
are
located
near
the
failing
server.
Important:
If
the
server
is
part
of
a
shared
hard
disk
drive
cluster,
run
one
test
at
a
time.
Do
not
run
any
suite
of
tests,
such
as
“quick”
or
“normal”
tests,
because
this
might
enable
the
hard
disk
drive
diagnostic
tests.
v
If
the
server
is
halted
and
a
POST
error
code
is
displayed,
see
“Error
logs”
on
page
18.
If
the
server
is
halted
and
no
error
message
is
displayed,
see
“Troubleshooting
tables”
on
page
36
and
“Solving
undetermined
problems”
on
page
90.
v
For
information
about
power-supply
problems,
see
“Solving
power
problems”
on
page
88
and
“Power-supply
LEDs”
on
page
57.
v
For
intermittent
problems,
check
the
error
log;
see
“Error
logs”
on
page
18
and
“Diagnostic
programs,
messages,
and
error
codes”
on
page
59.
34
IBM
xSeries
460
Type
8872
and
xSeries
MXE
460
Type
8874:
Problem
Determination
and
Service
Guide