IBM JS20 Hardware Maintenance Manual - Page 31

Problem, determination, procedures, Linux - aix 6

Page 31 highlights

Chapter 4. Problem determination procedures for AIX and Linux This chapter outlines the procedure to follow if the server suspends operation without notice. Use the following procedure if any of the following is true: v The console displays - an SRN/SRC code - an 8-digit firmware error code - a 3- or 4-digit firmware checkpoint (progress) code v The server does not start up after installation v The server experiences an undetermined error while running, such as if the server stops running with no error code displayed Certain errors listed in the SRN/SRC table, Failing Function Code table, and Symptom-to-FRU index will also direct you to perform the diagnostic procedure based on the operating system and the type of problem. Problem determination Perform the steps in this section to perform the problem determination. Step 001 Check for the following information: 1. If a firmware checkpoint (progress) code (3 or 4 digits) is displayed, see "Firmware checkpoint (progress) codes" on page 94. 2. If a firmware error code (8 digits) is displayed, see "Firmware error codes" on page 102. 3. If you have an SRN or SRC, see "SRN tables" on page 110. 4. Check the BladeCenter management module event log. If an error was recorded by the system, see "SRN tables" on page 110. 5. Check the blade error LED on the information LED panel; if it is lit, see "Light path diagnostics LEDs" on page 145. 6. If the Blade has stalled, with no error codes and no command line or login prompt, continue to Step 002 7. If the login prompt appears and you still suspect a problem, continue to Step 002 . 8. If you have none of the above symptoms, go to "Undetermined problems" on page 156. Step 002 Perform the following steps: 1. Turn off the server, making sure to first turn off all external devices, if attached. 2. Check all cables and power cords. 3. Turn on all external devices; then, turn on the blade server. © Copyright IBM Corp. 2003 21

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130
  • 131
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • 143
  • 144
  • 145
  • 146
  • 147
  • 148
  • 149
  • 150
  • 151
  • 152
  • 153
  • 154
  • 155
  • 156
  • 157
  • 158
  • 159
  • 160
  • 161
  • 162
  • 163
  • 164
  • 165
  • 166
  • 167
  • 168
  • 169
  • 170
  • 171
  • 172
  • 173
  • 174
  • 175
  • 176
  • 177
  • 178
  • 179
  • 180
  • 181
  • 182
  • 183
  • 184
  • 185
  • 186
  • 187
  • 188
  • 189
  • 190
  • 191
  • 192
  • 193
  • 194
  • 195
  • 196
  • 197
  • 198
  • 199
  • 200
  • 201
  • 202
  • 203
  • 204
  • 205
  • 206
  • 207
  • 208
  • 209
  • 210
  • 211
  • 212
  • 213
  • 214
  • 215
  • 216
  • 217
  • 218

Chapter
4.
Problem
determination
procedures
for
AIX
and
Linux
This
chapter
outlines
the
procedure
to
follow
if
the
server
suspends
operation
without
notice.
Use
the
following
procedure
if
any
of
the
following
is
true:
v
The
console
displays
an
SRN/SRC
code
an
8-digit
firmware
error
code
a
3-
or
4-digit
firmware
checkpoint
(progress)
code
v
The
server
does
not
start
up
after
installation
v
The
server
experiences
an
undetermined
error
while
running,
such
as
if
the
server
stops
running
with
no
error
code
displayed
Certain
errors
listed
in
the
SRN/SRC
table,
Failing
Function
Code
table,
and
Symptom-to-FRU
index
will
also
direct
you
to
perform
the
diagnostic
procedure
based
on
the
operating
system
and
the
type
of
problem.
Problem
determination
Perform
the
steps
in
this
section
to
perform
the
problem
determination.
Step
±001²
Check
for
the
following
information:
1.
If
a
firmware
checkpoint
(progress)
code
(3
or
4
digits)
is
displayed,
see
“Firmware
checkpoint
(progress)
codes”
on
page
94.
2.
If
a
firmware
error
code
(8
digits)
is
displayed,
see
“Firmware
error
codes”
on
page
102.
3.
If
you
have
an
SRN
or
SRC,
see
“SRN
tables”
on
page
110.
4.
Check
the
BladeCenter
management
module
event
log.
If
an
error
was
recorded
by
the
system,
see
“SRN
tables”
on
page
110.
5.
Check
the
blade
error
LED
on
the
information
LED
panel;
if
it
is
lit,
see
“Light
path
diagnostics
LEDs”
on
page
145.
6.
If
the
Blade
has
stalled,
with
no
error
codes
and
no
command
line
or
login
prompt,
continue
to
Step
±002²
7.
If
the
login
prompt
appears
and
you
still
suspect
a
problem,
continue
to
Step
±002²
.
8.
If
you
have
none
of
the
above
symptoms,
go
to
“Undetermined
problems”
on
page
156.
Step
±002²
Perform
the
following
steps:
1.
Turn
off
the
server,
making
sure
to
first
turn
off
all
external
devices,
if
attached.
2.
Check
all
cables
and
power
cords.
3.
Turn
on
all
external
devices;
then,
turn
on
the
blade
server.
©
Copyright
IBM
Corp.
2003
21