IBM JS20 Hardware Maintenance Manual - Page 119

Linux, service, diagela, Using

Page 119 highlights

Linux service aid ″diagela″ The following is an example of the diagela output that is automatically run by the Linux service aid tool kit that has been applied to the system. See "Service aids and the Linux system error log" on page 40 for more information about these tools. The Linux service aids for hardware diagnostics are separate from the operating system installation and are available for download from the following Web site: http://techsupport.services.ibm.com/server/lopdiags This service aid tool kit provides the key tools required to take advantage of the inherent JS20 hardware reliability, availability, and serviceability (RAS) functions. Use the SRN table to determine the description of the problem and the action plan for the SRN provided by the diagela application. For example: diagela: 02/19/2004 18:10:23 diagela: Automatic Error Log Analysis has detected a problem. diagela: diagela: The Service Request Number(s) /Probable Cause(s) diagela: (causes are listed in descending order of probability): diagela: diagela: A01-020: CPU internal cache or cache controller error diagela: Analysis of /var/log/platform sequence number: 3 Using the SRN list The service request number (SRN) list is in numerical sequence. The SRNs listed in the following tables are for all systems and devices for which this version of the diagnostic programs can produce an SRN. The columns in the table are defined as follows: Service request number Usually a six-digit number (9333 uses four digits) representing a specific failure of a specific function. Source of SRN SRN source codes identify the program or procedure that produced the SRN: A The SRN is from a progress code (i.e., 101-254E) B The SRN is from a MAP callout. C The SRN was due to a missing resource at configuration time. D The SRN is from a diagnostic test after complete isolation testing. E The SRN is from a POST failure. F The SRN is from a diagnostic test after partial isolation testing. G The SRN is from the Error Log Analysis program. H The SRN is from a diagnostic message after an 888 sequence. K The SRN is from off-line diagnostics. Failing Function Codes These numbers represent functional areas of the system unit. The "Failing Function Codes (FFCs)" on page 141 identify the FRU that contains this function for each specific system unit. Chapter 10. Symptom-to-FRU index 109

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130
  • 131
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • 143
  • 144
  • 145
  • 146
  • 147
  • 148
  • 149
  • 150
  • 151
  • 152
  • 153
  • 154
  • 155
  • 156
  • 157
  • 158
  • 159
  • 160
  • 161
  • 162
  • 163
  • 164
  • 165
  • 166
  • 167
  • 168
  • 169
  • 170
  • 171
  • 172
  • 173
  • 174
  • 175
  • 176
  • 177
  • 178
  • 179
  • 180
  • 181
  • 182
  • 183
  • 184
  • 185
  • 186
  • 187
  • 188
  • 189
  • 190
  • 191
  • 192
  • 193
  • 194
  • 195
  • 196
  • 197
  • 198
  • 199
  • 200
  • 201
  • 202
  • 203
  • 204
  • 205
  • 206
  • 207
  • 208
  • 209
  • 210
  • 211
  • 212
  • 213
  • 214
  • 215
  • 216
  • 217
  • 218

Linux
service
aid
diagela
The
following
is
an
example
of
the
diagela
output
that
is
automatically
run
by
the
Linux
service
aid
tool
kit
that
has
been
applied
to
the
system.
See
“Service
aids
and
the
Linux
system
error
log”
on
page
40
for
more
information
about
these
tools.
The
Linux
service
aids
for
hardware
diagnostics
are
separate
from
the
operating
system
installation
and
are
available
for
download
from
the
following
Web
site:
This
service
aid
tool
kit
provides
the
key
tools
required
to
take
advantage
of
the
inherent
JS20
hardware
reliability,
availability,
and
serviceability
(RAS)
functions.
Use
the
SRN
table
to
determine
the
description
of
the
problem
and
the
action
plan
for
the
SRN
provided
by
the
diagela
application.
For
example:
diagela:
02/19/2004
18:10:23
diagela:
Automatic
Error
Log
Analysis
has
detected
a
problem.
diagela:
diagela:
The
Service
Request
Number(s)
/Probable
Cause(s)
diagela:
(causes
are
listed
in
descending
order
of
probability):
diagela:
diagela:
A01-020:
CPU
internal
cache
or
cache
controller
error
diagela:
Analysis
of
/var/log/platform
sequence
number:
3
Using
the
SRN
list
The
service
request
number
(SRN)
list
is
in
numerical
sequence.
The
SRNs
listed
in
the
following
tables
are
for
all
systems
and
devices
for
which
this
version
of
the
diagnostic
programs
can
produce
an
SRN.
The
columns
in
the
table
are
defined
as
follows:
Service
request
number
Usually
a
six-digit
number
(9333
uses
four
digits)
representing
a
specific
failure
of
a
specific
function.
Source
of
SRN
SRN
source
codes
identify
the
program
or
procedure
that
produced
the
SRN:
A
The
SRN
is
from
a
progress
code
(i.e.,
101-254E)
B
The
SRN
is
from
a
MAP
callout.
C
The
SRN
was
due
to
a
missing
resource
at
configuration
time.
D
The
SRN
is
from
a
diagnostic
test
after
complete
isolation
testing.
E
The
SRN
is
from
a
POST
failure.
F
The
SRN
is
from
a
diagnostic
test
after
partial
isolation
testing.
G
The
SRN
is
from
the
Error
Log
Analysis
program.
H
The
SRN
is
from
a
diagnostic
message
after
an
888
sequence.
K
The
SRN
is
from
off-line
diagnostics.
Failing
Function
Codes
These
numbers
represent
functional
areas
of
the
system
unit.
The
“Failing
Function
Codes
(FFCs)”
on
page
141
identify
the
FRU
that
contains
this
function
for
each
specific
system
unit.
Chapter
10.
Symptom-to-FRU
index
109