IBM JS20 Hardware Maintenance Manual - Page 73


Page 73 highlights

If the error should be analyzed, a diagnostic application is invoked and the error is analyzed. No testing is done if the diagnostics determine that the error requires a service action. Instead, the diagnostics send a message to your console, or to all system groups. The message contains the service request number (SRN). Running diagnostics in this mode is similar to using the diag -c -e -d device command.

To activate the automatic error log analysis feature on systems running AIX as the operating system, log in as root user (or use CE login) and type the following command:

    /usr/lpp/diagnostics/bin/diagela ENABLE

To disable the automatic error log analysis feature on systems running AIX, log in as root user (or use CE login) and type the following command:

    /usr/lpp/diagnostics/bin/diagela DISABLE

The diagela program can also be enabled and disabled using the Periodic Diagnostic Service Aid.

Error log analysis

This section provides information on error log analysis.

  • Error log analysis is the analysis of the AIX error log entries.
  • Error log analysis is part of the diagnostic applications. The analysis is started by selecting a device from the Diagnostic Selection menu and then using the diag command or selecting the Run Error Log Analysis task.
  • Error log analysis is only performed when running online diagnostics.
  • Error log analysis is not performed when running standalone diagnostics.
  • Error log analysis only reports problems if the errors have reached defined thresholds. Thresholds can be from 1 to 100, depending on the error.
  • Permanent errors do not necessarily mean a part should be replaced.
  • Automatic Error Log Analysis (diagela) provides the capability to do error log analysis whenever a permanent hardware error is logged.

Log repair action

The diagnostics perform error log analysis on most resources. The default time for error log analysis is seven days; however, this time can be changed from 1 to 60 days using the Display or Change Diagnostic Run Time Options task.
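The interaction between the analysis time window (seven days by default, adjustable from 1 to 60) and the per-error thresholds (1 to 100) described above can be illustrated with a small shell sketch. This is a hypothetical model only: the input format ("age in days, resource name"), the function names count_reportable and sample_log, and the sample entries are all assumptions made for illustration. It does not read the real AIX error log and does not reflect how the diagnostics are implemented internally.

```shell
#!/bin/sh
# Hypothetical sketch: NOT the actual diag/diagela implementation.
# Input on stdin: one logged error per line as "<age_in_days> <resource>".
# $1 = analysis window in days (the manual gives a default of 7, range 1-60)
# $2 = assumed threshold for this error type (the manual gives a range of 1-100)
count_reportable() {
    window_days=$1
    threshold=$2
    awk -v w="$window_days" -v t="$threshold" '
        $1 <= w { n[$2]++ }      # count only entries inside the window
        END {
            for (r in n)         # report resources that reached the threshold
                if (n[r] >= t) print r
        }
    ' | sort
}

# Made-up sample entries, for illustration only.
sample_log() {
    cat <<'EOF'
1 ent0
3 ent0
6 ent0
20 ent0
2 hdisk0
45 hdisk0
EOF
}

# With a 7-day window and a threshold of 3, only ent0 qualifies.
sample_log | count_reportable 7 3
```

In this model, widening the window to 60 days and lowering the threshold to 2 would also report hdisk0, which shows why older entries for an already-replaced part can produce a false report unless a repair action has been logged.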
To prevent false problems from being reported when error log analysis is run, repair actions need to be logged whenever a FRU is replaced. A repair action can be logged by using the Log Repair Action task or by running diagnostics in system verification mode.

The Log Repair Action task lists all resources. Replaced resources can be selected from the list, and when Commit (F7 key) is selected, a repair action is logged for each selected resource.

Tasks (service aids)

These are tasks that might be available to the JS20 blade server:

  • Add Resource to Resource List
  • AIX Shell Prompt
  • Analyze Adapter Internal Log
  • Automatic Error Log Analysis and Notification

Chapter 8. General AIX and xSeries standalone diagnostic information 63

