IBM x3655 Service Guide - Page 146

Intermittent, problems

Page 146 highlights

Intermittent problems v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 4, "Parts listing, System x3655, Type 7985," on page 97 to determine which components are customer replaceable units (CRU) and which components are field replaceable units (FRU). v If an action step is preceded by "(Trained service technician only)," that step must be performed only by a trained service technician. Symptom Action A problem occurs only occasionally and is difficult to diagnose. 1. Make sure that: v All cables and cords are connected securely to the rear of the server and attached devices. v When the server is turned on, air is flowing from the fan grille. If there is no airflow, the fan is not working. This can cause the server to overheat and shut down. 2. Check the system event/error log or BMC system event log (see "Error logs" on page 116). 3. See "Solving undetermined problems" on page 177. The server resets (restarts) occasionally 1. If the reset occurs during POST and the POST watchdog timer is enabled (click Advanced Setup --> Baseboard Management Controller (BMC) Settings --> BMC Post Watchdog in the Configuration/Setup Utility program to see the POST watchdog setting), make sure that sufficient time is allowed in the watchdog timeout value (BMC POST Watchdog Timeout). See the User's Guide for information about the settings in the Configuration/Setup Utility program. If the server continues to reset during POST, see "POST" on page 107 and "Diagnostic programs, messages, and error codes" on page 144. 2. If the reset occurs after the operating system starts, disable any automatic server restart (ASR) utilities, such as the IBM Automatic Server Restart IPMI Application for Windows, or ASR devices that may be installed. Note: ASR utilities operate as operating-system utilities and are related to the IPMI device driver. If the reset continues to occur after the operating system starts, the operating system might have a problem; see "Software problems" on page 137. 3. If neither condition applies, check the system event/error log or BMC system event log (see "Error logs" on page 116). If the problem remains, call for service. 128 IBM System x3655 Type 7985: Problem Determination and Service Guide

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130
  • 131
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • 143
  • 144
  • 145
  • 146
  • 147
  • 148
  • 149
  • 150
  • 151
  • 152
  • 153
  • 154
  • 155
  • 156
  • 157
  • 158
  • 159
  • 160
  • 161
  • 162
  • 163
  • 164
  • 165
  • 166
  • 167
  • 168
  • 169
  • 170
  • 171
  • 172
  • 173
  • 174
  • 175
  • 176
  • 177
  • 178
  • 179
  • 180
  • 181
  • 182
  • 183
  • 184
  • 185
  • 186
  • 187
  • 188
  • 189
  • 190
  • 191
  • 192
  • 193
  • 194
  • 195
  • 196
  • 197
  • 198
  • 199
  • 200
  • 201
  • 202
  • 203
  • 204
  • 205
  • 206
  • 207
  • 208
  • 209
  • 210
  • 211
  • 212

Intermittent
problems
v
Follow
the
suggested
actions
in
the
order
in
which
they
are
listed
in
the
Action
column
until
the
problem
is
solved.
v
See
Chapter
4,
“Parts
listing,
System
x3655,
Type
7985,”
on
page
97
to
determine
which
components
are
customer
replaceable
units
(CRU)
and
which
components
are
field
replaceable
units
(FRU).
v
If
an
action
step
is
preceded
by
“(Trained
service
technician
only),”
that
step
must
be
performed
only
by
a
trained
service
technician.
Symptom
Action
A
problem
occurs
only
occasionally
and
is
difficult
to
diagnose.
1.
Make
sure
that:
v
All
cables
and
cords
are
connected
securely
to
the
rear
of
the
server
and
attached
devices.
v
When
the
server
is
turned
on,
air
is
flowing
from
the
fan
grille.
If
there
is
no
airflow,
the
fan
is
not
working.
This
can
cause
the
server
to
overheat
and
shut
down.
2.
Check
the
system
event/error
log
or
BMC
system
event
log
(see
“Error
logs”
on
page
116).
3.
See
“Solving
undetermined
problems”
on
page
177.
The
server
resets
(restarts)
occasionally
1.
If
the
reset
occurs
during
POST
and
the
POST
watchdog
timer
is
enabled
(click
Advanced
Setup
-->
Baseboard
Management
Controller
(BMC)
Settings
-->
BMC
Post
Watchdog
in
the
Configuration/Setup
Utility
program
to
see
the
POST
watchdog
setting),
make
sure
that
sufficient
time
is
allowed
in
the
watchdog
timeout
value
(
BMC
POST
Watchdog
Timeout
).
See
the
User’s
Guide
for
information
about
the
settings
in
the
Configuration/Setup
Utility
program.
If
the
server
continues
to
reset
during
POST,
see
“POST”
on
page
107
and
“Diagnostic
programs,
messages,
and
error
codes”
on
page
144.
2.
If
the
reset
occurs
after
the
operating
system
starts,
disable
any
automatic
server
restart
(ASR)
utilities,
such
as
the
IBM
Automatic
Server
Restart
IPMI
Application
for
Windows,
or
ASR
devices
that
may
be
installed.
Note:
ASR
utilities
operate
as
operating-system
utilities
and
are
related
to
the
IPMI
device
driver.
If
the
reset
continues
to
occur
after
the
operating
system
starts,
the
operating
system
might
have
a
problem;
see
“Software
problems”
on
page
137.
3.
If
neither
condition
applies,
check
the
system
event/error
log
or
BMC
system
event
log
(see
“Error
logs”
on
page
116).
If
the
problem
remains,
call
for
service.
128
IBM
System
x3655
Type
7985:
Problem
Determination
and
Service
Guide