IBM x3655 Service Guide - Page 148

Memory, problems

Page 148 highlights

Memory problems v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 4, "Parts listing, System x3655, Type 7985," on page 97 to determine which components are customer replaceable units (CRU) and which components are field replaceable units (FRU). v If an action step is preceded by "(Trained service technician only)," that step must be performed only by a trained service technician. Symptom Action The amount of system memory 1. Make sure that: that is displayed is less than the amount of installed physical v No error LEDs are lit on the operator information panel. memory. v Memory sparing does not account for the discrepancy. v The memory modules are seated correctly. v You have installed the correct type of memory (see "Installing a memory module" on page 62). v If you changed the memory, you updated the memory configuration in the Configuration/Setup Utility program. v All banks of memory are enabled. The server might have automatically disabled a memory bank when it detected a problem, or a memory bank might have been manually disabled. 2. Check the POST error log for error message 289: v If a DIMM was disabled by a systems-management interrupt (SMI), replace the DIMM. v If a DIMM was disabled by the user or by POST, run the Configuration/Setup Utility program and enable the DIMM. 3. Run memory diagnostics (see "Running the diagnostic programs" on page 144). 4. Make sure that there is no memory mismatch when the server is at the minimum memory configuration (two 512 MB DIMMs). 5. Add one pair of DIMMs at a time, making sure that the DIMMs in each pair match. Install the DIMMs in the sequence described in "Installing a memory module" on page 62. 6. Reseat the DIMMs. 7. Replace the following components one at a time, in the order shown, restarting the server each time: a. DIMMs b. (Trained service technician only) System board Multiple rows of DIMMs in a failing branch are identified as failing. 1. Reseat the DIMMs; then, restart the server. 2. Remove the lowest-numbered DIMM pair of those that are identified and replace it with an identical pair of known good DIMMs; then, restart the server. Repeat as necessary. If the failures continue after all identified pairs are replaced, go to step 4. 3. Return the removed DIMMs, one pair at a time, to their original connectors, restarting the server after each pair, until a pair fails. Replace each DIMM in the failed pair with an identical known good DIMM, restarting the server after each DIMM. Replace the failed DIMM. Repeat step 3 until you have tested all removed DIMMs. 4. (Trained service technician only) Replace the system board. 130 IBM System x3655 Type 7985: Problem Determination and Service Guide

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130
  • 131
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • 143
  • 144
  • 145
  • 146
  • 147
  • 148
  • 149
  • 150
  • 151
  • 152
  • 153
  • 154
  • 155
  • 156
  • 157
  • 158
  • 159
  • 160
  • 161
  • 162
  • 163
  • 164
  • 165
  • 166
  • 167
  • 168
  • 169
  • 170
  • 171
  • 172
  • 173
  • 174
  • 175
  • 176
  • 177
  • 178
  • 179
  • 180
  • 181
  • 182
  • 183
  • 184
  • 185
  • 186
  • 187
  • 188
  • 189
  • 190
  • 191
  • 192
  • 193
  • 194
  • 195
  • 196
  • 197
  • 198
  • 199
  • 200
  • 201
  • 202
  • 203
  • 204
  • 205
  • 206
  • 207
  • 208
  • 209
  • 210
  • 211
  • 212

Memory
problems
v
Follow
the
suggested
actions
in
the
order
in
which
they
are
listed
in
the
Action
column
until
the
problem
is
solved.
v
See
Chapter
4,
“Parts
listing,
System
x3655,
Type
7985,”
on
page
97
to
determine
which
components
are
customer
replaceable
units
(CRU)
and
which
components
are
field
replaceable
units
(FRU).
v
If
an
action
step
is
preceded
by
“(Trained
service
technician
only),”
that
step
must
be
performed
only
by
a
trained
service
technician.
Symptom
Action
The
amount
of
system
memory
that
is
displayed
is
less
than
the
amount
of
installed
physical
memory.
1.
Make
sure
that:
v
No
error
LEDs
are
lit
on
the
operator
information
panel.
v
Memory
sparing
does
not
account
for
the
discrepancy.
v
The
memory
modules
are
seated
correctly.
v
You
have
installed
the
correct
type
of
memory
(see
“Installing
a
memory
module”
on
page
62).
v
If
you
changed
the
memory,
you
updated
the
memory
configuration
in
the
Configuration/Setup
Utility
program.
v
All
banks
of
memory
are
enabled.
The
server
might
have
automatically
disabled
a
memory
bank
when
it
detected
a
problem,
or
a
memory
bank
might
have
been
manually
disabled.
2.
Check
the
POST
error
log
for
error
message
289:
v
If
a
DIMM
was
disabled
by
a
systems-management
interrupt
(SMI),
replace
the
DIMM.
v
If
a
DIMM
was
disabled
by
the
user
or
by
POST,
run
the
Configuration/Setup
Utility
program
and
enable
the
DIMM.
3.
Run
memory
diagnostics
(see
“Running
the
diagnostic
programs”
on
page
144).
4.
Make
sure
that
there
is
no
memory
mismatch
when
the
server
is
at
the
minimum
memory
configuration
(two
512
MB
DIMMs).
5.
Add
one
pair
of
DIMMs
at
a
time,
making
sure
that
the
DIMMs
in
each
pair
match.
Install
the
DIMMs
in
the
sequence
described
in
“Installing
a
memory
module”
on
page
62.
6.
Reseat
the
DIMMs.
7.
Replace
the
following
components
one
at
a
time,
in
the
order
shown,
restarting
the
server
each
time:
a.
DIMMs
b.
(Trained
service
technician
only)
System
board
Multiple
rows
of
DIMMs
in
a
failing
branch
are
identified
as
failing.
1.
Reseat
the
DIMMs;
then,
restart
the
server.
2.
Remove
the
lowest-numbered
DIMM
pair
of
those
that
are
identified
and
replace
it
with
an
identical
pair
of
known
good
DIMMs;
then,
restart
the
server.
Repeat
as
necessary.
If
the
failures
continue
after
all
identified
pairs
are
replaced,
go
to
step
4.
3.
Return
the
removed
DIMMs,
one
pair
at
a
time,
to
their
original
connectors,
restarting
the
server
after
each
pair,
until
a
pair
fails.
Replace
each
DIMM
in
the
failed
pair
with
an
identical
known
good
DIMM,
restarting
the
server
after
each
DIMM.
Replace
the
failed
DIMM.
Repeat
step
3
until
you
have
tested
all
removed
DIMMs.
4.
(Trained
service
technician
only)
Replace
the
system
board.
130
IBM
System
x3655
Type
7985:
Problem
Determination
and
Service
Guide