IBM HS12 Service Guide - Page 129

Microprocessor problems

Page 129 highlights

v See Chapter 4, "Parts listing, Types 8014, 8028 and 1916," on page 25 to determine which components are CRUs and which components are FRUs. v If an action step is preceded by "(Trained service technician only)," that step must be performed only by a trained service technician. Symptom Action The amount of system memory 1. Make sure that: that is displayed is less than the v You have installed the correct type of memory. amount of installed physical v If you changed the memory, you updated the memory configuration in the memory. Configuration/Setup Utility program. v All banks of memory are enabled. The blade server might have automatically disabled a memory bank when it detected a problem, or a memory bank might have been manually disabled. 2. Check BMC log for error message 289: v If a DIMM was disabled by a systems-management interrupt (SMI), replace the DIMM. v If a DIMM was disabled by the user or by POST, run the Configuration/Setup Utility program and enable the DIMM. 3. Reseat the DIMM, and the optional expansion unit (if one is installed). 4. Replace the following components one at a time, in the order shown, restarting the blade server each time: a. Optional expansion unit (if one is installed) b. DIMM c. (Trained service technician only) System-board assembly Multiple rows of DIMMs in a branch are identified as failing. 1. Reseat the DIMMs; then, restart the server. 2. Remove the lowest-numbered DIMM pair of those that are identified and replace it with an identical pair of known good DIMMs; then, restart the server. Repeat as necessary. If the failures continue after all identified pairs are replaced, go to step "Memory problems" on page 116. 3. Return the removed DIMMs, one pair at a time, to their original connectors, restarting the server after each pair, until a pair fails. Replace each DIMM in the failed pair with an identical known good DIMM, restarting the server after each DIMM. Replace the failed DIMM. Repeat step "Memory problems" on page 116 until all removed DIMMs have been tested. 4. (Trained service technician only) Replace the system board. Microprocessor problems Use this information to diagnose and resolve microprocessor problems in the blade server. Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. Chapter 6. Diagnostics 117

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130
  • 131
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • 143
  • 144
  • 145
  • 146
  • 147
  • 148
  • 149
  • 150
  • 151
  • 152
  • 153
  • 154
  • 155
  • 156
  • 157
  • 158
  • 159
  • 160
  • 161
  • 162
  • 163
  • 164
  • 165
  • 166
  • 167
  • 168
  • 169
  • 170
  • 171
  • 172
  • 173
  • 174
  • 175
  • 176
  • 177
  • 178
  • 179
  • 180
  • 181
  • 182
  • 183
  • 184
  • 185
  • 186
  • 187
  • 188
  • 189
  • 190
  • 191
  • 192
  • 193
  • 194
  • 195
  • 196
  • 197
  • 198
  • 199
  • 200
  • 201
  • 202
  • 203
  • 204

v
See Chapter 4, “Parts listing, Types 8014, 8028 and 1916,” on page 25 to determine which components are CRUs
and which components are FRUs.
v
If an action step is preceded by “(Trained service technician only),” that step must be performed only by a
trained service technician.
Symptom
Action
The amount of system memory
that is displayed is less than the
amount of installed physical
memory.
1.
Make sure that:
v
You have installed the correct type of memory.
v
If you changed the memory, you updated the memory configuration in the
Configuration/Setup Utility program.
v
All banks of memory are enabled. The blade server might have
automatically disabled a memory bank when it detected a problem, or a
memory bank might have been manually disabled.
2.
Check BMC log for error message 289:
v
If a DIMM was disabled by a systems-management interrupt (SMI), replace
the DIMM.
v
If a DIMM was disabled by the user or by POST, run the
Configuration/Setup Utility program and enable the DIMM.
3.
Reseat the DIMM, and the optional expansion unit (if one is installed).
4.
Replace the following components one at a time, in the order shown, restarting
the blade server each time:
a.
Optional expansion unit (if one is installed)
b.
DIMM
c.
(Trained service technician only) System-board assembly
Multiple rows of DIMMs in a
branch are identified as failing.
1.
Reseat the DIMMs; then, restart the server.
2.
Remove the lowest-numbered DIMM pair of those that are identified and
replace it with an identical pair of known good DIMMs; then, restart the
server. Repeat as necessary. If the failures continue after all identified pairs are
replaced, go to step “Memory problems” on page 116.
3.
Return the removed DIMMs, one pair at a time, to their original connectors,
restarting the server after each pair, until a pair fails. Replace each DIMM in
the failed pair with an identical known good DIMM, restarting the server after
each DIMM. Replace the failed DIMM. Repeat step “Memory problems” on
page 116 until all removed DIMMs have been tested.
4.
(Trained service technician only) Replace the system board.
Microprocessor problems
Use this information to diagnose and resolve microprocessor problems in the blade
server.
Follow the suggested actions in the order in which they are listed in the Action
column until the problem is solved.
Chapter 6. Diagnostics
117