IBM 8681 Hardware Maintenance Manual - Page 235

System Management Interrupt (SMI) Handler

Page 235 highlights

System Management Interrupt (SMI) Handler System Management Interrupt (SMI) Handler is the means of detecting system errors and logging error messages into the system error log. It is installed during the POST sequence at check point A9, and is functional thereafter. SMI Handler detects errors that are generated by system hardware such as CPU, memory and PCI devices. Any SMI-logged error message has "SMI Hdlr" in the SOURCE field as part of the system error message entry. Error messages are logged at different levels as system errors are detected. A single system failure could be the combination of errors, and it will cause multiple unique error messages to be logged in the error log. For example, a single PCI device failure will cause multiple PCI errors, and it will also cause multiple errors at the PCI Bridge level. Each of those errors will create an entry in the system error log. It is very important to retrieve all the SMI error messages (SOURCE = SMI Hdlr), the data in the ERROR CODE and ERROR DATA fields for each message, and the sequence in which the error messages were posted to the system error log. The following table describes SMI error messages, along with possible failing FRUs or appropriate action to be taken. SMI Error Message Memory UNC ECC Error on port A, DIMM yy Memory UNC ECC Error on port B, DIMM yy Memory SBC ECC Error on port A, DIMM yy Memory SBC ECC Error on port B, DIMM yy UNC on P6 Processor Bus A UNC on P6 Processor Bus B Error on processor An FRU/Action DIMM yy, port A DIMM yy, port B DIMM yy, port A DIMM yy, port B Suspect FRUs in the following order: 1. Processors on bus A 2. Processor daughter board A 3. Processor controller board Suspect FRUs in the following order: 1. Processors on bus B 2. Processor daughter board B 3. Processor controller board Suspect FRUs in the following order: 1. Run diagnostics on the processors 2. Processor An 3. Processor daughter board A 4. Processor controller board Netfinity 8500R - Type 8681 227

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130
  • 131
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • 143
  • 144
  • 145
  • 146
  • 147
  • 148
  • 149
  • 150
  • 151
  • 152
  • 153
  • 154
  • 155
  • 156
  • 157
  • 158
  • 159
  • 160
  • 161
  • 162
  • 163
  • 164
  • 165
  • 166
  • 167
  • 168
  • 169
  • 170
  • 171
  • 172
  • 173
  • 174
  • 175
  • 176
  • 177
  • 178
  • 179
  • 180
  • 181
  • 182
  • 183
  • 184
  • 185
  • 186
  • 187
  • 188
  • 189
  • 190
  • 191
  • 192
  • 193
  • 194
  • 195
  • 196
  • 197
  • 198
  • 199
  • 200
  • 201
  • 202
  • 203
  • 204
  • 205
  • 206
  • 207
  • 208
  • 209
  • 210
  • 211
  • 212
  • 213
  • 214
  • 215
  • 216
  • 217
  • 218
  • 219
  • 220
  • 221
  • 222
  • 223
  • 224
  • 225
  • 226
  • 227
  • 228
  • 229
  • 230
  • 231
  • 232
  • 233
  • 234
  • 235
  • 236
  • 237
  • 238
  • 239
  • 240
  • 241
  • 242
  • 243
  • 244
  • 245
  • 246
  • 247
  • 248
  • 249
  • 250
  • 251
  • 252
  • 253
  • 254
  • 255
  • 256
  • 257
  • 258
  • 259
  • 260
  • 261
  • 262
  • 263
  • 264
  • 265
  • 266
  • 267
  • 268
  • 269
  • 270
  • 271
  • 272
  • 273
  • 274
  • 275
  • 276
  • 277
  • 278
  • 279
  • 280
  • 281
  • 282
  • 283
  • 284
  • 285
  • 286
  • 287
  • 288
  • 289
  • 290

System Management Interrupt (SMI)
Handler
System Management Interrupt (SMI) Handler is the means
of detecting system errors and logging error messages into
the system error log.
It is installed during the POST
sequence at check point A9, and is functional thereafter.
SMI Handler detects errors that are generated by system
hardware such as CPU, memory and PCI devices.
Any
SMI-logged error message has "SMI Hdlr" in the SOURCE
field as part of the system error message entry.
Error
messages are logged at different levels as system errors
are detected.
A single system failure could be the
combination of errors, and it will cause multiple unique
error messages to be logged in the error log.
For
example, a single PCI device failure will cause multiple
PCI errors, and it will also cause multiple errors at the PCI
Bridge level.
Each of those errors will create an entry in
the system error log.
It is very important to retrieve all the
SMI error messages (SOURCE = SMI Hdlr), the data in
the ERROR CODE and ERROR DATA fields for each
message, and the sequence in which the error messages
were posted to the system error log.
The following table describes SMI error messages, along
with possible failing FRUs or appropriate action to be
taken.
SMI Error
Message
FRU/Action
Memory UNC ECC
Error on port A,
DIMM yy
DIMM yy, port A
Memory UNC ECC
Error on port B,
DIMM yy
DIMM yy, port B
Memory SBC ECC
Error on port A,
DIMM yy
DIMM yy, port A
Memory SBC ECC
Error on port B,
DIMM yy
DIMM yy, port B
UNC on P6
Processor Bus A
Suspect FRUs in the following order:
1.
Processors on bus A
2.
Processor daughter board A
3.
Processor controller board
UNC on P6
Processor Bus B
Suspect FRUs in the following order:
1.
Processors on bus B
2.
Processor daughter board B
3.
Processor controller board
Error on processor
An
Suspect FRUs in the following order:
1.
Run diagnostics on the processors
2.
Processor An
3.
Processor daughter board A
4.
Processor controller board
Netfinity 8500R - Type 8681
227