Dell PowerEdge C5230 Dell Systems Hardware Owners Manual - Page 62

PCI Express* Errors, Processor Bus Error, Memory Bus Error

Page 62 highlights

PCI Express* Errors The hardware is programmed to generate an SMI on PCIe correctable, uncorrectable non-fatal, and uncorrectable fatal errors. The correctable PCIe errors are reported to the BMC as PCIe Bus Correctable errors. PCIe nonfatal and fatal errors are reported to the BMC as PCIe Bus Uncorrectable errors. The system event log for these errors includes the location of the device reporting an error which includes the PCIe link number, PCI bus number, PCI device number, and the PCI function number. An NMI is generated for PCIe Uncorrectable errors after they are logged. Processor Bus Error The BIOS enables the error correction and detection capabilities of the processors by setting appropriate bits in the processor model specific register (MSR) and the appropriate bits inside the chipset. In the case of unrecoverable errors on the host processor bus, proper execution of the asynchronous error handler (usually SMI) cannot be guaranteed and the handler cannot be relied upon to log such conditions. The handler records the error to the system event log only if the system has not experienced a catastrophic failure that compromises the integrity of the handler. Memory Bus Error The hardware is programmed to generate an SMI on correctable data errors in the memory array. The SMI handler records the error and the DIMM location to the system event log. Uncorrectable errors in the memory array are mapped to the SMI because the BMC cannot determine the location of the bad DIMM. The uncorrectable errors may have corrupted the contents of SMRAM. The SMI handler will log the failing DIMM number to the BMC if the SMRAM contents are still valid. The ability to isolate the failure down to a single DIMM may not be available on certain errors, and / or during early POST. 62 Using the System Setup Program

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130
  • 131
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • 143
  • 144
  • 145
  • 146

62
Using the System Setup Program
PCI Express* Errors
The hardware is programmed to generate an SMI on PCIe correctable,
uncorrectable non-fatal, and uncorrectable fatal errors. The correctable PCIe
errors are reported to the BMC as PCIe Bus Correctable errors. PCIe non-
fatal and fatal errors are reported to the BMC as PCIe Bus Uncorrectable
errors. The system event log for these errors includes the location of the
device reporting an error which includes the PCIe link number, PCI bus
number, PCI device number, and the PCI function number. An NMI is
generated for PCIe Uncorrectable errors after they are logged.
Processor Bus Error
The BIOS enables the error correction and detection capabilities of the
processors by setting appropriate bits in the processor model specific register
(MSR) and the appropriate bits inside the chipset.
In the case of unrecoverable errors on the host processor bus, proper
execution of the asynchronous error handler (usually SMI) cannot be
guaranteed and the handler cannot be relied upon to log such conditions. The
handler records the error to the system event log only if the system has not
experienced a catastrophic failure that compromises the integrity of the
handler.
Memory Bus Error
The hardware is programmed to generate an SMI on correctable data errors in
the memory array. The SMI handler records the error and the DIMM location
to the system event log. Uncorrectable errors in the memory array are mapped
to the SMI because the BMC cannot determine the location of the bad
DIMM. The uncorrectable errors may have corrupted the contents of
SMRAM. The SMI handler will log the failing DIMM number to the BMC if
the SMRAM contents are still valid. The ability to isolate the failure down to
a single DIMM may not be available on certain errors, and / or during early
POST.