Intel SPSH4 Product Guide - Page 48

Field Replaceable Units and Sensor Data Records, System Event Log, Platform Event Management

Page 48 highlights

Field Replaceable Units and Sensor Data Records Field replaceable units (FRUs) are major modules in the chassis that contain active electronic circuitry. FRUs can store information-such as board serial number, part number, name, and asset tag-that can be read using the System Setup Utility (page 50). The BMC stores FRU information for the baseboard in a nonvolatile storage component on the board. The BMC uses Sensor Data Records (SDRs) to identify the sensors in the system for monitoring. SDRs provide a list of the sensors, their characteristics, location, type, and type-specific information, such as default threshold values, factors for converting a sensor reading into the appropriate units (mV, rpm, degrees Celsius), and information on the types of events that a sensor can generate. The BMC stores SDR information in a nonvolatile storage component on the baseboard. You can use the FRU/SDR Load Utility (page 62) to initialize or update the FRU and SDR information. Intel server boards are shipped from the factory with some sensors disabled, because the actual configuration of the chassis is only determined when the user completes the system configuration. Chassis-specific information in the FRU, such as chassis part number, is also absent. For these reasons, it is important to run the FRU/SDR Load Utility as part of the system setup process. You should also run the FRU/SDR Load Utility whenever you change the number of fans, processors, or power supplies in the server. System Event Log The BMC manages a system event log (SEL), where it records significant or critical system events. Such events include temperatures and voltages out of range, fan failures, and other sensor-related events. The BIOS, software, add-in cards, and other devices can also log events by sending messages to the BMC. The SEL is stored in nonvolatile storage. You can view the current contents of the SEL by using the System Setup Utility (page 50). Platform Event Management Events can trigger alerts and other actions by the BMC. The server is configured with the following set of standard events: • Temperature sensor out of range • Voltage sensor out of range • Fan failure • Chassis intrusion • Power supply fault • BIOS uncorrectable ECC error • BIOS POST error • Processor fault resilient booting (FRB) failure • Fatal nonmaskable interrupt (NMI) from a source other than the front panel switch • Watchdog timer reset, power down, or power cycle • System restart (reboot) 48 Intel SPSH4 Server Platform Product Guide

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130
  • 131
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • 143
  • 144
  • 145
  • 146
  • 147
  • 148
  • 149
  • 150
  • 151
  • 152
  • 153
  • 154
  • 155
  • 156
  • 157
  • 158
  • 159
  • 160
  • 161
  • 162
  • 163
  • 164
  • 165
  • 166
  • 167
  • 168
  • 169
  • 170

48
Intel SPSH4 Server Platform Product Guide
Field Replaceable Units and Sensor Data Records
Field replaceable units (FRUs) are major modules in the chassis that contain active electronic
circuitry.
FRUs can store information—such as board serial number, part number, name, and asset
tag—that can be read using the System Setup Utility (page 50).
The BMC stores FRU information
for the baseboard in a nonvolatile storage component on the board.
The BMC uses Sensor Data Records (SDRs) to identify the sensors in the system for monitoring.
SDRs provide a list of the sensors, their characteristics, location, type, and type-specific information,
such as default threshold values, factors for converting a sensor reading into the appropriate units
(mV, rpm, degrees Celsius), and information on the types of events that a sensor can generate.
The
BMC stores SDR information in a nonvolatile storage component on the baseboard.
You can use the FRU/SDR Load Utility (page 62) to initialize or update the FRU and SDR
information.
Intel server boards are shipped from the factory with some sensors disabled, because
the actual configuration of the chassis is only determined when the user completes the system
configuration.
Chassis-specific information in the FRU, such as chassis part number, is also absent.
For these reasons, it is important to run the FRU/SDR Load Utility as part of the system setup
process.
You should also run the FRU/SDR Load Utility whenever you change the number of fans,
processors, or power supplies in the server.
System Event Log
The BMC manages a system event log (SEL), where it records significant or critical system events.
Such events include temperatures and voltages out of range, fan failures, and other sensor-related
events.
The BIOS, software, add-in cards, and other devices can also log events by sending
messages to the BMC.
The SEL is stored in nonvolatile storage.
You can view the current contents of the SEL by using the System Setup Utility (page 50).
Platform Event Management
Events can trigger alerts and other actions by the BMC.
The server is configured with the following
set of standard events:
Temperature sensor out of range
Voltage sensor out of range
Fan failure
Chassis intrusion
Power supply fault
BIOS uncorrectable ECC error
BIOS POST error
Processor fault resilient booting (FRB) failure
Fatal nonmaskable interrupt (NMI) from a source other than the front panel switch
Watchdog timer reset, power down, or power cycle
System restart (reboot)