IBM 866631Y Hardware Maintenance Manual - Page 16

Reliability, availability, and serviceability, Upgradable BIOS, diagnostics

Page 16 highlights

Reliability, availability, and serviceability Three of the most important features in server design are reliability, availability, and serviceability (RAS). These factors help to ensure the integrity of the data stored on the server; that the server is available when you want to use it; and that should a failure occur, you can easily diagnose and repair the failure with minimal inconvenience. The following is an abbreviated list of the RAS features that the server supports. v Menu-driven setup, system configuration, SCSISelect configuration, and diagnostic programs v Power-on self-test (POST) v Integrated Netfinity Advanced System Management Processor v Predictive Failure Analysis™ (PFA) alerts v Remote system problem-determination support v Power and temperature monitoring v Power-supply redundancy monitoring v Fault-resistant startup v Hot-swap drive bays v Error codes and messages v System error logging v Upgradable BIOS, diagnostics, and Netfinity Advanced System Management Processor code v Automatic restart after a power failure v Parity checking on the SCSI bus and the PCI bus v Error correcting code (ECC) memory v Redundant hot-swap power supplies and fans v Hot-swap cooling v Chipkill™ memory protection (optional) v Support for hot-plug PCI adapters (optional) v Redundant Ethernet capabilities (with optional adapter) v Vital Product Data (VPD) on processors, processor board, I/O board, power supplies, hard disk backplane, power backplane and VRMs. v Information and diagnostic LED panels 6 Hardware Maintenance Manual: Netfinity 7100 - Type 8666

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130
  • 131
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • 143
  • 144
  • 145
  • 146
  • 147
  • 148
  • 149
  • 150
  • 151
  • 152
  • 153
  • 154
  • 155
  • 156
  • 157
  • 158
  • 159
  • 160
  • 161
  • 162
  • 163
  • 164
  • 165
  • 166
  • 167
  • 168
  • 169
  • 170
  • 171
  • 172
  • 173
  • 174
  • 175
  • 176
  • 177
  • 178
  • 179
  • 180
  • 181
  • 182
  • 183
  • 184
  • 185
  • 186
  • 187
  • 188
  • 189
  • 190
  • 191
  • 192
  • 193
  • 194
  • 195
  • 196
  • 197
  • 198

Reliability, availability, and serviceability
Three of the most important features in server design are reliability, availability,
and serviceability (RAS). These factors help to ensure the integrity of the data
stored on the server; that the server is available when you want to use it; and that
should a failure occur, you can easily diagnose and repair the failure with minimal
inconvenience.
The following is an abbreviated list of the RAS features that the server supports.
v
Menu-driven setup, system configuration, SCSISelect configuration, and
diagnostic programs
v
Power-on self-test (POST)
v
Integrated Netfinity Advanced System Management Processor
v
Predictive Failure Analysis
(PFA) alerts
v
Remote system problem-determination support
v
Power and temperature monitoring
v
Power-supply redundancy monitoring
v
Fault-resistant startup
v
Hot-swap drive bays
v
Error codes and messages
v
System error logging
v
Upgradable BIOS, diagnostics, and Netfinity Advanced System Management
Processor code
v
Automatic restart after a power failure
v
Parity checking on the SCSI bus and the PCI bus
v
Error correcting code (ECC) memory
v
Redundant hot-swap power supplies and fans
v
Hot-swap cooling
v
Chipkill
memory protection (optional)
v
Support for hot-plug PCI adapters (optional)
v
Redundant Ethernet capabilities (with optional adapter)
v
Vital Product Data (VPD) on processors, processor board, I/O board, power
supplies, hard disk backplane, power backplane and VRMs.
v
Information and diagnostic LED panels
6
Hardware Maintenance Manual: Netfinity 7100 – Type 8666