Dell PowerEdge XL 5133-4 MXL 10/40GbE Switch IO Module FTOS Command Reference - Page 269

Recognize an Over-Temperature Condition, Message 1,

Page 269 highlights

Recognize an Over-Temperature Condition An over-temperature condition occurs for one of two reasons: • The card genuinely is too hot. • A sensor has malfunctioned. Inspect cards adjacent to the one reporting condition to discover the cause. • If directly adjacent cards are not a normal temperature, suspect a genuine overheating condition. • If directly adjacent cards are a normal temperature, suspect a faulty sensor. When the system detects a genuine over-temperature condition, it powers off the card. To recognize this condition, look for the system messages in Message 1. Message 1 Over Temperature Condition System Messages CHMGR-2-MAJOR_TEMP: Major alarm: chassis temperature high (temperature reaches or exceeds threshold of [value]C) CHMGR-2-TEMP_SHUTDOWN_WARN: WARNING! temperature is [value]C; approaching shutdown threshold of [value]C To view the programmed alarm thresholds levels, including the shutdown value, use the show alarms threshold command (Figure 22-7). Figure 22-7. show alarms threshold Command Example FTOS#show alarms threshold -- Temperature Limits (deg C) -- BelowNormal Normal Elevated Critical Trip/Shutdown Unit0

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130
  • 131
  • 132
  • 133
  • 134
  • 135
  • 136
  • 137
  • 138
  • 139
  • 140
  • 141
  • 142
  • 143
  • 144
  • 145
  • 146
  • 147
  • 148
  • 149
  • 150
  • 151
  • 152
  • 153
  • 154
  • 155
  • 156
  • 157
  • 158
  • 159
  • 160
  • 161
  • 162
  • 163
  • 164
  • 165
  • 166
  • 167
  • 168
  • 169
  • 170
  • 171
  • 172
  • 173
  • 174
  • 175
  • 176
  • 177
  • 178
  • 179
  • 180
  • 181
  • 182
  • 183
  • 184
  • 185
  • 186
  • 187
  • 188
  • 189
  • 190
  • 191
  • 192
  • 193
  • 194
  • 195
  • 196
  • 197
  • 198
  • 199
  • 200
  • 201
  • 202
  • 203
  • 204
  • 205
  • 206
  • 207
  • 208
  • 209
  • 210
  • 211
  • 212
  • 213
  • 214
  • 215
  • 216
  • 217
  • 218
  • 219
  • 220
  • 221
  • 222
  • 223
  • 224
  • 225
  • 226
  • 227
  • 228
  • 229
  • 230
  • 231
  • 232
  • 233
  • 234
  • 235
  • 236
  • 237
  • 238
  • 239
  • 240
  • 241
  • 242
  • 243
  • 244
  • 245
  • 246
  • 247
  • 248
  • 249
  • 250
  • 251
  • 252
  • 253
  • 254
  • 255
  • 256
  • 257
  • 258
  • 259
  • 260
  • 261
  • 262
  • 263
  • 264
  • 265
  • 266
  • 267
  • 268
  • 269
  • 270
  • 271
  • 272
  • 273
  • 274
  • 275
  • 276
  • 277
  • 278
  • 279
  • 280
  • 281
  • 282
  • 283
  • 284
  • 285
  • 286
  • 287
  • 288
  • 289
  • 290

Debugging and Diagnostics
|
255
Recognize an Over-Temperature Condition
An over-temperature condition occurs for one of two reasons:
The card genuinely is too hot.
A sensor has malfunctioned.
Inspect cards adjacent to the one reporting condition to discover the cause.
If directly adjacent cards are not a normal temperature, suspect a genuine overheating condition.
If directly adjacent cards are a normal temperature, suspect a faulty sensor.
When the system detects a genuine over-temperature condition, it powers off the card. To recognize this
condition, look for the system messages in
Message 1
.
To view the programmed alarm thresholds levels, including the shutdown value, use the
show alarms
threshold
command (
Figure 22-7
).
Figure 22-7.
show alarms threshold Command Example
Troubleshoot an Over-Temperature Condition
To troubleshoot an over-temperature condition:
1.
Use the
show environment
commands to monitor the temperature levels.
2.
Check air flow through the system. Ensure the air ducts are clean and that all fans are working
correctly.
3.
After the software has determined that the temperature levels are within normal limits, the card can be
re-powered safely. To bring the stack unit back online, use the
power-on
command in EXEC mode.
In addition, Dell Force10 requires that you install blanks in all slots without a line card to control airflow
for adequate system cooling.
Message 1
Over Temperature Condition System Messages
CHMGR-2-MAJOR_TEMP: Major alarm: chassis temperature high (temperature reaches or exceeds threshold of
[value]C)
CHMGR-2-TEMP_SHUTDOWN_WARN: WARNING! temperature is [value]C; approaching shutdown threshold of [value]C
FTOS#show alarms threshold
--
Temperature Limits (deg C)
--
---------------------------------------------------------------------------
BelowNormal
Normal
Elevated
Critical
Trip/Shutdown
Unit0
<=40
41
71
81
86
FTOS#