HP StorageWorks MSA 2/8 HP StorageWorks Fabric OS Procedures V3.1.x/4.1.x User - Page 216

Troubleshooting
216
Fabric OS Procedures Version 3.1.x/4.1.x User Guide
Watchdog (Best Practices)
Watchdog is a subset of the Kernel Error Reporting Software. It reports
unexpected and fatal errors when a switch dies, and it ensures that the switch
does not send corrupted data when the software is not functioning properly.

The ASIC has a Watchdog register that the Fabric OS must probe once every two
seconds. If the ASIC detects that the Fabric OS is hung, it waits an additional
two seconds before resetting the CPU. The switch always reboots or fails over
when a Watchdog error occurs.
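
The timing described above can be illustrated with a small model. This is only a sketch for reasoning about the two-second probe interval and the additional two-second wait. It is not Fabric OS or ASIC code, and the class name, state names, and exact boundary handling are assumptions made for illustration.

```python
from dataclasses import dataclass

PROBE_INTERVAL_S = 2.0  # the OS must probe the register this often (from the text)
GRACE_PERIOD_S = 2.0    # additional wait before the CPU is reset (from the text)


@dataclass
class WatchdogModel:
    """Toy model of the ASIC Watchdog register (hypothetical, not firmware)."""

    last_probe_s: float = 0.0

    def probe(self, now_s: float) -> None:
        """Record that the OS probed the Watchdog register at time now_s."""
        self.last_probe_s = now_s

    def state(self, now_s: float) -> str:
        """Classify the watchdog state at time now_s.

        The boundary handling (<= versus <) is an assumption; the guide
        only gives the two intervals.
        """
        silent_for = now_s - self.last_probe_s
        if silent_for <= PROBE_INTERVAL_S:
            return "ok"
        if silent_for < PROBE_INTERVAL_S + GRACE_PERIOD_S:
            return "hung-suspected"
        return "reset-cpu"


if __name__ == "__main__":
    wd = WatchdogModel()
    wd.probe(0.0)
    for t in (1.5, 3.0, 4.5):
        print(t, wd.state(t))
```

In this model, a probe that arrives during the additional wait returns the state to "ok". Whether the real ASIC behaves this way is not stated in the guide.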
Actions
In the event of a Watchdog error, perform the following steps:
■ Collect the output of the supportshow command and contact Technical
  Support.
■ (Optional) Turn on settasklogmode in the event of a Watchdog error;
  this will allow more information to be collected. Do not enable this mode by
  default as it will slow traffic.
■ See specific error message for additional actions. See “Kernel Software
  Watchdog Related Errors” on page 217.
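
The first step above, collecting supportshow output, could be scripted from a management host along the following lines. This is a hedged sketch, not a documented procedure. It assumes the switch accepts SSH logins (which depends on the Fabric OS version and configuration) and that a local `ssh` client is installed. The `admin` user name is only a placeholder, and the helper names and output file naming are hypothetical. Only the `supportshow` command itself comes from the guide.

```python
import subprocess
from datetime import datetime, timezone
from pathlib import Path


def build_supportshow_command(host: str, user: str = "admin") -> list[str]:
    """Build the ssh invocation; SSH access to the switch is an assumption."""
    return ["ssh", f"{user}@{host}", "supportshow"]


def capture_filename(host: str, when: datetime) -> str:
    """Name the capture file by host and timestamp (hypothetical convention)."""
    return f"supportshow_{host}_{when:%Y%m%dT%H%M%SZ}.txt"


def collect_supportshow(host: str, user: str = "admin",
                        out_dir: Path = Path("."),
                        timeout_s: float = 600.0) -> Path:
    """Run supportshow remotely and save its output for Technical Support."""
    result = subprocess.run(
        build_supportshow_command(host, user),
        capture_output=True,
        text=True,
        timeout=timeout_s,
        check=True,  # raise CalledProcessError if ssh exits non-zero
    )
    out_path = out_dir / capture_filename(host, datetime.now(timezone.utc))
    out_path.write_text(result.stdout)
    return out_path
```

Calling `collect_supportshow("switch-hostname")` with a real switch address would write one timestamped text file that can be attached to a support case. On switches that only offer telnet, a different transport would be needed.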