HP Brocade 8/12c Fabric OS Troubleshooting and Diagnostics Guide v6.4.0 (53-10 - Page 28

Software Fault:Software Watchdog, Software Fault:Kernel Panic

Page 28 highlights

2 Switch boot issues • reboot • haFailover • fastBoot • firmwareDownload The RRD feature is activated and halts rebooting when an unexpected reboot reason is shown continuously in the reboot history within a certain period of time. The period of time is switch dependent. The following are considered unexpected reboots: • Reset A reset reboot may be caused by one of the following: - Power-cycle of the switch or CP. - Linux reboot command. - Hardware watchdog timeout. - Heartbeat loss related reboot. • Software Fault:Kernel Panic - If the system upon detecting an internal fatal error from which it cannot safely recover, generally it will output an error message to the console, dump a stack trace for debugging and then performs an automatic reboot. - After a kernel panic, the system may not have enough time to write the reboot reason causing the reboot reason to be empty. This is treated as an Unknown/reset case. • Software fault - Software Fault:Software Watchdog - Software Fault:ASSERT. • Software recovery failure This is an HA bootup related issue and happens when switch is unable to recover to a stable state. HASM log contains more detail and specific information on this type of failure, such as one of the following: - Failover recovery failed: This occurs when failover recovery failed and has to reboot the CP. - Failover when standby CP unready: Occurs when the active CP has to failover, but the standby CP is not ready to takeover mastership. - Failover when LS trans incomplete: Takes place when a logical switch transaction is incomplete. • Software bootup failure This is an HA bootup related issue and happens when a switch is unable to load the firmware to a usable state. HASM log contains more detail and specific information on this type of failure, such as one of the following: - System bring up timed out: The CP failed to come up within the time allotted. - LS configuration timed out and failed: Logical switch configuration failed and timed out. After RRD is activated, admin level permission is required to login enter the supportShow or supportSave command to collect a limited amount of data to resolve the issue. ATTENTION The limited supportSave used with the RRD feature does not support USB. 14 Fabric OS Troubleshooting and Diagnostics Guide 53-1001769-01

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127
  • 128
  • 129
  • 130

14
Fabric OS Troubleshooting and Diagnostics Guide
53-1001769-01
Switch boot issues
2
reboot
haFailover
fastBoot
firmwareDownload
The RRD feature is activated and halts rebooting when an unexpected reboot reason is shown
continuously in the reboot history within a certain period of time. The period of time is switch
dependent. The following are considered unexpected reboots:
Reset
A reset reboot may be caused by one of the following:
-
Power-cycle of the switch or CP.
-
Linux reboot command.
-
Hardware watchdog timeout.
-
Heartbeat loss related reboot.
Software Fault:Kernel Panic
-
If the system upon detecting an internal fatal error from which it cannot safely recover,
generally it will output an error message to the console, dump a stack trace for debugging
and then performs an automatic reboot.
-
After a kernel panic, the system may not have enough time to write the reboot reason
causing the reboot reason to be empty. This is treated as an Unknown/reset case.
Software fault
-
Software Fault:Software Watchdog
-
Software Fault:ASSERT.
Software recovery failure
This is an HA bootup related issue and happens when switch is unable to recover to a stable
state. HASM log contains more detail and specific information on this type of failure, such as
one of the following:
-
Failover recovery failed: This occurs when failover recovery failed and has to reboot the CP.
-
Failover when standby CP unready: Occurs when the active CP has to failover, but the
standby CP is not ready to takeover mastership.
-
Failover when LS trans incomplete: Takes place when a logical switch transaction is
incomplete.
Software bootup failure
This is an HA bootup related issue and happens when a switch is unable to load the firmware
to a usable state. HASM log contains more detail and specific information on this type of
failure, such as one of the following:
-
System bring up timed out: The CP failed to come up within the time allotted.
-
LS configuration timed out and failed: Logical switch configuration failed and timed out.
After RRD is activated, admin level permission is required to login enter the
supportShow
or
supportSave
command to collect a limited amount of data to resolve the issue.
ATTENTION
The limited
supportSave
used with the RRD feature does not support USB.