HP Integrity Superdome SX2000 Windows Integrity Management Agents Reference - Page 41

Too Few Bulk Power Supplies Available Probable Cause: One or more Bulk Power, No Cabinet Start

Page 41 highlights

Event ID Event Severity Event Description 7758 Error Copy of complex profile on sub and cells don't match Probable Cause: MP NVRAM was erased by removing MP from system without setting 'NVRAM SAVE' switch to on. MP was replaced with cabinet's AC Breakers 'off'. Either of first two causes and replacing or installing a cell board with cabinet's AC Breakers 'off'. Recommended Action: Remove cell board causing problem. Power complex on and allow cells to distribute their copy of complex profile to MP, then add new cell following proper OLA procedures. Remove improper cell board. Execute MP Handler 'CC' command and choose 'Last Profile'. This will load the sub with what should be the same copy as the cells. Then add new cell board. SNMP Trap: hpevtCmplxProfilIncoherent 7758 in HPIFPTRAP.MIB. 7760 Error Duplicate cabinet number detected Probable Cause: When adding a new cabinet to the complex or replacing the UGUY, the cabinet number switch was set to a number already in use. Recommended Action: Turn off AC breakers to cabinet with duplicate number. Check all other cabinet numbers in the complex for validity. Set cabinet number switch on UGUY-PCB in new cabinet (s) to proper cabinet number. Turn on AC breakers for cabinet(s). SNMP Trap: hpevtDuplicateCabinet - 7760 in HPIFPTRAP.MIB. 7767 Error MP ID command must be run Probable Cause: This is the first time the machine has been powered on and there is no valid complex profile anywhere. Recommended Action: Run 'CC' command and generate genesis profile. SNMP Trap: hpevtIdCommandRequired - 7767 in HPIFPTRAP.MIB. 7771 Warning MP Battery is low Probable Cause: MP was running on battery for too long. Someone didn't set 'NVRAM Save' switch to 'off'. Recommended Action: Replace battery as per MP Battery Remove and Replace procedures. SNMP Trap: hpevtNvramBatteryFail - 7771 in HPIFPTRAP.MIB. 7773 Error Partition being reset due to watchdog timeout expiring Probable Cause: The watchdog mechanism triggers the MP to reset a partition if its OS becomes unresponsive. An unresponsive OS is detected when the OS fails to refresh the watchdog timer before it expires. PA systems refresh the watchdog timer by emitting an event with data field set to activity level/timeout, and the timeout field specifies the desired timeout. IPF systems refresh the watchdog timer using the IPMI clear watchdog command. The MP emits this event when timer expiration triggers resetting the partition. OS-specific and platform-specific procedures are used to enable/disable the watchdog timer from resetting the partition. See platform and OS documentation for details. Recommended Action: Find out why the partition's OS had hung. The cause could be bad HW that crashed the partition, or in rare cases, a combination of events that caused the OS to be unable to refresh the watchdog timer. Look for other events preceding the timeout for clues t 7774 Warning PDHC FW was reset by hardware due to firmware inactivity. Probable Cause: Processor dependent hardware controller (PDHC) Hardware Failed; causing inactivity. Recommended Action: Even though the PDHC will reset itself without interrupting the cell, HP Support personnel should be contacted to troubleshoot the PDH daughtercard and/or cell board as soon as possible. SNMP Trap: hpevtPdhcWatchdogTimedOut - 7774 in HPIFPTRAP.MIB. 7781 Warning Power Up Aborted, Over Temp Probable Cause: Reporting Error Recommended Action: Troubleshoot ambient air sensor/cable/PM3. SNMP Trap: hpevtAbortPowerupOth - 7781 in HPIFPTRAP.MIB. 7782 Error Too Few Bulk Power Supplies Available Probable Cause: One or more Bulk Power Supplies are missing or in fault condition at Cabinet Power Up. Recommended Action: Contact your HP support representative to check for faulty Bulk Power Supplies Add Bulk Power Supplies, if under populated. SNMP Trap: hpevtAbortPwrupBps - 7782 in HPIFPTRAP.MIB. 7783 Error No Cabinet Start, Insufficient Blowers Probable Cause: The number of blowers required is a hard number. It is not dependent upon the number of entities installed in a Cabinet. The Utilities Subsystem is not allowing the Cabinet to power up due to an insufficient number of installed blowers. Recommended Action: Install missing Cabinet Blowers. If proper number of blowers are installed, troubleshoot blower presence detection. SNMP Trap: hpevtAbortStartBlowr - 7783 in HPIFPTRAP.MIB. Platform Agent Events 41

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127

Event Description
Event Severity
Event ID
Copy of complex profile on sub and cells don't match Probable Cause: MP NVRAM
was erased by removing MP from system without setting 'NVRAM SAVE' switch to
on. MP was replaced with cabinet's AC Breakers 'off'. Either of first two causes and
replacing or installing a cell board with cabinet's AC Breakers 'off'. Recommended
Action: Remove cell board causing problem. Power complex on and allow cells to
distribute their copy of complex profile to MP, then add new cell following proper
OLA procedures. Remove improper cell board. Execute MP Handler 'CC' command
and choose 'Last Profile'. This will load the sub with what should be the same copy
as the cells. Then add new cell board. SNMP Trap: hpevtCmplxProfilIncoherent -
7758 in HPIFPTRAP.MIB.
Error
7758
Duplicate cabinet number detected Probable Cause: When adding a new cabinet to
the complex or replacing the UGUY, the cabinet number switch was set to a number
already in use. Recommended Action: Turn off AC breakers to cabinet with duplicate
number. Check all other cabinet numbers in the complex for validity. Set cabinet
number switch on UGUY-PCB in new cabinet (s) to proper cabinet number. Turn on
AC breakers for cabinet(s). SNMP Trap: hpevtDuplicateCabinet - 7760 in
HPIFPTRAP.MIB.
Error
7760
MP ID command must be run Probable Cause: This is the first time the machine has
been powered on and there is no valid complex profile anywhere. Recommended
Action: Run 'CC' command and generate genesis profile. SNMP Trap:
hpevtIdCommandRequired - 7767 in HPIFPTRAP.MIB.
Error
7767
MP Battery is low Probable Cause: MP was running on battery for too long. Someone
didn't set 'NVRAM Save' switch to 'off'. Recommended Action: Replace battery as
per MP Battery Remove and Replace procedures. SNMP Trap: hpevtNvramBatteryFail
- 7771 in HPIFPTRAP.MIB.
Warning
7771
Partition being reset due to watchdog timeout expiring Probable Cause: The watchdog
mechanism triggers the MP to reset a partition if its OS becomes unresponsive. An
unresponsive OS is detected when the OS fails to refresh the watchdog timer before
it expires. PA systems refresh the watchdog timer by emitting an event with data
field set to activity level/timeout, and the timeout field specifies the desired timeout.
IPF systems refresh the watchdog timer using the IPMI clear watchdog command.
The MP emits this event when timer expiration triggers resetting the partition.
OS-specific and platform-specific procedures are used to enable/disable the watchdog
timer from resetting the partition. See platform and OS documentation for details.
Recommended Action: Find out why the partition's OS had hung. The cause could
be bad HW that crashed the partition, or in rare cases, a combination of events that
caused the OS to be unable to refresh the watchdog timer. Look for other events
preceding the timeout for clues t
Error
7773
PDHC FW was reset by hardware due to firmware inactivity. Probable Cause:
Processor dependent hardware controller (PDHC) Hardware Failed; causing inactivity.
Recommended Action: Even though the PDHC will reset itself without interrupting
the cell, HP Support personnel should be contacted to troubleshoot the PDH
daughtercard and/or cell board as soon as possible. SNMP Trap:
hpevtPdhcWatchdogTimedOut - 7774 in HPIFPTRAP.MIB.
Warning
7774
Power Up Aborted, Over Temp Probable Cause: Reporting Error Recommended
Action: Troubleshoot ambient air sensor/cable/PM3. SNMP Trap:
hpevtAbortPowerupOth - 7781 in HPIFPTRAP.MIB.
Warning
7781
Too Few Bulk Power Supplies Available Probable Cause: One or more Bulk Power
Supplies are missing or in fault condition at Cabinet Power Up. Recommended
Action: Contact your HP support representative to check for faulty Bulk Power
Supplies Add Bulk Power Supplies, if under populated. SNMP Trap:
hpevtAbortPwrupBps - 7782 in HPIFPTRAP.MIB.
Error
7782
No Cabinet Start, Insufficient Blowers Probable Cause: The number of blowers
required is a hard number. It is not dependent upon the number of entities installed
in a Cabinet. The Utilities Subsystem is not allowing the Cabinet to power up due to
an insufficient number of installed blowers. Recommended Action: Install missing
Cabinet Blowers. If proper number of blowers are installed, troubleshoot blower
presence detection. SNMP Trap: hpevtAbortStartBlowr - 7783 in HPIFPTRAP.MIB.
Error
7783
Platform Agent Events
41