HP Integrity Superdome SX1000 Windows Integrity Management Agents Reference - Page 8

Hot Swap Cage: SCSI cable removed Probable Cause: This alert indicates that a SCSI

Page 8 highlights

Event ID Event Severity Event Description 8 Error A measured voltage in the server has gone far outside the factory specified lower voltage range. Probable Cause: The voltage in the server has gone outside the factory set range. A bad component, blown fuse, poorly seated module, loose cable, or debris could be responsible for this failure. Recommended Action: Check all boards, power supplies, and modules that either supply or use this voltage rail. SNMP Trap: HPEnvironment - 8 in HPIFP02TRAP.MIB. 9 Error Voltage sensor crossed lower non-recoverable threshold Probable Cause: The voltage in the server has gone far outside the factory set range and could damage system components. A bad component, blown fuse, poorly seated module, loose cable, or debris could be responsible for this failure. Recommended Action: Check all boards, power supplies, and modules that either supply or use this voltage rail. SNMP Trap: HPEnvironment - 9 in HPIFP02TRAP.MIB. A measured voltage in the server has gone outside the factory specified upper voltage range. Probable Cause: The voltage in the server has gone outside the factory set range. A bad component, blown fuse, poorly seated module, loose cable, or debris could be responsible for this failure. Recommended Action: Check all boards, power supplies, and modules that either supply or use this voltage rail. SNMP Trap: 10 Warning HPEnvironment - 10 in HPIFP02TRAP.MIB. 12 Error Voltage sensor crossed upper non-recoverable threshold Probable Cause: The voltage in the server has gone far outside the factory set range and could damage system components. A bad component, blown fuse, poorly seated module, loose cable, or debris could be responsible for this failure. Recommended Action: Check all boards, power supplies, and modules that either supply or use this voltage rail. SNMP Trap: HPEnvironment - 12 in HPIFP02TRAP.MIB. The server's built-in sensors have detected an open chassis door. Probable Cause: The server has detect that the chassis door or other access panel is not securely closed. Recommended Action: Close any open panels or chassis doors. SNMP Trap: 26 Warning HPChassis - 26 in HPIFP02TRAP.MIB. Hot Swap Cage: SCSI cable removed Probable Cause: This alert indicates that a SCSI cable, Jumper or Duplex Connector has either been disconnected or removed. This may make some SCSI devices inaccessible. Recommended Action: Ensure that all the required SCSI cables, jumpers or duplex connectors are correctly connected to the cage, disks, and/or the controller. SNMP Trap: SystemHW - 113 in 113 Warning HPIFP02TRAP.MIB. 518 Error Uncorrectable multi-bit ECC error has occurred Probable Cause: There has been an ECC double-bit error in one of the server's ECC memory modules. When an ECC double-bit memory error is detected, the system generates a Non-Maskable Interrupt that halts the system to prevent errors from propagating to other subsystems. Data being written or transmitted at the time may have been lost. Recommended Action: Make a note of the failed memory bank/board number and slot number, contact HP support to replace the failed module. SNMP Trap: HPECCMemory - 518 in HPIFP02TRAP.MIB. 699 Error Machine Check Initiated Probable Cause: A Machine Check Abort event means the hardware detected a critical error. This event is generated whenever a system error due to processor, firmware, hardware and operating system is encountered. MCA events may be either recoverable or non-recoverable. If it is recoverable, the system will attempt to recover from the error for the purpose of maintaining high availability. An example of which is automatic disabling of a failing processor. For non-recoverable errors, the system will either stop or reboot to prevent data corruption and unreliable operation. Recommended Action: When this event is generated, it is highly advisable to consult both the operating system and hardware event logs to find out if there are other events that may help identify the cause of the MCA. If an MCA event occurs that causes the system to reboot, the failing component may be automatically disabled and the system continue to run but at a degraded performance level while awaiting repair. Therefore, for an MCA event, HP recommends contacting HP Customer Support to determine if a repair is needed. SNMP Trap: SystemFW - 699 in HPIFP02TRAP.MIB. 8 Management Agents Event Tables

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96
  • 97
  • 98
  • 99
  • 100
  • 101
  • 102
  • 103
  • 104
  • 105
  • 106
  • 107
  • 108
  • 109
  • 110
  • 111
  • 112
  • 113
  • 114
  • 115
  • 116
  • 117
  • 118
  • 119
  • 120
  • 121
  • 122
  • 123
  • 124
  • 125
  • 126
  • 127

Event Description
Event Severity
Event ID
A measured voltage in the server has gone far outside the factory specified lower
voltage range. Probable Cause: The voltage in the server has gone outside the factory
set range. A bad component, blown fuse, poorly seated module, loose cable, or debris
could be responsible for this failure. Recommended Action: Check all boards, power
supplies, and modules that either supply or use this voltage rail. SNMP Trap:
HPEnvironment - 8 in HPIFP02TRAP.MIB.
Error
8
Voltage sensor crossed lower non-recoverable threshold Probable Cause: The voltage
in the server has gone far outside the factory set range and could damage system
components. A bad component, blown fuse, poorly seated module, loose cable, or
debris could be responsible for this failure. Recommended Action: Check all boards,
power supplies, and modules that either supply or use this voltage rail. SNMP Trap:
HPEnvironment - 9 in HPIFP02TRAP.MIB.
Error
9
A measured voltage in the server has gone outside the factory specified upper voltage
range. Probable Cause: The voltage in the server has gone outside the factory set
range. A bad component, blown fuse, poorly seated module, loose cable, or debris
could be responsible for this failure. Recommended Action: Check all boards, power
supplies, and modules that either supply or use this voltage rail. SNMP Trap:
HPEnvironment - 10 in HPIFP02TRAP.MIB.
Warning
10
Voltage sensor crossed upper non-recoverable threshold Probable Cause: The voltage
in the server has gone far outside the factory set range and could damage system
components. A bad component, blown fuse, poorly seated module, loose cable, or
debris could be responsible for this failure. Recommended Action: Check all boards,
power supplies, and modules that either supply or use this voltage rail. SNMP Trap:
HPEnvironment - 12 in HPIFP02TRAP.MIB.
Error
12
The server's built-in sensors have detected an open chassis door. Probable Cause:
The server has detect that the chassis door or other access panel is not securely closed.
Recommended Action: Close any open panels or chassis doors. SNMP Trap:
HPChassis - 26 in HPIFP02TRAP.MIB.
Warning
26
Hot Swap Cage: SCSI cable removed Probable Cause: This alert indicates that a SCSI
cable, Jumper or Duplex Connector has either been disconnected or removed. This
may make some SCSI devices inaccessible. Recommended Action: Ensure that all
the required SCSI cables, jumpers or duplex connectors are correctly connected to
the cage, disks, and/or the controller. SNMP Trap: SystemHW - 113 in
HPIFP02TRAP.MIB.
Warning
113
Uncorrectable multi-bit ECC error has occurred Probable Cause: There has been an
ECC double-bit error in one of the server's ECC memory modules. When an ECC
double-bit memory error is detected, the system generates a Non-Maskable Interrupt
that halts the system to prevent errors from propagating to other subsystems. Data
being written or transmitted at the time may have been lost. Recommended Action:
Make a note of the failed memory bank/board number and slot number, contact HP
support to replace the failed module. SNMP Trap: HPECCMemory - 518 in
HPIFP02TRAP.MIB.
Error
518
Machine Check Initiated Probable Cause: A Machine Check Abort event means the
hardware detected a critical error. This event is generated whenever a system error
due to processor, firmware, hardware and operating system is encountered. MCA
events may be either recoverable or non-recoverable. If it is recoverable, the system
will attempt to recover from the error for the purpose of maintaining high availability.
An example of which is automatic disabling of a failing processor. For non-recoverable
errors, the system will either stop or reboot to prevent data corruption and unreliable
operation. Recommended Action: When this event is generated, it is highly advisable
to consult both the operating system and hardware event logs to find out if there are
other events that may help identify the cause of the MCA. If an MCA event occurs
that causes the system to reboot, the failing component may be automatically disabled
and the system continue to run but at a degraded performance level while awaiting
repair. Therefore, for an MCA event, HP recommends contacting HP Customer
Support to determine if a repair is needed. SNMP Trap: SystemFW - 699 in
HPIFP02TRAP.MIB.
Error
699
8
Management Agents Event Tables