HP StorageWorks Virtual Array 7100 Appendix A. Disk Array Controller Log - Page 15




Columns: Event Number (dec/hex), Event Name, Logged?, Predictive Maintenance Implication, Suspected Components, Mfg Fail?, Description

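(Entry continued from the previous page; its beginning is not included here.)
Event Name: ... Failure
Predictive Maintenance Implication: ... Occurrence
Suspected Components: Controller, Backplane
Description: ... Manager (BGM) has discovered that there are not enough good fans to cool the system. The BGM is shutting [remainder of sentence missing]. The NVRAM is not posted to disk, so it is critical to repair the system quickly.
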
Event Number (dec/hex): 329/0x149
Event Name: Fan Missing At Initialization
Logged?: Y
Predictive Maintenance Implication: Ignore - see accompanying errors.
Suspected Components: None
Mfg Fail?: N
Description: This error code indicates that the BGM has started timing how long a fan has been missing from the enclosure. The total length of time a fan has been missing is checked periodically; if it exceeds TIME_FAN_MISSING_ALLOWED_BGM (approx. 10 minutes), the BGM will shut off the power supply.

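Event Number (dec/hex): 330/0x14a
Event Name: Power Down Due To Missing Fan
Logged?: Y
Predictive Maintenance Implication: Single Occurrence
Suspected Components: Fan, Controller, Backplane
Mfg Fail?: N
Description: This error code indicates that the BGM is shutting off the power supply because a fan has been missing from the enclosure for too long (approx. 10 minutes).
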
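Event Number (dec/hex): 331/0x14b
Event Name: Replace Battery
Logged?: Y
Predictive Maintenance Implication: Single Occurrence
Suspected Components: Battery, Controller
Mfg Fail?: N
Description: This error code indicates that the batteries have failed a discharge test or have dropped below an acceptable voltage level. Both batteries should be replaced.
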
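Event Number (dec/hex): 332/0x14c
Event Name: Cache Shrink Attempted After Shutdown Warning
Logged?: Y
Predictive Maintenance Implication: Ignore - see errors during shutdown
Suspected Components: None
Mfg Fail?: N
Description: An attempt was made to shrink the write cache while valid writes were still in cache.
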
Event Number (dec/hex): 333/0x14d
Event Name: Controller Failed
Logged?: Y
Predictive Maintenance Implication: Ignore if recovered with reset
Suspected Components: Controller
Mfg Fail?: N
Description: This error code indicates that a controller was discovered to be bad during the power-on process. This can be caused by the controller having difficulty establishing communication. Ignore if recovered by power cycle or reset.

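Event Number (dec/hex): 334/0x14e
Event Name: SIMM Failed
Logged?: Y
Predictive Maintenance Implication: Single Occurrence
Suspected Components: SIMM, Controller
Mfg Fail?: N
Description: This error code indicates that a SIMM was discovered to be bad during the power-on process.
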
Event Number (dec/hex): 335/0x14f
Event Name: Extended Drive Insertion Event
Logged?: Y
Predictive Maintenance Implication: Ignore unless no operator activity
Suspected Components: Disk Drive, Controller, Back End FC Link
Mfg Fail?: N
Description: A back-end SCSI channel was held in reset longer than the time allowed for a drive hot-plug. Probably caused by a partially inserted drive module, a failing drive, a failing controller, or a bent pin on a connector.

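Event Number (dec/hex): 336/0x150
Event Name: Disk Interface State Event
Logged?: Y
Predictive Maintenance Implication: Single Occurrence not with other entries
Suspected Components: Controller, disk, disk interface
Mfg Fail?: N
Description: The disk interface controller and/or software state information was incorrect for the observed interface activity.
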
Event Number (dec/hex): 337/0x151
Event Name: Uncorrectable ECC Error During Initialization
Logged?: Y
Predictive Maintenance Implication: > 1 in 6 months if no operator activity
Suspected Components: SIMM, Controller
Mfg Fail?: N
Description: This error code indicates that the system experienced an unrecoverable error. The following conditions are listed:
  - The batteries have been fully discharged.
  - The batteries were disconnected from the memory.
  - New SDRAM memory was added.
  Any other occurrence is a true memory error.

Event Number (dec/hex): 338/0x152
Event Name: Correctable ECC Error During Initialization
Logged?: Y
Predictive Maintenance Implication: > 1 in 6 months if no operator activity
Suspected Components: SIMM, Controller
Mfg Fail?: N
Description: This error code indicates that the system experienced a correctable memory error. The following conditions are listed:
  - The batteries have been fully discharged.
  - The batteries were disconnected from the memory.
  - New SDRAM memory was added.
  Any other occurrence is a true memory error.

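Event Number (dec/hex): 339/0x153
Event Name: Inter-controller data path error
Logged?: Y
Predictive Maintenance Implication: > 1 in 6 mo.
Suspected Components: Controller, midplane
Mfg Fail?: Y
Description: Failure in the test of data movement between controllers over the high-speed bus. This controller has been placed in isolation mode.
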
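Event Number (dec/hex): 340/0x154
Event Name: Inter-controller address decode error
Logged?: Y
Predictive Maintenance Implication: > 1 in 6 mo.
Suspected Components: Controller, midplane
Mfg Fail?: Y
Description: Failure in the test which exercises all address bits while moving data between controllers over the high-speed bus. This controller has been placed in isolation mode.
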
Event Number (dec/hex): 341/0x155
Event Name: Inter-controller mirror error
Logged?: Y
Predictive Maintenance Implication: > 1 in 6 mo.
Suspected Components: Controller, midplane
Mfg Fail?: Y
Description: Failure in the test which performs mirrored writes and reads of data between controllers over the high-speed bus. This controller has been placed in isolation mode. If this is a transient error, a reset or power cycle will recover; otherwise, the hardware has failed.

Event Number (dec/hex): 342/0x156
Event Name: controller signature mismatch
Logged?: Y
Predictive Maintenance Implication: probably caused by user action
Suspected Components: Controller, Backplane
Mfg Fail?: N
Description: Controller doesn't match signature in midplane EEPROM. The user has probably inserted a controller with a valid image from another enclosure. This controller is in isolation mode.

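Event Number (dec/hex): 343/0x157
Event Name: controller firmware mismatch
Logged?: Y
Predictive Maintenance Implication: probably caused by user action
Suspected Components: Controller
Mfg Fail?: N
Description: Firmware on controllers is not identical. This controller is in isolation mode.
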
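Event Number (dec/hex): 344/0x158
Event Name: remote memory config error
Logged?: Y
Predictive Maintenance Implication: (none given)
Suspected Components: Controller
Mfg Fail?: N
Description: The remote controller reports that it was unable to configure its memory. The local controller is in isolation mode.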
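
The fan-missing sequence described for events 329/0x149 and 330/0x14a can be illustrated with a small model. This is only a sketch under stated assumptions, not the BGM firmware: the class name FanMissingTimer and its poll method are hypothetical, the 600-second value stands in for "approx. 10 minutes", and the model assumes the timer resets when the fan is reinserted, which the table does not state.

```python
from dataclasses import dataclass
from typing import Optional

# Assumption: the table only says "approx. 10 minutes"; 600 s is illustrative.
TIME_FAN_MISSING_ALLOWED_BGM = 600.0

EVENT_FAN_MISSING = 0x149      # 329: BGM has started timing a missing fan
EVENT_POWER_DOWN_FAN = 0x14A   # 330: power down because the fan stayed missing


@dataclass
class FanMissingTimer:
    """Hypothetical model of the BGM's periodic fan-missing check."""
    missing_since: Optional[float] = None

    def poll(self, now: float, fan_present: bool) -> Optional[int]:
        """Return the event number to log at this poll, or None."""
        if fan_present:
            # Assumption: reinserting the fan stops and clears the timer.
            self.missing_since = None
            return None
        if self.missing_since is None:
            # First poll that sees the fan missing: start timing.
            self.missing_since = now
            return EVENT_FAN_MISSING
        if now - self.missing_since > TIME_FAN_MISSING_ALLOWED_BGM:
            # Fan has been missing longer than allowed: power down.
            return EVENT_POWER_DOWN_FAN
        return None


timer = FanMissingTimer()
events = [timer.poll(t, fan_present=False) for t in (0.0, 300.0, 601.0)]
print([hex(e) if e is not None else None for e in events])
# prints ['0x149', None, '0x14a']
```

In this model the first poll logs 329/0x149, polls inside the allowed window log nothing, and the first poll past the limit logs 330/0x14a, which matches the order in which the two entries would be expected to appear in the controller log.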