IBM 86655RY Hardware Maintenance Manual - Page 216

Hardware Maintenance Manual: Netfinity 7600 - Type 8665 Models 1RY, 2RY

• Analyzes data from periodic internal measurements
• Recommends replacement when specific thresholds are exceeded

The data from periodic internal measurements is collected when data sectors are accessed.

Data scrubbing performs the following operations:

• Forces all data sectors to be read
• Provides more data to improve the accuracy of PFA

The thresholds have been determined by examining the history logs of drives that have failed in actual customer operation. When PFA detects a threshold exceeded failure, the system administrator can be notified through Netfinity Director. The design goal of PFA is to provide a minimum of 24 hours warning before a drive experiences “catastrophic” failure.

2. A cable breaking, a component burning out, and a solder connection failing are all examples of “on/off” unpredictable catastrophic failures. As assembly and component processes have improved, these types of defects have been reduced but not eliminated. PFA cannot always provide warning for on/off unpredictable failures.

Device Event Table: This table contains counters indicating the number of times unexpected events were reported through the storage subsystem. These events may be caused by several sources, including: ServeRAID controller, Cables (external and internal), Connectors, Hot-Swap Backplane(s), Hot-Swap Drive Trays, Target Devices (Disk Drives, CD-ROMs, etc.), and SCSI Terminators.

The Device Event Table can be displayed using the IPSSEND program or the .

Using the IPSSEND program

Note: In the following command, replace <controller> with the ServeRAID controller number.

At a command prompt, type the following:

ipssend getevent <controller> device
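The command above can also be issued from a script. The following is a minimal sketch, assuming the ipssend executable is on the search path. The function names build_getevent_command and show_device_event_table are hypothetical and chosen only for illustration. Because the layout of the command output is not described here, the sketch returns the raw text and makes no attempt to parse it.

```python
import shutil
import subprocess
from typing import List, Optional


def build_getevent_command(controller: int) -> List[str]:
    """Return the argument list for: ipssend getevent <controller> device"""
    return ["ipssend", "getevent", str(controller), "device"]


def show_device_event_table(controller: int) -> Optional[str]:
    """Run the command and return its raw output.

    Returns None when ipssend is not found on the search path
    (an assumption of this sketch, not documented behavior).
    """
    if shutil.which("ipssend") is None:
        return None
    result = subprocess.run(
        build_getevent_command(controller),
        capture_output=True,
        text=True,
        check=False,
    )
    return result.stdout


if __name__ == "__main__":
    output = show_device_event_table(1)
    print(output if output is not None else "ipssend not found on this system")
```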

Frequently asked questions regarding the Device Event Table:

In the Device Event Table, what are hard events?

The hard event count entry in the Device Event Table is a count of events detected by the SCSI I/O processor since the Device Event Table was last cleared. These events are usually not caused by the target device. The controller processor can detect many types of events. Usually these events are related to SCSI cabling, backplanes, or internal problems in the ServeRAID controller. Hard events are usually not related to the hard drives or other SCSI devices that are on the bus.

How should hard events be handled?

If you find a hard event entered into the Event log, first check to see if there is a discernible pattern to the events in the Device Event Table. For example, a large number of events on a particular drive or channel may indicate a problem with the cabling or backplane for that particular drive or channel. Always check for cables being properly seated, bent pins, pushed pins, damaged cables, and proper termination. Before replacing the ServeRAID controller, replace the SCSI cables, followed by the backplane. If you have exhausted all other possibilities, then replace the ServeRAID controller. Remember that the ServeRAID card is the least likely item in the subsystem to cause hard events and the most expensive to replace.
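The pattern check described above can be illustrated with a short sketch. Everything in it is an assumption made for illustration: the input is a hypothetical mapping from (channel, SCSI ID) to a hard event count that you have transcribed from the table, the function name dominant_channel is invented, and the 50 percent cutoff is an arbitrary choice rather than an IBM guideline.

```python
from collections import Counter
from typing import Dict, Optional, Tuple


def dominant_channel(
    hard_events: Dict[Tuple[int, int], int], share: float = 0.5
) -> Optional[int]:
    """Return the channel holding more than `share` of all hard events, else None."""
    per_channel: Counter = Counter()
    for (channel, _scsi_id), count in hard_events.items():
        per_channel[channel] += count
    total = sum(per_channel.values())
    if total == 0:
        return None
    channel, count = per_channel.most_common(1)[0]
    return channel if count / total > share else None


# Hypothetical counts keyed by (channel, SCSI ID); not real IPSSEND output.
sample = {(1, 0): 0, (1, 1): 1, (2, 0): 14, (2, 3): 9}
print(dominant_channel(sample))  # prints 2
```

A result such as channel 2 here would suggest inspecting the cabling, termination, and backplane on that channel first, in the order given above, before considering the controller.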

In the Device Event Table, what is the meaning of soft events?

The soft event entry in the Device Event Table is a count of the SCSI check conditions (other than unit attention)

