IBM 8872 Service Guide - Page 50

Checkout procedure

The checkout procedure is the sequence of tasks that you should follow to diagnose a problem in the server.

About the checkout procedure

Before performing the checkout procedure for diagnosing hardware problems, review the following information:

• Read the safety information beginning on page vii.

• The diagnostic programs provide the primary methods of testing the major components of the server, such as the I/O board, Ethernet controller, keyboard, mouse (pointing device), serial ports, and hard disk drives. You can also use them to test some external devices. If you are not sure whether a problem is caused by the hardware or by the software, you can use the diagnostic programs to confirm that the hardware is working correctly.

• When you run the diagnostic programs, a single problem might cause more than one error message. When this happens, correct the cause of the first error message. The other error messages usually will not occur the next time you run the diagnostic programs.
  Exception: If there are multiple error codes or light path diagnostics LEDs that indicate a microprocessor error, the error might be in a microprocessor or in a microprocessor socket. See “Microprocessor problems” on page 42 for information about diagnosing microprocessor problems.

• Before running the diagnostic programs, you must determine whether the failing server is part of a shared hard disk drive cluster (two or more servers sharing external storage devices). If it is part of a cluster, you can run all diagnostic programs except the ones that test the storage unit (that is, a hard disk drive in the storage unit) or the storage adapter that is attached to the storage unit. The failing server might be part of a cluster if any of the following conditions is true:
  – You have identified the failing server as part of a cluster (two or more servers sharing external storage devices).
  – One or more external storage units are attached to the failing server and at least one of the attached storage units is also attached to another server or unidentifiable device.
  – One or more servers are located near the failing server.
  Important: If the server is part of a shared hard disk drive cluster, run one test at a time. Do not run any suite of tests, such as “quick” or “normal” tests, because this might enable the hard disk drive diagnostic tests.

• If the server is halted and a POST error code is displayed, see “Error logs” on page 18. If the server is halted and no error message is displayed, see “Troubleshooting tables” on page 36 and “Solving undetermined problems” on page 90.

• For information about power-supply problems, see “Solving power problems” on page 88 and “Power-supply LEDs” on page 57.

• For intermittent problems, check the error log; see “Error logs” on page 18 and “Diagnostic programs, messages, and error codes” on page 59.
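
The cluster rule in the list above can be sketched as a small decision helper. This is only an illustrative model, not part of the IBM diagnostic programs: the `ServerContext` fields, the test labels in `SHARED_STORAGE_TESTS`, and the `plan_tests` function are hypothetical names chosen for this example. It assumes each test can be labeled by the component it exercises.

```python
from dataclasses import dataclass


@dataclass
class ServerContext:
    """Observations about the failing server (field names are hypothetical)."""
    identified_as_cluster_member: bool = False
    # At least one attached external storage unit is also attached to
    # another server or to an unidentifiable device.
    shares_external_storage: bool = False
    other_servers_nearby: bool = False


def might_be_clustered(ctx: ServerContext) -> bool:
    """Return True if any of the three conditions from the guide holds."""
    return (
        ctx.identified_as_cluster_member
        or ctx.shares_external_storage
        or ctx.other_servers_nearby
    )


# Hypothetical labels for tests that touch the shared storage unit
# or the storage adapter attached to it.
SHARED_STORAGE_TESTS = {"shared_storage_unit", "shared_storage_adapter"}


def plan_tests(ctx: ServerContext, requested: list[str]) -> list[list[str]]:
    """Group the requested tests into diagnostic runs.

    Each inner list is one run. For a possibly clustered server, the
    shared-storage tests are dropped and every remaining test gets its
    own run (one test at a time, never a suite). Otherwise all
    requested tests are returned as a single run.
    """
    if not might_be_clustered(ctx):
        return [list(requested)] if requested else []
    return [[test] for test in requested if test not in SHARED_STORAGE_TESTS]
```

For example, `plan_tests(ServerContext(other_servers_nearby=True), ["ethernet", "shared_storage_unit"])` yields a single run containing only the Ethernet test. Treating "other servers nearby" as sufficient is deliberately conservative, matching the guide's wording that the server "might" be part of a cluster.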

34 IBM xSeries 460 Type 8872 and xSeries MXE 460 Type 8874: Problem Determination and Service Guide

Section	Page
Contents	5
Safety	9
Guidelines for trained service technicians	10
Inspecting for unsafe conditions	10
Guidelines for servicing electrical equipment	10
Safety statements	12
Chapter 1. Introduction	17
Related documentation	17
Notices and statements in this document	18
Features and specifications	19
Server controls, LEDs, and connectors	20
Front view	20
Rear view	21
Internal LEDs, connectors, and jumpers	24
I/O board internal connectors and jumpers	24
Memory-card connectors	25
Memory-card LEDs	25
Microprocessor-board connectors and LEDs	26
PCI-X board connectors	26
PCI-X board LEDs	27
SAS-backplane connectors	27
Chapter 2. Diagnostics	29
Diagnostic tools	29
POST	29
POST beep codes	30
Beep code descriptions	30
No-beep symptoms	34
Error logs	34
Viewing error logs from the Configuration/Setup Utility program	35
Viewing the BMC log from the diagnostic programs	36
POST error codes	36
Checkout procedure	50
About the checkout procedure	50
Performing the checkout procedure	51
Checkpoint codes (trained service technicians only)	51
Troubleshooting tables	52
CD or DVD drive problems	52
General problems	53
Hard disk drive problems	53
Intermittent problems	54
Keyboard, mouse, or pointing-device problems	54
USB keyboard, mouse, or pointing-device problems	55
Memory problems	57
Microprocessor problems	58
Monitor problems	59
Optional-device problems	61
Power problems	62
Serial port problems	63
ServerGuide problems	64
Software problems	64
Universal Serial Bus (USB) port problems	65
Video problems	65
Light path diagnostics	65
Remind button	67
Light path diagnostic LEDs	68
Power-supply LEDs	73
Diagnostic programs, messages, and error codes	75
Running the diagnostic programs	75
Diagnostic text messages	76
Viewing the test log	76
Diagnostic error codes	76
Real Time Diagnostics	93
Recovering from a BIOS update failure	93
System-error log messages	94
Solving SCSI problems	104
Solving power problems	104
Solving Ethernet controller problems	104
Solving undetermined problems	106
Calling IBM for service	107
Chapter 3. Parts listing, Type 8872 and Type 8874	109
Replaceable server components	110
Power cords	111
Chapter 4. Removing and replacing server components	115
Installation guidelines	115
System reliability guidelines	116
Working inside the server with the power on	116
Handling static-sensitive devices	116
Returning a device or component	117
Removing and replacing Tier 1 CRUs	118
Adapter	118
DVD drive	120
Hot-swap fan	121
Hot-swap power supply	122
Memory card and memory module (DIMM)	124
Removing and replacing a memory card	124
Removing and replacing a DIMM	125
Remote Supervisor Adapter II SlimLine	127
ServeRAID-8i adapter	128
Top cover and bezel	129
Removing and replacing Tier 2 CRUs	130
Battery	130
I/O board	131
Operator information panel assembly	133
PCI-X adapter guide	134
Power-supply structure	135
SAS backplane	136
Removing and replacing FRUs	137
Front-panel assembly	137
Microprocessor tray and microprocessor	138
Removing and installing a microprocessor	138
Thermal grease	141
PCI-X board assembly	142
PCI-X switch card assembly	144
Power backplane	145
Scalability cartridge assembly	146
Chapter 5. Configuration information and instructions	149
Updating the firmware	149
Configuring the server	149
Using the ServerGuide Setup and Installation CD	149
Using the UpdateXpress program	150
Using the Configuration/Setup Utility program	150
Starting the Configuration/Setup Utility program	150
Configuration/Setup Utility menu choices	151
Passwords	154
Installing and using the baseboard management controller utility programs	155
Using the SAS/SATA Configuration Utility program	156
Configuring the Ethernet controller	156
Using the PXE boot agent utility program	156
Using the ServeRAID configuration programs	157
Using the Scalable Partition Web Interface	157
Appendix A. Getting help and technical assistance	159
Before you call	159
Using the documentation	159
Getting help and information from the World Wide Web	160
Software service and support	160
Hardware service and support	160
Appendix B. Notices	161
Edition notice	161
Trademarks	162
Important notes	162
Product recycling and disposal	163
Battery return program	163
