IBM x3655 Service Guide - Page 142

Checkout procedure

The checkout procedure is the sequence of tasks that you should follow to diagnose a problem in the server.

About the checkout procedure

Before you perform the checkout procedure for diagnosing hardware problems, review the following information:

• Read the safety information that begins on page vii.

• The diagnostic programs provide the primary methods of testing the major components of the server, such as the system board, Ethernet controller, keyboard, mouse (pointing device), serial ports, and hard disk drives. You can also use them to test some external devices. If you are not sure whether a problem is caused by the hardware or by the software, you can use the diagnostic programs to confirm that the hardware is working correctly.

• When you run the diagnostic programs, a single problem might cause more than one error message. When this happens, correct the cause of the first error message. The other error messages usually will not occur the next time you run the diagnostic programs.
  Exception: If there are multiple error codes or light path diagnostic LEDs that indicate a microprocessor error, the error might be in the microprocessor or in the microprocessor socket. See “Microprocessor problems” on page 131 for information about diagnosing microprocessor problems.

• Before you run the diagnostic programs, you must determine whether the failing server is part of a shared hard disk drive cluster (two or more servers that share external storage devices). If it is part of a cluster, you can run all diagnostic programs except the ones that test the storage unit (that is, a hard disk drive in the storage unit) or the storage adapter that is attached to the storage unit. The failing server might be part of a cluster if any of the following conditions are true:
  – You have identified the failing server as part of a cluster (two or more servers that share external storage devices).
  – One or more external storage units are attached to the failing server and at least one of the attached storage units is also attached to another server or unidentifiable device.
  – One or more servers are located near the failing server.
  Important: If the server is part of a shared hard disk drive cluster, run one test at a time. Do not run any suite of tests, such as “quick” or “normal” tests, because this might enable the hard disk drive diagnostic tests.

• If the server is halted and a POST error code is displayed, see “Error logs” on page 116. If the server is halted and no error message is displayed, see “Troubleshooting tables” on page 126 and “Solving undetermined problems” on page 177.

• For information about power-supply problems, see “Solving power problems” on page 175 and “Power-supply LEDs” on page 142.

• For intermittent problems, check the error log; see “Error logs” on page 116 and “Diagnostic programs, messages, and error codes” on page 144.
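
The cluster check and its effect on test selection can be sketched as a small decision helper. This is only an illustration of the rules listed above, not part of the IBM diagnostic programs: the class, function, and test names are hypothetical, and the sketch assumes that a server that "might be" part of a cluster is treated as if it is one.

```python
from dataclasses import dataclass

# Hypothetical labels for the tests that exercise shared storage. The guide
# does not name individual tests this way; these are placeholders.
STORAGE_TESTS = frozenset({"shared_storage_drive", "shared_storage_adapter"})


@dataclass
class ServerObservations:
    """What the technician has observed about the failing server."""
    identified_as_cluster_member: bool = False
    # An attached external storage unit is also attached to another
    # server or to an unidentifiable device.
    external_storage_shared: bool = False
    other_servers_nearby: bool = False


def might_be_in_cluster(obs: ServerObservations) -> bool:
    """True if any one of the three listed conditions holds."""
    return (
        obs.identified_as_cluster_member
        or obs.external_storage_shared
        or obs.other_servers_nearby
    )


def allowed_tests(obs: ServerObservations, available: list[str]) -> list[str]:
    """Return the tests that may be run, preserving their original order.

    Assumption: a possible cluster member is handled conservatively, so the
    storage-unit and storage-adapter tests are excluded for it.
    """
    if not might_be_in_cluster(obs):
        return list(available)
    return [name for name in available if name not in STORAGE_TESTS]


def may_run_test_suite(obs: ServerObservations) -> bool:
    """Suites such as "quick" or "normal" might enable the hard disk drive
    tests, so they are allowed only when no cluster condition applies."""
    return not might_be_in_cluster(obs)
```

For example, a server with another server located nearby would be limited to individual non-storage tests, run one at a time, while a stand-alone server could run any test or suite.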

Performing the checkout procedure

To perform the checkout procedure, complete the following steps:

1. Is the server part of a cluster?

124 IBM System x3655 Type 7985: Problem Determination and Service Guide

Section	Page
Contents	5
Safety	9
Guidelines for trained service technicians	10
Inspecting for unsafe conditions	10
Guidelines for servicing electrical equipment	10
Safety statements	12
Chapter 1. Introduction	19
Related documentation	19
Notices and statements in this document	20
Features and specifications	22
Server controls, LEDs, and connectors	23
Front view	23
Rear view	25
Internal connectors, LEDs, and jumpers	26
System-board optional-device connectors	27
Riser-card optional-device connectors	28
System-board internal connectors	29
Power-backplane-card internal connectors	29
System-board external connectors	30
System-board jumpers	31
System-board LEDs	33
Riser-card assembly LEDs	33
Chapter 2. Configuration information and instructions	35
Updating the firmware	35
Configuring the server	35
Using the ServerGuide Setup and Installation CD	35
Using the Configuration/Setup Utility program	36
Using the ServeRAID configuration programs	36
Using the ServeRAID Configuration Utility program	37
Using ServeRAID Manager	37
Using the baseboard management controller	39
Enabling and configuring SOL using the OSA SMBridge management utility program	39
Installing the OSA SMBridge management utility program	48
Using the baseboard management controller utility programs	50
Updating the UUID	51
Updating the DMI/SMBIOS data	51
Chapter 3. Removing and replacing server components	53
Installation guidelines	53
System reliability guidelines	54
Working inside the server with the power on	54
Handling static-sensitive devices	55
Returning a device or component	55
Connecting the cables	55
Removing and replacing Tier 1 CRUs	57
Removing the cover	57
Installing the cover	58
Removing the air baffle	59
Installing the air baffle	60
Removing an adapter	61
Installing an adapter	62
Removing the external SAS cable	63
Installing the external SAS cable	64
Removing the Remote Supervisor Adapter II SlimLine	66
Installing the Remote Supervisor Adapter II SlimLine	67
Removing the ServeRAID SAS controller	68
Installing a ServeRAID SAS controller	69
Removing a hard disk drive	70
Installing a hard disk drive	71
Removing a CD-RW/DVD drive	73
Installing a CD-RW/DVD drive	74
Removing an optional tape drive	75
Installing an optional tape drive	75
Installing the tape drive in a 3.5-inch model server	75
Installing the tape drive in a 2.5-inch model server	77
Removing a memory module	80
Installing a memory module	80
Removing a hot-swap fan	82
Installing a hot-swap fan	83
Removing the fan-bracket assembly	84
Installing the fan-bracket assembly	85
Removing a hot-swap power supply	86
Installing a hot-swap power supply	87
Removing the battery	88
Installing the battery	90
Removing and replacing Tier 2 CRUs	91
Removing the operator information panel assembly	92
Installing the operator information panel assembly	93
Installing and removing the hard disk drive backplane	94
Removing the 3.5-inch-drive backplane	94
Installing the 3.5-inch-drive backplane	95
Removing the 2.5-inch-drive backplane	96
Installing the 2.5-inch-drive backplane	97
Removing the CD/DVD media backplane	98
Installing the CD/DVD media backplane	99
Removing the power backplane	100
Installing the power backplane	101
Removing the riser-card assembly	102
Installing the riser-card assembly	103
Removing and replacing FRUs	104
Replacing the 3.5-inch-drive center bracket	104
Removing the center bracket	104
Installing the center bracket	105
Removing a microprocessor	105
Installing a microprocessor	106
Removing a heat-sink retention module	108
Installing a heat-sink retention module	109
Removing the system board and shuttle	111
Installing the system board and shuttle	112
Chapter 4. Parts listing, System x3655, Type 7985	115
Replaceable server components	115
Power cords	121
Chapter 5. Diagnostics	125
Diagnostic tools	125
POST	125
POST beep codes	125
Beep code descriptions	126
No-beep symptoms	133
Error logs	134
Viewing error logs from the Configuration/Setup Utility program	135
Viewing the BMC system event log from the diagnostic programs	135
Clearing the error logs	135
POST error codes	136
Checkout procedure	142
About the checkout procedure	142
Performing the checkout procedure	142
Troubleshooting tables	144
CD or DVD drive problems	144
General problems	145
Hard disk drive problems	145
Intermittent problems	146
USB keyboard, mouse, or pointing-device problems	147
Memory problems	148
Microprocessor problems	149
Monitor problems	150
Optional-device problems	152
Power problems	153
Serial port problems	154
ServerGuide problems	154
Software problems	155
Universal Serial Bus (USB) port problems	156
Video problems	156
Light path diagnostics	156
Remind button	158
Light path diagnostics LEDs	159
Power-supply LEDs	160
Diagnostic programs, messages, and error codes	162
Running the diagnostic programs	162
Diagnostic text messages	163
Viewing the test log	163
Diagnostic error codes	163
Recovering the BIOS code	174
System event/error log messages	175
IPMI BMC system-error log messages	182
BIOS-logged BMC system-error log messages	191
Solving SCSI problems	193
Solving power problems	193
Solving Ethernet controller problems	194
Solving undetermined problems	195
Problem determination tips	196
Calling IBM for service	196
Appendix A. Getting help and technical assistance	199
Before you call	199
Using the documentation	199
Getting help and information from the World Wide Web	200
Software service and support	200
Hardware service and support	200
IBM Taiwan product service	200
Appendix B. Notices	201
Trademarks	201
Important notes	202
Product recycling and disposal	203
Battery return program	204
Electronic emission notices	205
Federal Communications Commission (FCC) statement	205
Industry Canada Class A emission compliance statement	205
Australia and New Zealand Class A statement	205
United Kingdom telecommunications safety requirement	205
European Union EMC Directive conformance statement	206
Taiwanese Class A warning statement	206
Chinese Class A warning statement	206
Japanese Voluntary Control Council for Interference (VCCI) statement	206
