1. If you have not installed the ppc64 Linux utilities, perform the installation now. For instructions, go to the Linux on POWER Web site at http://techsupport.services.ibm.com/server/lopdiags/.
2. Reject the TEMP image. Using the Linux operating system, type the following command:
   echo 1 > /proc/rtas/manage_flash
3. Shut down the blade server using the operating system.
4. Restart the blade system management processor from the management module.
5. Turn on the blade server.

You might need to update the firmware code to the latest version. See “Upgrading the system firmware” on page 33 for more information on updating the firmware code.
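
Step 2 of the recovery procedure above can be wrapped in a small shell function that checks the flash-management file before writing to it. This is a minimal sketch, not part of the guide: the function name reject_temp_image and the optional path argument are assumptions introduced for illustration, and the value 1 is simply the one given in step 2.

```shell
#!/bin/sh
# Sketch only: wraps the "reject the TEMP image" command from step 2.
# reject_temp_image is a hypothetical helper name, not an IBM tool.

reject_temp_image() {
    # Default path is the one named in the guide; the optional first
    # argument exists only so the function can be dry-run on a scratch file.
    flash_file="${1:-/proc/rtas/manage_flash}"

    if [ ! -w "$flash_file" ]; then
        echo "cannot write to $flash_file" >&2
        return 1
    fi

    # Note the space before ">": "echo 1> file" would write an empty line.
    echo 1 > "$flash_file"
}
```

To run it against the real interface, call reject_temp_image with no argument as root, then continue with steps 3 to 5. Calling it with a scratch file first shows what would be written without touching the firmware.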

 

Booting the system

This section provides an overview of how to interpret the console output of the host firmware. The output is grouped into several parts, which are detailed below.

1. The first part of the boot process displays the system name and build date.
   ***************************************************************************
   CellBlade1 starting. Check Boot ROM...OK, FW is [Feb 14 2006 10:16:17]
   Note: If the flash image is corrupted, an error is displayed.

2. The memory is initialized. It takes several seconds to initialize the Rambus memory. The screen displays details of the vendor and the speed of memory modules.
   MEMORY
     Modules    = Samsung 256MB, 3200 Mhz
     XDRlibrary = v0.32, Bin A/C, RevB, DualDD
     Calibrate  = Done

3. The next screen displays system information. It shows revision information about the chipset, SMP size, boot date/time, and the available memory.
   SYSTEM INFORMATION
     Processor  = Cell BE(TM) DD3.1
     I/O Bridge = SB 3.2
     Timebase   = 14318 kHz (external)
     SMP Size   = 2 (4 threads)
     Boot-Date  = Feb 14 2006 11:41
     Memory     = 1024MB (BE0: 512MB, BE1: 512MB)

4. The open firmware section provides checkpoints and an overview of which adapters are available in the system. The details of the adapter list are not meaningful.
   Note: The warning (!) Permanent Boot ROM is displayed only if the host firmware boots from PERM and not TEMP.
   OPENFIRMWARE
    SLOF Setup = (!) Permanent Boot ROM
    SLOF Setup = Adapters:
                 5000 : 1095 680     Sil0680
    SLOF Setup = Ready

5. The build ref displays the host firmware image version.
    Build Ref  = CB1-FW-6.06.0@releae

6. The legal information, keystroke hints, and command hints are displayed. After this, the operating system boots.
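
If you capture the console output to a file, the fields described in parts 1 to 5 can be pulled out with standard tools. The following is a minimal sketch under stated assumptions: the function names and the log file are hypothetical, and the patterns are based only on the sample output shown above, so other firmware levels might print the lines differently.

```shell
#!/bin/sh
# Sketch only: extract a few fields from a captured host firmware console log.
# All function names are hypothetical; patterns follow the sample output above.

# Print the firmware build date from the "FW is [...]" line (part 1).
fw_build_date() {
    sed -n 's/.*FW is \[\([^]]*\)].*/\1/p' "$1"
}

# Print the value of a "Name = value" line, for example "Build Ref" or "Memory".
# $1 = log file, $2 = field name as printed (used as a basic regular expression).
console_field() {
    sed -n "s|^ *$2 *= *||p" "$1"
}

# Succeed if the log contains the (!) Permanent Boot ROM warning (part 4),
# which indicates that the host firmware booted from PERM and not TEMP.
booted_from_perm() {
    grep -qF '(!) Permanent Boot ROM' "$1"
}
```

For example, with the output saved in a file named console.log (an assumed name), console_field console.log "Build Ref" prints the firmware image version, and booted_from_perm console.log && echo "booted from PERM" flags a boot from the PERM image.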

 

 

18 BladeCenter QS20 Type 0200: Problem Determination and Service Guide

