background image

 

HP resources for troubleshooting  87 

To create a profile and select notifications, refer to the HP website 
(

http://www.hp.com/go/subscriberschoice

). 

Change control and proactive notification

 

HP offers Change Control and Proactive Notification to notify customers 30 to 60 days in advance of 
upcoming hardware and software changes on HP commercial products. 
For more information, refer to the HP website (

http://www.hp.com/go/pcn

). 

 

HP Care Pack Services 

HP Care Pack Services offer upgraded service levels to extend and expand bundled services with easy-to-
buy, easy-to-use support packages that help you make the most of your server investments. For more 

information, see the HP website (

http://www.hp.com/services/carepack

). 

 

Product information resources 

Additional product information 

Refer to product information on the HP Servers website 
(

http://www.hp.com/country/us/eng/prodserv/servers.html

). 

 

Registering the server 

To register the server, refer to the HP Registration website (

http://register.hp.com

). 

 

Overview of server features and installation instructions 

Refer to the server user guide on the Documentation CD or on the HP Business Support Center website 

(

http://www.hp.com/go/bizsupport

). 

 

Key features, option part numbers 

Refer to the QuickSpecs on the HP website (

http://www.hp.com

). 

 

Server and option specifications, symbols, installation warnings, 

and notices 

Refer to the server documentation and printed notices. Printed notices are available in the Reference 
Information pack. Server documentation is available in the following locations: 

 

Documentation CD that ships with the server 

 

HP Business Support Center website (

http://www.hp.com/go/bizsupport

 

HP Technical Documentation website (

http://www.docs.hp.com

 

Summary of Contents for ProLiant Server

Page 1: ...HP ProLiant Servers Troubleshooting Guide Part Number 375445 009 June 2010 Ninth Edition ...

Page 2: ...or technical or editorial errors or omissions contained herein Microsoft Windows and Windows Server are U S registered trademarks of Microsoft Corporation Intel Pentium and Itanium are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries Intended audience This document is for the person who installs administers and troubleshoots serv...

Page 3: ... drive guidelines 19 SAS and SATA hard drive guidelines 19 SCSI hard drive guidelines 19 Hard drive LED combinations 20 Hot plug SCSI hard drive LED combinations 20 SAS and SATA hard drive LED combinations 20 Server updates with an HP Trusted Platform Module and BitLocker enabled 21 Diagnostic flowcharts 23 Troubleshooting flowcharts 23 Troubleshooting flowchart reference websites 23 Start diagnos...

Page 4: ...roblems 55 Modem problems 56 Network controller problems 58 Expansion board problems 59 Software problems 60 Operating system problems and resolutions 60 Operating system problems 60 Operating system updates 61 Restoring to a backed up version 62 When to Reconfigure or Reload Software 62 Linux operating systems 63 Application software problems 63 Software locks up 63 Errors occur after a software ...

Page 5: ...pported processor stepping with Intel processors 85 Unsupported processor stepping with AMD processors 85 HP resources for troubleshooting 86 Online resources 86 HP Technical Support website 86 HP Guided Troubleshooting website 86 Server documentation 86 White papers 86 Service notifications advisories and notices 86 Subscription services 86 HP Care Pack Services 87 Product information resources 8...

Page 6: ...r messages 157 A CPU Power Module System Board Socket X 158 ASR Lockup Detected Cause 158 Automatic operating system shutdown initiated due to fan failure 158 Automatic Operating System Shutdown Initiated Due to Overheat Condition 158 Blue Screen Trap Cause NT 159 Corrected Memory Error Threshold Passed Slot X Memory Module Y 159 EISA Expansion Bus Master Timeout Slot X 159 PCI Bus Error Slot X Bu...

Page 7: ...RR_6 170 MSG_CPU_RR_7 170 MSG_CPU_RR_8 171 MSG_CPU_RR_9 171 MSG_CPU_RR_10 171 MSG_CPU_RR_11 171 MSG_CPU_RR_12 171 MSG_CPU_RR_13 171 MSG_CPU_RR_14 171 MSG_CPU_RR_15 171 MSG_CPU_RR_16 171 MSG_CPU_RR_17 172 Contacting HP 173 Contacting HP technical support or an authorized reseller 173 Customer self repair 173 Server information you need 173 Operating system information you need 174 Microsoft operati...

Page 8: ...ications flowchart on page 34 Added and updated sections in Software tools and solutions on page 67 o Integrated Lights Out 3 technology on page 73 o Firmware on page 78 o HP Smart Update Manager on page 79 Added new sections to Hardware problems on page 37 o Battery pack problems on page 41 o Cable problems on page 55 Added a new section to Software problems on page 60 ROM problems on page 64 Upd...

Page 9: ...r messages and beep codes on page 115 375445 xx7 November 2008 The seventh edition of the HP ProLiant Servers Troubleshooting Guide part number 375445 xx7 included the following additions and updates Added new information about Server updates with an HP Trusted Platform Module and BitLocker enabled on page 21 to Common problem resolution on page 18 Added TPM information to Drive problems hard driv...

Page 10: ...he fifth edition of the HP ProLiant Servers Troubleshooting Guide part number 375445 xx5 included the following additions Added three new c Class server blade flowcharts o c Class server blade power on problems flowchart on page 29 o c Class server blade POST problems flowchart on page 32 o c Class server blade fault indications flowchart on page 36 Added new processor error codes o Windows Event ...

Page 11: ...ines Added hot plug SCSI hard drive LED combinations on page 20 Updated diagnostic flowcharts on page 23 Added operating system problems on page 60 Added Port 85 codes and iLO messages on page 165 Added new error messages to ADU error messages and POST error messages and beep codes Updated contacting HP o Contacting HP technical support or an authorized reseller o Server information you need ...

Page 12: ...hen a server exhibits symptoms that do not immediately pinpoint the problem use this section to begin troubleshooting The section contains a series of flowcharts that provide a common troubleshooting process for ProLiant servers The flowcharts identify a diagnostic tool or a process to help solve the problem Hardware problems on page 37 When the symptoms point to a specific component use this sect...

Page 13: ...age 16 3 Prepare the server for diagnosis on page 16 4 Use the Start diagnosis flowchart on page 25 to begin the diagnostic process Important safety information Familiarize yourself with the safety information in the following sections before troubleshooting the server Important safety information Before servicing this product read the Important Safety Information document provided with the server...

Page 14: ...pment All troubleshooting and repair procedures are detailed to allow only subassembly module level repair Because of the complexity of the individual boards and subassemblies no one should attempt to make repairs at the component level or to make modifications to any printed wiring board Improper repairs can create a safety hazard WARNING To reduce the risk of personal injury or damage to the equ...

Page 15: ...of damage may reduce the life expectancy of the device To prevent electrostatic damage Avoid hand contact by transporting and storing products in static safe containers Keep electrostatic sensitive parts in their containers until they arrive at static free workstations Place parts on a grounded surface before removing them from their containers Avoid touching pins leads or circuitry Always be prop...

Page 16: ...n page 86 2 Record any error messages displayed by the system 3 Remove all diskettes CD ROMs DVD ROMs and USB drive keys 4 Power down the server and peripheral devices if you will be diagnosing the server offline If possible always perform an orderly shutdown a Exit any applications b Exit the operating system c Power down the server 5 Disconnect any peripheral devices not required for testing any...

Page 17: ...nal processors leave one installed All additional DIMMs Leave only the minimum required to boot the server either one DIMM or a pair of DIMMs For more information see the memory guidelines in the server user guide All additional cooling fans if applicable For minimum fan configuration see the server user guide All additional power supplies if applicable leave one installed All hard drives All opti...

Page 18: ...oblems continue to occur remove and reinstall each device checking the connectors and sockets for bent pins or other damage Service notifications To view the latest service notifications refer to the HP website http www hp com go bizsupport Select the appropriate server model and then click the Troubleshoot a Problem link on the product page Firmware updates Download firmware updates from the foll...

Page 19: ... see the HP website http h20000 www2 hp com bizsupport TechSupport Document jsp lang en cc us objectID c008 68283 jumpid reg_R1002_USEN Hard drive guidelines SAS and SATA hard drive guidelines When adding hard drives to the server observe the following general guidelines The system automatically sets all drive numbers If only one hard drive is used install it in the bay with the lowest drive numbe...

Page 20: ...s being accessed but 1 it is not configured as part of an array 2 it is a replacement drive and rebuild has not yet started or 3 it is spinning up during the POST sequence Flashing Flashing Flashing Do not remove the drive Removing a drive may cause data loss in non fault tolerant configurations One or more of the following conditions may exist The drive is part of an array being selected by an ar...

Page 21: ...minate the current operation and cause data loss The drive is rebuilding erasing or it is part of an array that is undergoing capacity expansion or stripe migration Flashing irregularly Amber flashing regularly 1 Hz The drive is active but a predictive failure alert has been received for this drive Replace the drive as soon as possible Flashing irregularly Off The drive is active and it is operati...

Page 22: ...Common problem resolution 22 Moving a BitLocker protected drive to another server Adding an optional PCI device such as a storage controller or network adapter ...

Page 23: ...lowchart on page 29 POST problems flowchart on page 30 o Server and p Class server blade POST problems flowchart on page 31 o c Class server blade POST problems flowchart Operating system boot problems flowchart Server fault indications flowchart on page 33 o Server and p Class server blade fault indications flowchart on page 34 o c Class server blade fault indications flowchart Troubleshooting fl...

Page 24: ...ervice Guide select the product Select Manuals guides supplements addendums etc Under Service and maintenance information locate the link for the document 4 HP BladeSystem Power Sizer http www hp com go bladesystem powercalculator Use the Power Sizer to plan your power infrastructure and meet the needs of an HP BladeSystem solution 5 Remote management http www hp com servers lights out To locate t...

Page 25: ...Diagnostic flowcharts 25 Start diagnosis flowchart Use the following flowchart to start the diagnostic process General diagnosis flowchart ...

Page 26: ...stic flowcharts 26 The General diagnosis flowchart provides a generic approach to troubleshooting If you are unsure of the problem or if the other flowcharts do not fix the problem use the following flowchart ...

Page 27: ...may flash Both conditions represent the same symptom For the location of server LEDs and information on their statuses see the server documentation on the HP website http www hp com support Symptoms The server does not power on The system power LED is off or amber The external health LED is red flashing red amber or flashing amber The internal health LED is red flashing red amber or flashing amber...

Page 28: ...Diagnostic flowcharts 28 ...

Page 29: ...wer on problems flowchart c Class server blade power on problems flowchart For the location of server LEDs and information on their statuses see the server documentation on the HP website http www hp com support Symptoms The server does not power on ...

Page 30: ...r amber The health LED is red or amber Possible causes Improperly seated or faulty power supply Loose or faulty power cord Power source problem Improperly seated component or interlock problem POST problems flowchart Symptoms Server does not complete POST ...

Page 31: ...ted POST when the system attempts to access the boot device Server completes POST with errors Possible problems Improperly seated or faulty internal component Faulty KVM device Faulty video device Server and p Class server blade POST problems flowchart ...

Page 32: ...POST problems flowchart Operating system boot problems flowchart Symptoms Server does not boot a previously installed OS Server does not boot SmartStart Possible causes Corrupted OS Hard drive subsystem problem Incorrect boot order setting in RBSU ...

Page 33: ...lade Use iLO to remotely attach virtual devices to mount the SmartStart CD onto the server blade Use a local I O cable and drive to connect to the server blade and then restart the server blade Server fault indications flowchart Symptoms Server boots but a fault event is reported by Insight Management Agents ...

Page 34: ...d component installed Redundancy failure System overtemperature condition Server and p Class server blade fault indications flowchart Some servers have an internal health LED and an external health LED while other servers have a single system health LED The system health LED provides the same functionality as the two separate internal and external health LEDs Depending on the model the internal he...

Page 35: ...Diagnostic flowcharts 35 For the location of server LEDs and information on their statuses see the server documentation on the HP website http www hp com support ...

Page 36: ...Diagnostic flowcharts 36 c Class server blade fault indications flowchart ...

Page 37: ...sition 7 If group power capping is supported on the server be sure there is sufficient power allocation to support the server 8 Be sure no loose connections exist Loose connections on page 18 Power supply problems Action 1 Be sure no loose connections exist Loose connections on page 18 2 If the power supplies have LEDs be sure they indicate that each power supply is working properly If the LEDs in...

Page 38: ...he UPS documentation 8 If the UPS sleep mode is initiated disable sleep mode for proper operation The UPS sleep mode can be turned off through the configuration mode on the front panel 9 Change the battery to be sure damage was not caused by excessive heat particularly if a recent air conditioning outage has occurred NOTE The optimal operating temperature for UPS batteries is 25 C 77 F For approxi...

Page 39: ...mation see the server documentation 7 Be sure other components were not accidentally unseated during the installation of the new hardware component 8 Be sure all necessary software updates such as device drivers ROM updates and patches are installed and current and the correct version for the hardware is installed For example if you are using a Smart Array controller you need the latest Smart Arra...

Page 40: ...lace the system board If not be sure each of those components is working o If the system boots and video is working add each component back to the server one at a time restarting the server after each component is added to determine if that component is the cause of the problem When adding each component back to the server be sure to disconnect power to the server and follow the guidelines and cau...

Page 41: ...If a battery fails completely the HP Smart Array controller detects this condition and automatically restricts write cache functions to protect user data To help ensure uninterrupted performance levels HP recommends replacing battery packs at 3 year intervals In NiMH batteries the charging and discharging processes create and recombine inert gases which can cause the button cell to swell in size b...

Page 42: ...the cables are working properly Replace with known functional cables to test whether the original cables were faulty 4 Be sure the correct current driver is installed Diskette drive problems Diskette drive light stays on Action 1 Be sure no loose connections on page 18 exist 2 Be sure the diskette is not damaged Run the diskette utility on the diskette CHKDSK on some systems 3 Be sure the diskette...

Page 43: ...ination tables in Hard drive LED combinations on page 20 If the drive fault LED is flashing replace the hard drive See the server maintenance and service guide If the drive fault LED is not flashing and the operating system supports HP Insight Diagnostics version 7 40 or later HP Insight Diagnostics on page 75 perform the following a Run the Smart Array SCSI Diagnosis feature on page 75 b Perform ...

Page 44: ... CD and run ADU version 7 31 or later Array diagnostic software on page 76 For ADU report analysis contact HP support Contacting HP on page 173 System completes POST but drive fails Action 1 Be sure no loose connections on page 18 exist 2 Be sure no device conflict exists 3 Be sure the hard drive is cabled properly and terminated if necessary 4 Be sure the hard drive data cable is working by repla...

Page 45: ...ot defective by installing the hard drive in another bay 3 Run HP Insight Diagnostics on page 75 Then replace failed components as indicated 4 When the drive is a replacement drive on an array controller be sure that the drive is the same type and of the same or larger capacity than the original drive Data is inaccessible Action 1 Be sure the files are not corrupt Run the repair utility for the op...

Page 46: ...uirements of the server Refer to the server documentation 3 Be sure no ventilation problems exist If you have been operating the server for an extended period of time with the access panel removed airflow may have been impeded causing thermal damage to components Refer to the server documentation for further requirements 4 Be sure no POST error messages POST error messages and beep codes on page 1...

Page 47: ... by RBSU request a new system board and TPM board from an HP authorized service provider Contacting HP technical support or an authorized reseller on page 173 CAUTION Any attempt to remove an installed TPM from the system board breaks or disfigures the TPM security rivet Upon locating a broken or disfigured rivet on an installed TPM administrators should consider the system compromised and take ap...

Page 48: ... DIMM in a bank with a known working DIMM o Remove any third party memory To test the memory run HP Insight Diagnostics on page 75 Server is out of memory Action 1 Be sure the memory is configured properly Refer to the application documentation to determine the memory configuration requirements 2 Be sure no operating system errors are indicated 3 Be sure a memory count error Memory count error exi...

Page 49: ...on which you are testing the memory 7 Replace the memory See the server documentation Server fails to boot all DIMM LEDs illuminate amber the health LED is blinking red the system emits continuous beeps and an entry is logged to the Integrated Management Log IML Possible cause The server is an HP ProLiant G6 server with 5500 series Xeon processors installed The DIMMs are not installed according to...

Page 50: ...ocessor requirements see the server documentation 3 Be sure the server ROM is current If an unsupported processor detected message is displayed see Unsupported processor stepping with Intel processors on page 85 4 Be sure you are not mixing processor stepping core speeds or cache sizes if this is not supported on the server For more information see the server documentation CAUTION Removal of some ...

Page 51: ...a Press and hold the Eject button for at least 10 seconds b Allow up to 10 minutes for the tape to rewind and eject The green Ready LED should flash 3 Power cycle the drive Allow up to 10 minutes for the drive to become ready again 4 Check for conflicts in backup software services 5 Check the SCSI HBA Driver configuration of the drive 6 Inspect media and cables and discard any that are faulty or d...

Page 52: ...d cartridge seam o Usage in incorrect environment 5 Check if the Tape Error LED is flashing a Reload the suspect tape If the Tape Error LED stops flashing the problem has cleared b Load a new or known good tape If the Tape Error LED stops flashing the problem has cleared c Reload the suspect tape If the Tape Error LED flashes discard the suspect media as faulty 6 Discard any media that has been us...

Page 53: ...he rest of the server particularly with the cables that connect to the system board Be sure no foreign material exists such as screws bits or slot bracket blanks that may be short circuiting components External device problems Video problems Screen is blank for more than 60 seconds after you power up the server Action 1 Be sure the monitor power cord is plugged into a working grounded earthed AC o...

Page 54: ...o expansion board Monitor does not function properly with energy saver features Action Be sure the monitor supports energy saver features and if it does not disable the features Video colors are wrong Action Be sure the 15 pin VGA cable is securely connected to the correct VGA port on the server and to the monitor Be sure the monitor and any KVM switch are compatible with the VGA output of the ser...

Page 55: ...re no loose connections on page 18 exist 3 Be sure the correct printer drivers are installed Printer output is garbled Action Be sure the correct printer drivers are installed Cable problems Drive errors retries timeouts and unwarranted drive failures when using an older Mini SAS cable Action The Mini SAS connector life expectancy is 250 connect disconnect cycles for external internal and cable Mi...

Page 56: ...fer to the HP website http www hp com for a complete list of AT commands AT commands are not visible Action Set the echo command to On using the AT command ATE Data is displayed as garbled characters after the connection is established Action 1 Be sure both modems have the same settings including speed data parity and stop bits 2 Be sure the software is set for the correct terminal emulation a Rec...

Page 57: ...tring is AT F C1 D2 K3 Connection errors are occurring Action 1 Check the maximum baud rate for the modem to which you are connecting and then change the baud rate to match 2 If the line you are accessing requires error control to be turned off do so using the AT command AT Q6 C0 3 Be sure no line interference exists Retry the connection by dialing the number several times If conditions remain poo...

Page 58: ...er to the server and operating system documentation 6 Be sure the controller is enabled in RBSU 7 Check the PCI Hot Plug power LED to be sure the PCI slot is receiving power if applicable 8 Be sure the server ROM is up to date 9 Be sure the controller drivers are up to date 10 Be sure a valid IP address is assigned to the controller and that the configuration settings are correct 11 Run Insight Di...

Page 59: ... to be sure the correct drivers are installed 5 Refer to the operating system documentation to be sure that the driver parameters match the configuration of the network controller Problems are occurring with the network interconnect blades Action Be sure the network interconnect blades are properly seated and connected Expansion board problems System requests recovery method during expansion board...

Page 60: ...ks up Action Scan for viruses with an updated virus scan utility General protection fault occurs A general protection fault or general protection error occurs when the Microsoft operating system terminates suddenly with an error including but not limited to Miscalculating the amount of RAM needed for an allocation Transferring execution to a segment that is not executable Writing to a read only or...

Page 61: ...S installation or fail to boot after installation on servers with either three or four Intel dual core processors installed Action Microsoft Windows Server 2003 based media System may hang during installation or during boot Windows Server 2003 SP1 Slipstream does not exhibit this issue If SP1 Slipstream media is not available the base media installation can be performed using one of the following ...

Page 62: ...stem to its original factory state deletes the current hardware configuration information including array setup and disk partitioning and erases all connected hard drives completely Refer to the instructions for using this utility 2 Be sure the server has adequate resources processor speed hard drive space and memory for the software 3 Be sure the server ROM is current and the configuration is cor...

Page 63: ...stem log for entries indicating why the software failed 2 Check for incompatibility with other software on the server 3 Check the support website of the software vendor for known problems 4 Review log files for changes made to the server which may have caused the problem 5 Scan the server for viruses with an updated virus scan utility Errors occur after a software setting is changed Action Check t...

Page 64: ... in size Verification that the ROM version to which you are upgrading can be used for all the servers or array controllers that you are upgrading Follow the instructions for the Remote ROM Flash procedure that accompany the software Command line syntax error If the correct command line syntax is not used an error message describing the incorrect syntax is displayed and the program exits Correct th...

Page 65: ...the supported servers list an error message appears and the program exits Only supported systems can be upgraded using the Remote ROM Flash utility To determine if the server is supported see the HP website http www hp com support System requests recovery method during a firmware update When updating the firmware on a BitLocker encrypted server always disable BitLocker before updating the firmware...

Page 66: ...eturn the system board for a service replacement To switch to the backup ROM when the System ROM is not corrupt use RBSU HP ROM Based Setup Utility on page 67 Server blades If the system ROM is corrupted the system automatically switches to the redundant ROM in most cases If the system does not automatically switch to the redundant ROM perform the following steps 1 Power down the server 2 Remove t...

Page 67: ...ation suite for ProLiant For more information about SmartStart software see the HP Insight Foundation suite for ProLiant or the HP website http www hp com go foundation SmartStart Scripting Toolkit The SmartStart Scripting Toolkit is a server deployment product that delivers an unattended automated installation for high volume server deployments The SmartStart Scripting Toolkit is designed to supp...

Page 68: ...he Enter key Default configuration settings are applied to the server at one of the following times Upon the first system power up After defaults have been restored Default configuration settings are sufficient for proper typical server operation but configuration settings can be modified using RBSU The system will prompt you for access to RBSU with each power up Auto configuration process The aut...

Page 69: ... For more information about BIOS Serial Console see the BIOS Serial Console User Guide on the Documentation CD or the HP website http www hp com support smartstart documentation Configuring AMP modes Not all ProLiant servers support all AMP modes RBSU provides menu options only for the modes supported by the server Advanced memory protection within RBSU enables the following advanced memory Advanc...

Page 70: ...he Configuring Arrays on HP Smart Array Controllers Reference Guide on the Documentation CD or the HP website http www hp com Diagnostics tasks The ACU Diagnostics feature replaces the Array Diagnostic Utility supported by SmartStart v8 20 and earlier For each controller or for all of them you can select the following tasks View Diagnostic Report ACU generates and displays the diagnostic report Ge...

Page 71: ...t ID on an HP ProLiant G4 or G5 server use the following procedure After you replace the system board you must re enter the server serial number and the product ID 1 During the server startup sequence press the F9 key to access RBSU 2 Select the System Options menu 3 Select Serial Number The following warning is displayed WARNING WARNING WARNING The serial number is loaded into the system during t...

Page 72: ...restart when a catastrophic operating system error occurs such as a blue screen ABEND or panic A system fail safe timer the ASR timer starts when the System Management driver also known as the Health Driver is loaded When the operating system is functioning properly the system periodically resets the timer However when the operating system fails the timer expires and restarts the server ASR increa...

Page 73: ...leshooting features through the iLO or iLO 2 interface Diagnose iLO or iLO 2 using HP SIM through a web browser and SNMP alerting For more information about iLO or iLO 2 features which may require an iLO Advanced Pack or iLO Advanced for BladeSystem license see the iLO or iLO 2 documentation on the Documentation CD or on the HP website http www hp com servers lights out Integrated Lights Out 3 tec...

Page 74: ...se button on the home screen of the SmartStart CD SmartStart software on page 67 Redundant ROM support The server enables you to upgrade or configure the ROM safely with redundant ROM support The server has a single ROM that acts as two separate ROM images In the standard implementation one side of the ROM contains the current ROM program version while the other side of the ROM contains a backup v...

Page 75: ...iagnosis feature NOTE This feature is only available in HP Insight Diagnostics Online Edition The HP Insight Diagnostics Online Edition HP Insight Diagnostics on page 75 provides the capability to use non intrusive system level checks to diagnose Smart Array SCSI hard drives Diagnosis supports SCSI SATA and SAS hard drives that are attached to a Smart Array controller and configured as part of a l...

Page 76: ...al ways including the following From within HP SIM From within Survey Utility From within operating system specific IML viewers o For NetWare IML Viewer o For Windows IML Viewer o For Linux IML Viewer Application From within the iLO 3 user interface From within HP Insight Diagnostics on page 75 For more information see the Management CD in the HP Insight Foundation suite for ProLiant Array diagnos...

Page 77: ...ated with HP Systems Insight Manager SIM It provides comprehensive remote monitoring notification advisories dispatch and proactive service support for nearly all HP servers storage network and SAN environments plus selected Dell and IBM Windows servers that have a support obligation with HP It also enables HP to deliver higher levels of proactive support in line with HP Mission Critical Services ...

Page 78: ...pares installed software versions and available updates Administrators can configure VCA to point to a repository managed by VCRM For more information about version control tools see the HP Systems Insight Manager Help Guide and the Version Control User Guide on the HP Systems Insight Manager website http www hp com go hpsim ProLiant Support Packs PSPs represent operating system specific bundles o...

Page 79: ...oads the latest components from Web except Linux RPMs Enables direct update of BMC firmware iLO and LO100i For more information about HP Smart Update Manager and to access the HP Smart Update Manager User Guide see the HP website http www hp com go foundation System Online ROM flash component utility This utility is not available on HP ProLiant G6 servers or later For more information see Firmware...

Page 80: ...most of your server investments For more information see the HP website http www hp com services carepack Firmware maintenance HP has developed technologies to help ensure that HP servers provide maximum uptime with minimal maintenance Many of these technologies also reduce server management efforts enabling administrators to work on issues and resolve problems without taking servers offline The p...

Page 81: ... any reason This feature protects the existing ROM version even if you experience a power failure while flashing the ROM You can choose which ROM to use in RBSU HP ROM Based Setup Utility on page 67 Disaster recovery support The Disaster Recovery feature is supported on servers that do not support Redundant ROM When a ROM flash fails or the system ROM becomes corrupted disaster recovery enables ad...

Page 82: ...tation 4 Update the firmware to the current version supported for the hardware configuration 5 Verify the firmware update by checking the firmware version 6 If a TPM is installed and enabled on the server enable BitLocker after the firmware update is complete For more information see the operating system documentation Several tools are available for updating firmware HP recommends the following me...

Page 83: ...rvers is available as a SoftPaq download from the HP website http www hp com support The Enhanced SoftPaq download contains utilities to restore or upgrade the System ROM on ProLiant servers ROMPaq Diskette A Windows based utility to create a bootable 1 44 MB diskette that can be used to restore or update the System ROM locally ROMPaq USB Key A Windows based utility to partition format and copy fi...

Page 84: ...ce opens automatically o If you use a USB drive key you must start the interface manually Open a command line interface and enter one of the following commands to access the Firmware Maintenance CD In Windows _autorun autorun_win In Linux autorun 2 Read the End User License Agreement If you agree to the terms of the license agreement click Agree to continue The Firmware Maintenance CD interface is...

Page 85: ...M after removing the processor on page 85 Updating system ROM without removing the processor If the Unsupported Processor Detected message is displayed and you choose to leave the processor installed the system will only boot the following devices Systems ROMPaq Diskette installed in a legacy diskette drive Systems ROMPaq Diskette installed in a USB diskette drive Systems ROMPaq USB Key in diskett...

Page 86: ...hite papers contain in depth details and procedures Topics include HP products HP technology OS networking products and performance Refer to one of the following websites HP Business Support Center http www hp com go bizsupport HP Industry Standard Server Technology Papers http h18004 www1 hp com products servers technology whitepapers index html Service notifications advisories and notices To vie...

Page 87: ...ditional product information Refer to product information on the HP Servers website http www hp com country us eng prodserv servers html Registering the server To register the server refer to the HP Registration website http register hp com Overview of server features and installation instructions Refer to the server user guide on the Documentation CD or on the HP Business Support Center website h...

Page 88: ...ion instructions and board layouts Refer to the hood labels and the server user guide The hood labels are inside the access panels of the server and the server user guide is available in the following locations Documentation CD that ships with the server HP Business Support Center website http www hp com go bizsupport HP Technical Documentation website http www docs hp com External cabling informa...

Page 89: ...the SmartStart installation poster if the server supports SmartStart in the HP ProLiant Essentials Foundation Pack Installation and configuration information for the server setup software Refer to the server user guide on the Documentation CD the server installation poster shipped with the server and the SmartStart installation poster if the server supports SmartStart in the HP ProLiant Essentials...

Page 90: ...Guide on the Management CD or the HP website http www hp com go hpsim Fault tolerance security care and maintenance configuration and setup Refer to the server documentation available in the following locations Documentation CD that ships with the server HP Business Support Center website http www hp com go bizsupport HP Technical Documentation website http www docs hp com ...

Page 91: ...celerator board Action Install an array accelerator board on an array controller If an array accelerator board is installed check for proper seating on the array controller board Accelerator Error Log Description List of the last 32 parity errors on transfers to or from the memory on the array accelerator board Displays starting memory address transfer count and operation read and write Action If ...

Page 92: ... drives Posted writes operations are restored Accelerator Status Dirty Data Detected Unable to write dirty data to drives Description At least one cache line contains dirty data that the controller has been unable to flush write to the drives This problem usually occurs when a problem with the drive or drives occurs Action Resolve the problem with the drive or drives The controller can then write ...

Page 93: ...board has been permanently disabled It will remain disabled until it is reinitialized using ACU Action Check the Disable Code field Run ACU Array Configuration Utility on page 70 to reinitialize the array accelerator board Accelerator Status Possible Data Loss in Cache Description Possible data loss was detected during power up due to all batteries being below sufficient voltage level and no prese...

Page 94: ...f the batteries do not recharge within 36 powered on hours Board in Use by Expand Operation Description Array accelerator memory is in use by a capacity expansion or RAID migration Action The array accelerator is automatically re enabled for caching when the capacity expansion or RAID operation completes Board not Attached Description An array controller is configured for use with array accelerato...

Page 95: ...tup utility to configure the NVRAM Controller Firmware Needs Upgrading Description Controller firmware is below the latest recommended version Action Update the controller to the latest firmware version Firmware maintenance on page 80 Controller is Located in Special Video Slot Description Controller is installed in the slot for special video control signals If the controller is used in this slot ...

Page 96: ...ware Needs Upgrading Description Firmware on this physical drive is below the latest recommended version Action Update the drive to the latest firmware version Firmware maintenance on page 80 Drive Bay X has Insufficient Capacity for its Configuration Description Drive has insufficient capacity to be used in this logical drive configuration Action Replace this drive with a larger capacity drive Dr...

Page 97: ...ing down any external drive enclosures Drive Bay X is Failed Description The indicated physical drive has failed Action 1 Check for loose cable connections Loose connections on page 18 2 If cable connectors are secure replace the drive Drive Bay X is Undergoing Drive Recovery Description This drive is being rebuilt from the corresponding mirror or parity data Action No action is required Drive Bay...

Page 98: ...ive X Indicates Position Y Description Message indicates a designated physical drive which seems to be scrambled or in a drive bay other than the one for which it was originally configured Action 1 Examine the graphical drive representation on ADU Array diagnostic software on page 76 to determine proper drive locations 2 Power down the server 3 Remove drive X and place it in drive position Y 4 Rea...

Page 99: ... Sufficient Voltage Description The operation of the array accelerator board has been disabled due to less than 75 of the battery packs being at the sufficient voltage level Action Replace the array accelerator board if the batteries do not recharge within 36 powered on hours Less Than 75 of Batteries at Sufficient Voltage Battery Pack X Below Reference Voltage Description Battery pack on the arra...

Page 100: ...m does not detect a configured physical drive or an external storage unit that was previously detected before the last system shutdown This event can occur if the user removes one or more drives after the system is powered down or if a loose cable or malfunction prevents the drives from spinning up Action If a drive or enclosure has been removed or disconnected do the following 1 Power down the se...

Page 101: ...figured physical drive or an external storage unit that was previously detected before the last system shutdown This event can occur if the user removes one or more drives after the system is powered down or if a loose cable or malfunction prevents the drives from spinning up Action If a drive or enclosure has been removed or disconnected do the following 1 Power down the server 2 Check cabling 3 ...

Page 102: ... are using the same capacity array accelerator Processor Reduced Power Mode Enabled in RBSU Description Processors clocked down Action If you select the reduced power mode in RBSU the processor are displayed as their reduced speed during POST This message indicates that the RBSU reduced power mode has been enabled and also indicates the maximum speed for the installed processors Processor Not Star...

Page 103: ...ondition that caused the error if possible or replace the drive SCSI Port X Drive ID Y Firmware Needs Upgrading Description Drive firmware may cause problems and should be upgraded Action Update the drive to the latest firmware version Firmware maintenance on page 80 SCSI Port X Drive ID Y Has Exceeded the Following Threshold s Description The monitor and performance threshold for this drive has b...

Page 104: ...tunity Refer to the server documentation for drive replacement information before performing this operation SCSI Port X Drive ID Y S M A R T Predictive Failure Errors Have Been Detected in the Power Monitor and Performance Data SOLUTION Please replace this drive when conditions permit Description A predictive failure warning for this hard drive has been generated indicating a drive failure is immi...

Page 105: ...en Action Be sure the side panel of the storage unit is securely closed Storage Enclosure on SCSI Bus X Indicated a Power Supply Failure SOLUTION Replace the power supply Description A power supply in the external storage unit has failed Action Replace the power supply Storage Enclosure on SCSI Bus X Indicated an Overheated Condition SOLUTION Make sure all cooling fans are operating properly Also ...

Page 106: ...ion error detected A configured array of drives was moved from another controller that supported more drives than this controller supports SOLUTION Upgrade the firmware on this controller If this doesn t solve the problem then power down system and move the drives back to the original controller Description You have exceeded the maximum number of drives supported for this controller and the connec...

Page 107: ...fer to the server documentation for supported configurations and cabling guidelines 2 Restore to the original configuration Swapped cables or configuration error detected The configuration information on the attached drives is not backward compatible with this controller s firmware SOLUTION Upgrade the firmware on this controller If this doesn t solve the problem then power down system then move d...

Page 108: ...other controller If both controllers give POST messages in one slot but not the other it is a system board problem If one of the controllers gives POST messages and the other controller does not replace the controller that is giving the POST messages Contact an authorized service provider for any warranty replacements The Redundant Controllers Installed are not the Same Model SOLUTION Power down t...

Page 109: ...y error messages displayed by the controller If this does not solve the problem contact an HP authorized service provider Contacting HP technical support or an authorized reseller on page 173 Unknown Disable Code Description A code was returned from the array accelerator board that ADU does not recognize Action Obtain the latest version of ADU Array diagnostic software on page 76 Unrecoverable Rea...

Page 110: ...ects that NVRAM is corrupted The default values are restored This message does not display if a user has intentionally invalidated the configuration through RBSU by erasing NVRAM WARNING Resetting Corrupted System Environment Description This informational message is displayed when the System Environment Variables are corrupted The default values are restored This message does not display if a use...

Page 111: ... list of all ADU Array diagnostic software on page 76 error messages ADU is being replaced by the ACU diagnostics feature Diagnostics tasks on page 70 If the following versions are installed on the server see the messages in this section ADU version 8 0 through ADU version 8 25 ACU diagnostics 8 28 and later Array Accelerator The batteries were hot removed Action Replace the batteries Array Accele...

Page 112: ...ion Upgrade the HBA If the problem persists contact HP support Contacting HP on page 173 Drive Offline due to Erase Operation The logical drive is offline from having an erase in progress Action No action is required The logical drive will be offline temporarily Logical drive migrate and extend operations are not possible while the erase operation is in progress Drive Offline due to Erase Operatio...

Page 113: ...will start when I O is performed on the drive When background parity initialization completes the performance of the logical drive will improve Action No action is required Logical drive state The current array controller is performing capacity expansion extension or migration on this logical drive Action No action is required Further configuration is disabled until the process completes Logical d...

Page 114: ...ing HP on page 173 NVRAM Error Bootstrap NVRAM image failed checksum test but a backup image was found and successfully restored A system restart is needed Action Restart the server NVRAM Error Bootstrap NVRAM image failed checksum test and could not be restored This error may or may not be recoverable A firmware update might be able to correct the error Action Update the controller firmware If th...

Page 115: ... a recommended configuration Action To correctly connect the cables to the storage system see the product user guide POST error messages and beep codes Introduction to POST error messages The error messages and codes in this section include all messages generated by ProLiant servers Some messages are informational only and do not indicate any error A server generates only the codes that are applic...

Page 116: ...None Advanced Memory Protection mode Multi board mirrored memory with Advanced ECC Xxxx MB System memory and xxxx MB memory reserved for Mirroring Audible Beeps None Possible Cause This message indicates Mirrored Memory is enabled and indicates the amount of memory reserved for this feature Action None Advanced Memory Protection mode RAID memory with Advanced ECC Xxxx MB System memory and xxxx MB ...

Page 117: ...sing or failed Action Install fans or replace any failed fans Fatal DMA Error Audible Beeps None Possible Cause The DMA controller has experienced a critical error that has caused an NMI Action Run Insight Diagnostics HP Insight Diagnostics on page 75 and replace failed components as indicated Fatal Express Port Error Audible Beeps None Possible Cause A PCI Express port has experienced a fatal err...

Page 118: ...sical ROM part Fibre Channel Mezzanine Balcony Not Supported Audible Beeps 2 short Description The Fibre Channel adapter is not supported on the server Action Install the supported Fibre Channel adapter High Temperature Condition detected by Processor X Audible Beeps None Possible Cause Ambient temperature exceeds recommended levels fan solution is insufficient or fans have failed Action Adjust th...

Page 119: ... Use only supported DIMM pairs when populating memory sockets Refer to the applicable server user guide memory requirements Invalid Password System Halted Audible Beeps None Possible Cause An invalid password was entered Action Enter a valid password to access the system Invalid Password System Restricted Audible Beeps None Possible Cause A valid password that does not have permissions to access t...

Page 120: ... Refer to the server documentation for supported processors Be sure that all installed processors are the same speed Network Server Mode Active and No Keyboard Attached Audible Beeps None Possible Cause A keyboard is not connected An error has not occurred but a message is displayed to indicate the keyboard status Action No action is required NMI Button Pressed Audible Beeps None Possible Cause Th...

Page 121: ...curs replace the keyboard Parity Check 2 System DIMM Memory Audible Beeps None Possible Cause An uncorrectable error memory event occurred in a memory DIMM Action Run Insight Diagnostics HP Insight Diagnostics on page 75 to identify failed DIMMs Then use the DIMM LEDs to identify failed DIMMs and replace the DIMMs PCI Bus Parity Error PCI Slot X Audible Beeps None Possible Cause A PCI device has g...

Page 122: ...system ROM Audible Beeps None Possible Cause The system recognizes both the system ROM and redundant ROM as valid This is not an error Action None REDUNDANT ROM ERROR Backup ROM Invalid run ROMPAQ to correct error condition Audible Beeps None Possible Cause The backup system ROM is corrupted The primary ROM is valid Action Run ROMPaq Utility to flash the system so that the primary and backup ROMs ...

Page 123: ... contain a temperature sensor All supported DIMMs for this platform include internal temperature sensors Action See the server documentation for supported DIMMs Install only DIMMs supported by the server This system only supports 667 MHz Front Side Bus Speed Processors One or more 800 MHz Front Side Bus Speed Processors have been initialized at 667 MHz System Halted Audible beeps 1 long 1 short Po...

Page 124: ...w 15 minutes for the process to complete Successful completion is indicated by a series of beeps of increasing pitch USB Tape based One button Disaster Recovery OBDR drive detected Press F8 for configuration options Select a configuration option 1 Enable OBDR 2 Exit Audible Beeps None Possible Cause A USB tape device that supports One Button Disaster Recovery OBDR is installed in the system Action...

Page 125: ...ected System cannot proceed Audible beeps 1 long 1 short Possible cause One or more 800 MHz front side bus speed processors have been initialized at 667 MHz Action Correct the processor configuration WARNING ProLiant Demand Based Power Management cannot be supported with the following processor configuration The system will run in Full Performance mode Audible Beeps None Possible Cause The system ...

Page 126: ...s and similar devices CAUTION Only authorized technicians trained by HP should attempt to remove the system board If you believe the system board requires replacement contact HP Technical Support Contacting HP on page 173 before proceeding Action Replace the system board Run the server setup utility 102 System Board Failure CMOS Test Failed Audible Beeps None Possible Cause 8237 DMA controllers 82...

Page 127: ...nsight Diagnostics HP Insight Diagnostics on page 75 and replace failed components as indicated 162 System Options Not Set Audible Beeps 2 long Possible Cause Configuration is incorrect The system configuration has changed since the last boot addition of a hard drive for example or a loss of power to the real time clock has occurred The real time clock loses power if the onboard battery is not fun...

Page 128: ...Ms installed when no corresponding processor is detected Description Processor is required to be installed for memory to be used Action Populate the processor socket or remove the DIMM 207 Invalid Memory Configuration DIMMs must be installed in pairs or sequentially Audible beeps 1 long 1 short Possible cause The system is configured with only one FBDIMM and the system does not support single FBDI...

Page 129: ...atched DIMMs within DIMM Bank Audible Beeps 1 long 1 short Possible Cause Installed DIMMs in the same bank are of different sizes Action Install correctly matched DIMMs 207 Invalid Memory Configuration Mismatched DIMMs within DIMM Bank Memory in Bank X Not Utilized Audible Beeps 1 long 1 short Possible Cause Installed DIMMs in the same bank are of different sizes Action Install correctly matched D...

Page 130: ...rimary width of x8 Action Install DIMMs that have a primary width of x4 if Advanced ECC memory support is required 209 Online Spare Memory Configuration No Valid Banks for Online Spare Audible Beeps 1 long 1 short Possible Cause Two valid banks are not available to support an online spare memory configuration Action Install or reinstall DIMMs to support online spare configuration 209 Online Spare ...

Page 131: ... 1 long 1 short Possible Cause A problem exists with a memory board powering up properly Action Exchange DIMMs and retest Replace the memory board if problem persists 210 Memory Board Failure on board X Audible Beeps 1 long 1 short Possible Cause A problem exists with a memory board powering up properly Action Exchange DIMMs and retest Replace the memory board if problem persists 212 Processor Fai...

Page 132: ...se controller failure occurred Action 1 Be sure the keyboard and mouse are connected CAUTION Only authorized technicians trained by HP should attempt to remove the system board If you believe the system board requires replacement contact HP Technical Support Contacting HP on page 173 before proceeding 2 Run Insight Diagnostics HP Insight Diagnostics on page 75 and replace failed components as indi...

Page 133: ...hardware conflict in the system is preventing the parallel port from working correctly Action 1 If you have recently added new hardware remove it to see if the hardware is the cause of the conflict 2 Run the server setup utility to reassign resources for the parallel port and manually resolve the resource conflict 3 Run Insight Diagnostics HP Insight Diagnostics on page 75 and replace failed compo...

Page 134: ...dary Floppy Port Address Assignment Conflict Audible Beeps 2 short Possible Cause A hardware conflict in the system is preventing the diskette drive from operating properly Action 1 Run the server setup utility to configure the diskette drive port address and manually resolve the conflict 2 Run Insight Diagnostics HP Insight Diagnostics on page 75 and replace failed components as indicated 1100 Se...

Page 135: ...ronmental requirements for the server Space and airflow o Always allow adequate ventilation o Always populate the racks with blanking panels and the enclosures with blade blanks o Always populate the server with air baffles blanks and heatsinks o Always operate the server with the access panel installed Temperature Only operate the server in a room where the temperature does not exceed the recomme...

Page 136: ...re they are working 2 Be sure each fan cable is properly connected if applicable and each fan is properly seated 3 If the problem persists replace the failed fans 1611 Fan x Failure Detected Fan Zone I O Audible Beeps 2 short Possible Cause Required fan is not installed or spinning Action 1 Check the fans to be sure they are working 2 Be sure each fan cable is properly connected if applicable and ...

Page 137: ...roperly connected and each fan is properly seated 3 If the problem persists replace the failed fans 4 If a known working replacement fan is not spinning replace the assembly 1611 Power Supply Zone Fan Assembly Failure Detected Single fan failure Assembly will provide adequate cooling Audible Beeps None Possible Cause Required fan is not spinning Action Replace the failed fan to provide redundancy ...

Page 138: ...gged or Power Supply Fan Failure in Bay X Audible Beeps None Possible Cause The power supply has failed or it is installed but not connected to the system board or AC power source Action Reseat the power supply firmly and check the power cable or replace power supply 1616 Power Supply Configuration Failure A working power supply must be installed in Bay 1 for proper cooling System Halted Audible B...

Page 139: ...d Audible Beeps None Possible Cause The specified Smart Array controller Bootstrap NVRAM was restored in one of the following ways It was detected as corrupt and the backup copy was restored It was automatically updated because a newer version was available Action 1 Reboot the server 2 If the problem still exists update the controller to the latest firmware version 1711 Slot X Drive Array RAID ADG...

Page 140: ...er detects a checksum failure but is unable to reprogram the backup ROM Action 1 Update the controller to the latest firmware version Firmware maintenance on page 80 2 If the problem persists replace the controller 1714 Slot X Drive Array Controller Redundant ROM Reprogramming Failure Backup ROM has automatically been activated Check firmware version Audible Beeps None Possible Cause The controlle...

Page 141: ...Possible Cause The firmware does not support the number of devices currently attached to the controller Action If release notes indicate that support for additional devices has been added upgrade to the latest version of controller firmware Remove some of the devices attached to the controller 1719 Slot X Drive Array A controller failure event occurred prior to this power up previous lock up code ...

Page 142: ...the indicated drive It may fail at some time in the future Action If the drive is part of a non fault tolerant configuration back up all data before replacing the drive and restore all data afterward If the drive is part of a fault tolerant configuration do not replace the drive unless all other drives in the array are online 1724 Slot X Drive Array Physical Drive Position Change s Detected Logica...

Page 143: ...ed Too many logical drives Audible Beeps None Possible Cause The controller has detected an additional array of drives that was connected when the power was off The logical drive configuration information has been updated to add the new logical drives The maximum number of logical drives supported is 32 Additional logical drives will not be added to the configuration Action No action is required 1...

Page 144: ...pdate the Smart Array firmware to the correct version 1736 HP Trusted Platform Module Error Audible Beeps 2 short Possible Cause A TPM is installed but the System ROM is unable to communicate with the TPM Action Request a new system board and TPM board from an HP authorized service provider Contacting HP technical support or an authorized reseller on page 173 When installing or replacing a TPM obs...

Page 145: ...logical drive s corresponding to these disk drives Audible Beeps None Possible Cause A problem exists with the storage enclosure redundant cabling A single path was found to drives that were previously connected redundantly Action Check the storage box I O module and cable to restore redundant paths to the drives then do one of the following If the redundant cables paths were not purposefully remo...

Page 146: ...completion followed by a list of drives Audible Beeps None Possible Cause A drive erase operation was previously initiated by the user and is in progress or is scheduled for all drives in the list Action None required 1745 Slot X Drive Array Drive Erase Operation Completed The following disk drive s have been erased and will remain offline until hot replaced or re enabled by the Array Configuratio...

Page 147: ...ccelerator memory module has been detached ALL logical drives have been disabled To avoid data loss re attach drives to original controller or upgrade controller To discard all data and create a new configuration run the Array Configuration Utility Audible Beeps None Possible Causes The Array Accelerator memory module was removed or is defective The drives were moved to a controller that does not ...

Page 148: ...r is not supported Action Replace the Array Accelerator module with the correct model for this controller If this occurs after upgrading to a larger module update the controller firmware before attaching the new module 1762 Slot X Drive Array Controller Firmware Upgrade Needed Unsupported Array Accelerator Attached Audible Beeps None Possible Cause The current controller firmware does not support ...

Page 149: ...udible Beeps None Possible Cause Data was lost while the array was expanded therefore the drives have been temporarily disabled Capacity expansion failed due to Array accelerator or hard drive failed or was removed expansion progress data lost Expansion progress data could not be read from array accelerator Expansion aborted due to unrecoverable drive errors Expansion aborted due to array accelera...

Page 150: ... connected properly and securely 4 Update the storage device to the latest firmware version Firmware maintenance on page 80 5 If the problem persists replace the cable backplane or Smart Array Controller 1775 Slot X Drive Array Storage Enclosure Cabling Problem Detected OUT port of this box is attached to OUT port of previous box Turn system and storage box power OFF and check cables Drives in thi...

Page 151: ...cement contact HP Technical Support Contacting HP on page 173 before proceeding 3 Reboot the server after replacing each item a Drive backplane fan board b Drive backplane c I O board 1777 Slot X Drive Array Storage Enclosure Problem Detected followed by one or more of the following followed by one or more of the following SCSI Port Y Cooling Fan Malfunction Detected SCSI Port Y Overheated Conditi...

Page 152: ...Port Y SCSI ID Z Restore data from backup if replacement drive X has been installed Audible Beeps None Possible Cause More drives failed or were replaced than the fault tolerance level allows Unable to rebuild array If drives have not been replaced this message indicates an intermittent drive failure Action Be sure the system is always powered up and down correctly When powering up the system all ...

Page 153: ...eeps None Possible Cause Drive array configuration not detected Action Run ACU Array Configuration Utility on page 70 Power down the system and check SCSI cable connections to be sure the drives are connected properly Run ADU Array diagnostic software on page 76 if previous positions are unknown Then turn the system power off and move the drives to their original positions To avoid data loss updat...

Page 154: ...iled or replacement drive has not yet been rebuilt This message is displayed if the F2 key was pressed during a previous boot or if the F1 key was pressed during a previous boot and the system rebooted before the rebuild of the drive completed Action Perform one of the suggested actions o Press the F1 key to retry Automatic Data Recovery to the drive Data will be automatically restored to drive X ...

Page 155: ...tic software on page 76 to resolve Be sure the cable is routed properly 1789 Slot X Drive Array SCSI Drive s Not Responding Check cables or replace the following SCSI drives SCSI Port Y SCSI ID Z Select F1 to continue drive array will remain disabled Select F2 to failed drives that are not responding Interim Recovery Mode will be enabled if configured for fault tolerance Audible Beeps None Possibl...

Page 156: ...ed on the drive Power was not restored within enough time to save the data 2 Perform orderly system shutdowns to avoid leaving data in the array accelerator 1794 Drive Array Array Accelerator Battery Charge Low Array Accelerator is temporarily disabled Array Accelerator will be re enabled when battery reaches full charge Audible Beeps None Possible Cause The battery charge is below 75 percent Post...

Page 157: ...cache may be disabled or the controller might not be usable until this problem is corrected Action Replace the array accelerator daughter board 1799 Drive Array Drive s Disabled Due to Array Accelerator Data Loss Select F1 to continue with logical drives disabled Select F2 to accept data loss and to re enable logical drives Audible Beeps None Possible Cause One or more logical drives failed due to...

Page 158: ...ard Socket X A CPU Power Module Slot X Socket Y Failed Event Type Power module failure Action Replace the power module In the case of an embedded power module replace the system board ASR Lockup Detected Cause Event Type System lockup Action Examine the IML Integrated Management Log on page 76 to determine the cause of the lockup For more information refer to the HP ROM Based Setup Utility User Gu...

Page 159: ...iable operation EISA Expansion Bus Master Timeout Slot X EISA Expansion Bus Slave Timeout EISA Expansion Board Error Slot X EISA Expansion Bus Arbitration Error Event Type Expansion bus error Action Power down the server and then replace the EISA board PCI Bus Error Slot X Bus Y Device Z Function X Event Type Expansion bus error Action Replace the PCI board Processor Correctable Error Threshold Pa...

Page 160: ...Fan X Location Event Type Fan failure Action Replace the fan System Fans Not Redundant Event Type Fans not redundant Action Add a fan or replace the failed fan System Overheating Zone X Location Event Type Overheating condition Action Check fans System Power Supplies Not Redundant Event Type Power supply not redundant Action Add a power supply or replace the failed power supply System Power Supply...

Page 161: ...port For more information refer to the HP BladeSystem Maintenance and Service Guide on the HP website http www hp com products servers proliant bl p class info 2 Access the diagnostics For more information refer to the HP BladeSystem Maintenance and Service Guide on the HP website http www hp com products servers proliant bl p class info Server blade management module error codes Server blade erro...

Page 162: ...orm the following steps to resolve the problem Stop when the problem is resolved 1 Press the server blade management module reset button 2 Replace the power backplane For more information refer to the HP BladeSystem Maintenance and Service Guide on the HP website http www hp com products servers proliant bl p class info Server blade management module power backplane B error codes LED code 12 1 12 ...

Page 163: ...ers proliant bl p class info Interconnect Module A 10 Connector Error Code LED code 15 1 or 15 2 Location Interconnect module side A 10 connector Action Perform the following steps to resolve the problem Stop when the problem is resolved 1 Press the server blade management module reset button 2 Reseat the interconnect module For more information refer to the HP BladeSystem Maintenance and Service ...

Page 164: ...erconnect Module B 6 Connector Error Code LED code 18 1 or 18 2 Location Interconnect module side B 6 connector Action Perform the following steps to resolve the problem Stop when the problem is resolved 1 Press the server blade management module reset button 2 Reseat the interconnect module For more information refer to the HP BladeSystem Maintenance and Service Guide on the HP website http www h...

Page 165: ...9 7 10 7 11 7 12 or 7 13 Location Power management board Action Perform the following steps to resolve the problem Stop when the problem is resolved 1 Reseat the power management module 2 Replace the power management module Power management module backplane error codes LED code 8 1 8 2 8 3 8 4 8 5 8 6 8 7 or 8 8 Location Power management backplane Action Perform the following steps to resolve the ...

Page 166: ...for the appropriate troubleshooting steps Processor related port 85 codes Processor related port 85 codes display in the format 3xh IMPORTANT Reboot the server after completing each numbered step If the error condition continues proceed with the next step To troubleshoot processor related error codes 1 Bring the server to base configuration by removing all components that are not required by the s...

Page 167: ...nd PPM slot 1 must be populated at all times or the server does not function properly o PPMs except the PPM installed in slot 1 o DIMMs except the first bank o Hard drives o Peripheral devices 3 Reseat the remaining memory boards rebooting after each installation to isolate any failed memory boards if applicable 4 Replace the DIMMs with a remaining bank of memory 5 Replace the memory board if appl...

Page 168: ...s To troubleshoot all other port 85 codes IMPORTANT Reboot the server after completing each numbered step If the error condition continues proceed with the next step 1 Bring the server to base configuration by removing all components that are not required by the server to complete POST This process can include removing all o Expansion boards o Processors except the processor installed in socket 1 ...

Page 169: ... have been started by the operating system The system will continue to operate Action Confirm that the license agreement in use supports all of the installed processors Message ID 4169 Severity Warning Description The processor in slot X socket X has corrected an excessive number of internal errors The system will continue to operate Action Replace the processor Message ID 4190 Severity Error Desc...

Page 170: ...hed correctly do not remove them Check diagnostics and the Integrated Management Log for heat related events Upgrade to the latest versions of system BIOS and Insight Diagnostics Replace the processor MSG_CPU_RR_5 Event type Refresh count is out of range Action Replace the board that contains the memory controller MSG_CPU_RR_6 Event type Unable to perform arithmetic operations on registers Action ...

Page 171: ...subtraction instruction has failed Action Replace the processor MSG_CPU_RR_12 Event type MMX multiply instruction has failed Action Replace the processor MSG_CPU_RR_13 Event type MMX logical instruction has failed Action Replace the processor MSG_CPU_RR_14 Event type MMX shift instruction has failed Action Replace the processor MSG_CPU_RR_15 Event type MMX pack unpack instruction has failed Action...

Page 172: ...sure proper ventilation and cooling for the server Ensure the processor heatsinks are attached correctly do not remove them Check diagnostics and the Integrated Management Log for heat related events Upgrade to the latest versions of system BIOS and Insight Diagnostics Replace the processor ...

Page 173: ... may be recorded or monitored o If you have purchased a Care Pack service upgrade call 1 800 633 3600 For more information about Care Packs refer to the HP website http www hp com hps In other locations see the Contact HP worldwide in English webpage http welcome hp com country us en wwcontact html Customer self repair What is customer self repair HP s customer self repair program offers you the f...

Page 174: ...ftware installed o PCAnywhere information if installed o Verification of latest drivers installed o Verification of latest ROM BIOS o Verification of latest firmware on array controllers and drives Results from attempts to clear NVRAM Operating system information you need Depending on the problem you may be asked for certain pieces of information Be prepared to access the information listed in the...

Page 175: ...rsion A detailed description of the problem and any associated error messages Linux operating systems Collect the following information Operating system distribution and version Look for a file named etc distribution release for example etc redhat release Kernel version in use Output from the following commands performed by root o lspci v o uname a o cat proc meminfo o cat proc cpuinfo o rpm ga o ...

Page 176: ...sed on the server including the names versions dates and sizes can be taken directly from the CONFIG TXT or SURVEY TXT files If HP drivers are installed o Version of the PSP used o List of drivers from the PSP Printouts or electronic copies to e mail to a support technician of o SYS SYSTEM SYS LOG ERR o SYS SYSTEM ABEND LOG o SYS ETC CPQLOG LOG o SYS SYSTEM CONFIG TXT o SYS SYSTEM SURVEY TXT Curre...

Page 177: ...problem and any associated error messages IBM OS 2 operating systems Collect the following information Operating system version number and printouts or electronic copies to e mail to a support technician of o IBMLAN INI o PROTOCOL INI o CONFIG SYS o STARTUP CMD o SYSLEVEL information in detail o TRAPDUMP information if a TRAP error occurs A directory listing of o C o C OS2 o C OS2 BOOT o HPFS386 I...

Page 178: ... or Customer JumpStart Which software group selected for installation End User Support Entire Distribution Developer System Support or Core System Support If HP drivers are installed with a DU o DU number o List of drivers in the DU diskette The drive subsystem and file system information o Number and size of partitions and logical drives o File system on each logical drive A list of all third par...

Page 179: ...ation Utility ADG Advanced Data Guarding also known as RAID 6 ADU Array Diagnostics Utility AMP Advanced Memory Protection ASR Automatic Server Recovery BMC baseboard management controller CCITT International Telegraph and Telephone Consultative Committee CMOS complementary metal oxide semiconductor CPU central processing unit CS cable select ...

Page 180: ... Supplement EISA Extended Industry Standard Architecture ESD electrostatic discharge FBDIMM fully buffered DIMM FDT Firmware Deployment Tool HP SIM HP Systems Insight Manager IDE integrated device electronics iLO Integrated Lights Out iLO 2 Integrated Lights Out 2 iLO 3 Integrated Lights Out 3 IMD Integrated Management Display ...

Page 181: ...e differential MMX multimedia extensions NMI non maskable interrupt NVRAM non volatile memory OBDR One Button Disaster Recovery ORCA Option ROM Configuration for Arrays PCI X peripheral component interconnect extended POST Power On Self Test PPM processor power module PSP ProLiant Support Pack PXE Preboot Execution Environment ...

Page 182: ...ght Lights Out Edition II RIS reserve information sector RPM Red Hat Package Manager SAN storage area network SAS serial attached SCSI SATA serial ATA SIM Systems Insight Manager SIMM single inline memory module SP1 Service Pack 1 SSD support software diskette TPM trusted platform module UPS uninterruptible power system ...

Page 183: ...Acronyms and abbreviations 183 USB universal serial bus VCA Version Control Agent VCRM Version Control Repository Manager VGA video graphics array ...

Page 184: ...n process 68 automatic backup 81 automatic data recovery rebuild 152 Automatic Server Recovery ASR 72 B backup issue tape drive 51 backup restoring 62 batteries insufficient warning when low 38 batteries replacing 41 battery 38 41 99 111 134 battery pack array accelerator 143 beep codes 115 BIOS Serial Console 69 blank screen 53 blue screen event 159 boot options 69 boot problems 65 booting proble...

Page 185: ...lems 41 42 43 drivers 77 88 drives disabled 149 157 drives troubleshooting 43 DVD ROM drive 41 E ECC errors 92 EISA expansion bus master timeout 159 electrostatic discharge 15 end user license agreement EULA 84 energy saver features 54 erase operation 112 113 Erase Utility 74 error codes HP BladeSystem p Class infrastructure 161 error codes Insight Diagnostic processor 169 error codes processor 16...

Page 186: ... handler 118 iLO Integrated Lights Out 73 89 125 iLO messages 165 IMD Integrated Management Display 157 IML Integrated Management Log 49 76 128 157 Important Safety Information document 13 incorrect drive replacement 155 information required 173 174 infrastructure error codes 161 Insight Diagnostics 75 77 157 169 170 171 172 Insight Diagnostics processor error codes 169 170 171 172 Insight Remote ...

Page 187: ...ion support 78 89 operating systems 60 61 63 78 89 174 operating systems supported 78 89 option ROM 81 Option ROM Configuration for Arrays ORCA 71 ORCA Option ROM Configuration for Arrays 71 OS boot problems flowchart 32 overheating 160 P panic error 61 parallel port 133 parameters 64 parity errors 91 98 121 part numbers 87 88 passwords 119 patches 61 PCI boards 40 PCI bus error 159 PCI device 123...

Page 188: ... 114 115 redundant cabling configuration 144 145 redundant path failure 115 redundant ROM 65 74 81 122 140 registering the server 87 reloading software 62 Remote Insight Lights Out Edition II RILOE II 61 73 remote ROM flash 64 65 remote ROM flash problems 64 remote support and analysis tools 77 replacement drives detected 152 required information 173 174 resources 86 resources troubleshooting 86 r...

Page 189: ...fan failure 160 system fans 160 system fans not redundant 160 system not supported 65 System Online ROM flash component utility 79 system overheating 160 system power supplies not redundant 160 system power supply failure 160 system ROM 80 85 System ROMPaq Firmware Upgrade Utility 83 T tape drives 51 tape drives failure of 51 teardown procedures 88 technical support 173 technical topics 88 telepho...

Page 190: ...Index 190 W warnings 14 87 website HP 86 87 websites reference 23 86 what s new 8 when to reconfigure or reload software 62 white papers 86 88 Windows Event Log processor error codes 169 ...

Reviews: