background image

IBM NeXtScale nx360 M4
Installation and Service Guide

Machine Type:

5455

Summary of Contents for NeXtScale nx360 M4

Page 1: ...IBM NeXtScale nx360 M4 Installation and Service Guide Machine Type 5455 ...

Page 2: ...tion document and the Safety Information and Environmental Notices and User Guide documents on the IBM Documentation CD Fourth Edition June 2014 Copyright Lenovo 2014 LIMITED AND RESTRICTED RIGHTS NOTICE If data or software is delivered pursuant to a General Services Administration GSA contract use reproduction or disclosure is subject to restrictions set forth in Contract No GS 35F 05925 ...

Page 3: ...er 31 Changing the Power Policy option to the default settings after loading UEFI defaults 32 Using the integrated management module 32 Using the remote presence and blue screen capture features 33 Using the embedded hypervisor 35 Configuring the Ethernet controller 36 Enabling Features on Demand Ethernet software 36 Enabling Features on Demand RAID software 36 Configuring RAID arrays 36 IBM Advan...

Page 4: ...cage 119 Removing and replacing Tier 1 CRUs 121 Removing the operator information panel 121 Installing the operator information panel 123 Removing the power paddle card from the GPU tray 124 Replacing the power paddle card on to the GPU tray 125 Removing the system battery 126 Replacing the system battery 127 Removing a memory module 129 Installing a memory module 130 Removing the optional 3 5 inc...

Page 5: ...t results for the DSA memory stress test 670 DSA Nvidia GPU test results 674 Test results for the DSA Nvidia GPU test 674 DSA optical drive test results 680 Test results for the DSA optical drive test 681 DSA system management test results 685 Test results for the DSA system management test 685 DSA tape drive test results 700 Test results for the DSA tape drive test 700 Getting help and technical ...

Page 6: ...iv IBM NeXtScale nx360 M4 Installation and Service Guide ...

Page 7: ...iften Ennen kuin asennat tämän tuotteen lue turvaohjeet kohdasta Safety Information Avant d installer ce produit lisez les consignes de sécurité Vor der Installation dieses Produkts die Sicherheitshinweise lesen Prima di installare questo prodotto leggere le Informazioni sulla Sicurezza Les sikkerhetsinformasjonen Safety Information før du installerer dette produktet Antes de instalar este produto...

Page 8: ... you must determine how serious the hazard is and whether you must correct the problem before you work on the product Consider the following conditions and the safety hazards that they present Electrical hazards especially primary power Primary voltage on the frame can cause serious or fatal electrical shock Explosive hazards such as a damaged CRT face or a bulging capacitor Mechanical hazards suc...

Page 9: ...gency power off EPO switch disconnecting switch or electrical outlet so that you can turn off the power quickly in the event of an electrical accident Disconnect all power before you perform a mechanical inspection work near power supplies or remove or install main units Before you work on the equipment disconnect the power cord If you cannot disconnect the power cord have the customer power off t...

Page 10: ...ables or perform installation maintenance or reconfiguration of this product during an electrical storm Connect all power cords to a properly wired and grounded electrical outlet Connect to properly wired outlets any equipment that will be attached to this product When possible use one hand only to connect or disconnect signal cables Never turn on any equipment when there is evidence of fire water...

Page 11: ... such as CD ROMs DVD drives fiber optic devices or transmitters are installed note the following Do not remove the covers Removing the covers of the laser product could result in exposure to hazardous laser radiation There are no serviceable parts inside the device Use of controls or adjustments or performance of procedures other than those specified herein might result in hazardous radiation expo...

Page 12: ...e The device also might have more than one power cord To remove all electrical current from the device ensure that all power cords are disconnected from the power source 1 2 Statement 6 CAUTION If you install a strain relief bracket option over the end of the power cord that is connected to the device you must connect the other end of the power cord to an easily accessible power source Statement 8...

Page 13: ...as this label attached There are no serviceable parts inside these components If you suspect a problem with one of these parts contact a service technician Statement 12 CAUTION The following label indicates a hot surface nearby Statement 26 CAUTION Do not place any object on top of rack mounted devices Statement 27 CAUTION Hazardous moving parts are nearby Copyright Lenovo 2014 xi ...

Page 14: ...ack cabinet Always install stabilizer brackets on the rack cabinet Always install servers and optional devices starting from the bottom of the rack cabinet Always install the heaviest devices in the bottom of the rack cabinet xii IBM NeXtScale nx360 M4 Installation and Service Guide ...

Page 15: ...service and assistance see the Warranty Information document for your compute node You can download the IBM ServerGuide Setup and Installation CD to help you configure the hardware install device drivers and install the operating system For a list of supported optional devices for the server see http www ibm com systems info x86servers serverproven compat us See the Rack Installation Instructions ...

Page 16: ...rvice label which is on the cover of the server provides a QR code for mobile access to service information You can scan the QR code using a QR code reader and scanner with a mobile device and get quick access to the IBM Service Information website The IBM Service Information website provides additional information for parts installation and replacement videos and error codes for server support Th...

Page 17: ...cription of the document is displayed under Topic Description To select more than one document press and hold the Ctrl key while you select the documents Click View Book to view the selected document or documents in Acrobat Reader or xpdf If you selected more than one document all the selected documents are opened in Acrobat Reader or xpdf To search all the documents type a word or word string in ...

Page 18: ...caution and danger statements in this document are also in the multilingual Safety Information document which is on the IBMDocumentation CD Each statement is numbered for reference to the corresponding statement in your language in the Safety Information document The following notices and statements are used in this document Note These notices provide important tips guidance or advice Important Th...

Page 19: ...EDs Software RAID supportability for RAID level 0 RAID level 1 or RAID level 10 Hardware RAID supportability for RAID level 0 RAID level 1 RAID level 5 or RAID level 10 Wake on LAN WOL Drive expansion bays depending on the model Supports up to eight 3 5 inch SATA if the storage tray is installed up to 7 in the storage tray and 1 in the compute node two 2 5 inch SATA SAS or four 1 8 inch solid stat...

Page 20: ...ive humidity 8 to 85 Maximum dew point 27 C 80 6 F Storage non operating Temperature 1 C to 60 C 33 8 F to 140 0 F Maximum altitude 3 050 m 10 000 ft Relative humidity 5 to 80 Maximum dew point 29 C 84 2 F Shipment non operating 8 Temperature 40 C to 60 C 40 F to 140 0 F Maximum altitude 10 700 m 35 105 ft Relative humidity 5 to 100 Maximum dew point 29 C 84 2 F 9 Particulate contamination Attenti...

Page 21: ...n any hardware a properly functioning wrist strap must be used by any personnel who contacts IT equipment 6 5 C hr for data centers employing tape drives and 20 C hr for data centers employing disk drives 7 Chassis is removed from original shipping container and is installed but not in use for example during repair maintenance or upgrade 8 The equipment acclimation period is 1 hour per 20 C of tem...

Page 22: ...nt module on page 32 and the Integrated Management Module II User s Guide at the http www ibm com supportportal Large system memory capacity The compute node supports up to 128 GB of system memory The memory controller provides support for up to 8 industry standard registered ECC DDR3 on low profile LP DIMMs on the system board For the most current list of supported DIMMs see http www ibm com syst...

Page 23: ...e the availability of the compute node when you need it and the ease with which you can diagnose and correct problems The compute node has the following RAS features Advanced Configuration and Power Interface ACPI Automatic server restart ASR Built in diagnostics using DSA Preboot Built in monitoring for temperature voltage and hard disk drives Customer support center 24 hours per day 7 days a wee...

Page 24: ...er Battery holder Figure 3 Major components of the compute node Major components of the storage tray Use this information to locate the major components on the storage tray The storage tray is installed on the top of a compute node Each storage tray supports up to seven 3 5 inch LFF SATA hard disk drives The ServeRAID adapter can be connects from compute node via PCIe interface to support RAID lev...

Page 25: ...he storage tray Major components of the GPU tray Use this information to locate the major components on the GPU tray The GPU tray is installed on the top of a compute node Each GPU tray supports up to two Graphics Processing Unit GPU enclosure full height full length The following illustration shows the major components of the GPU tray Chapter 1 The IBM NeXtScale nx360 M4 Compute Node Type 5455 11...

Page 26: ...ils about the controls connectors and LEDs The following illustration identifies the buttons connectors and LEDs on the control panel 00000000000 00000000000 00000000000 00000000000 00000 00000 00000 00000 00000 00000 00000 Ethernet 1 connector shared management port Ethernet 2 connector Management connector dedicated management port KVM connector Power on LED power button Locator LED System error...

Page 27: ...button for 4 seconds forces the operating system to shut down immediately Data loss is possible Locator LED The system administrator can remotely light this blue LED to aid in visually locating the compute node Check log LED When this yellow LED is lit it indicates that a system error has occurred Check the Event logs on page 53 for additional information System error LED When this yellow LED is l...

Page 28: ...out tag a little to prevent interfere with the KVM cable Turning on the compute node Use this information for details about turning on the compute node After you connect the compute node to power through the IBM NeXtScale n1200 Enclosure the compute node can be started in any of the following ways You can press the power button on the front of the compute node see Compute node controls connectors ...

Page 29: ...tion for information about shutting down the operating system The compute node can be turned off in any of the following ways You can press the power button on the compute node see Compute node controls connectors and LEDs on page 12 This starts an orderly shutdown of the operating system if this feature is supported by the operating system If the operating system stops functioning you can press a...

Page 30: ...panel SATA connector LED signal connector PCI riser connector 1 3V lithium battery USB hypervisor key 10GB ethernet card connector DIMM 5 DIMM 6 Figure 8 Internal connectors on system board System board external connectors The following illustration shows the external connectors on the system board 16 IBM NeXtScale nx360 M4 Installation and Service Guide ...

Page 31: ...connector KVM connector Figure 9 External connectors on system board System board switches and jumpers The following illustration shows the location and description of the switches and jumpers Chapter 1 The IBM NeXtScale nx360 M4 Compute Node Type 5455 17 ...

Page 32: ... it to access the switches Notes 1 Before you change any switch settings or move any jumpers turn off the server Review the information in Safety on page v Installation guidelines on page 89 Handling static sensitive devices on page 91 and Turning off the compute node on page 15 2 Any system board switch or jumper block that is not shown in the illustrations in this document are reserved 18 IBM Ne...

Page 33: ...to light the error LEDs The error LEDs that were lit while the system board tray was running will be lit again while the button is pressed The following illustration shows the LEDs and controls on the system board RTMM hearbeat LED Ethernet card error LED Microprocessor 2 error LED DIMM 8 7 error LEDs DIMM 4 3 error LEDs Microprocessor LED mismatch DIMM 2 1 error LEDs DIMM 6 5 error LEDs System bo...

Page 34: ...20 IBM NeXtScale nx360 M4 Installation and Service Guide ...

Page 35: ...ler to acquire and apply UpdateXpress System Packs and individual firmware and device driver updates For additional information and to download the UpdateXpress System Pack Installer go to the ToolsCenter for System x and BladeCenter at http www ibm com support entry portal docdisplay lndocid TOOL CENTER and click UpdateXpress System Pack Installer When you click an update an information page is d...

Page 36: ...integrated management module on page 32 and the Integrated Management Module II User s Guide at http www 947 ibm com support entry portal docdisplay lndocid migr 5086346 VMware ESXi embedded hypervisor An optional USB flash device with VMware ESXi embedded hypervisor software is available for purchase Hypervisor is virtualization software that enables multiple operating systems to run on a host sy...

Page 37: ... configuring and managing RAID arrays Server configuration RAID array configuration before operating system is installed RAID array management after operating system is installed ServeRAID H1110 adapter LSI Utility Setup utility press Ctrl C ServerGuide Human Interface Infrastructure HII MegaRAID Storage Manager MSM SAS2IRCU Command Line Utility for Storage Management ServeRAID M1115 adapter MegaR...

Page 38: ...re any supported IBM server model The setup program provides a list of tasks that are required to set up your server model On a server with a ServeRAID adapter or SAS SATA controller with RAID capabilities you can run the SAS SATA RAID configuration program to create logical drives Note Features and functions can vary slightly with different versions of the ServerGuide program Typical operating sy...

Page 39: ...his information to start up the Setup utility To start the Setup utility complete the following steps Step 1 Turn on the server Note Approximately 5 to 10 seconds after the server is connected to power the power control button becomes active Step 2 When the prompt F1 Setup is displayed press F1 If you have set an administrator password you must type the administrator password to access the full Se...

Page 40: ...devices and input output I O ports You can configure the serial ports configure remote console redirection enable or disable integrated Ethernet controllers the SAS SATA controllers SATA optical drive channels PCI slots and video controller If you disable a device it cannot be configured and the operating system will not be able to detect it this is equivalent to disconnecting the device Power Sel...

Page 41: ...the default settings Reset IMM Select this choice to reset IMM Recovery Select this choice to view or change the system recovery parameters POST Attempts Select this choice to view or change the number of attempts to POST POST Attempts Limit Select this choice to view or change the Nx boot failure parameters System Recovery Select this choice to view or change system recovery settings POST Watchdo...

Page 42: ...ST event log and the system event log You can use the arrow keys to move between pages in the error log This choice is on the full Setup utility menu only The POST event log contains the most recent error codes and messages that were generated during POST The system event log contains POST and system management interrupt SMI events and all events that are generated by the baseboard management cont...

Page 43: ...n administrator password is intended to be used by a system administrator it limits access to the full Setup utility menu If you set only an administrator password you do not have to type a password to complete the system startup but you must type the administrator password to access the Setup utility menu If you set a power on password for a user and an administrator password for a system adminis...

Page 44: ...pers on page 17 for more information 1 2 3 1 2 3 NMI button Lightpath button UEFI boot recovery jumper Clear CMOS jumper Figure 12 Power on password switch Attention Before you change any switch settings or move any jumpers turn off the server then disconnect all power cords and external cables See the safety information that begins Safety on page v Do not change settings or move jumpers on any sy...

Page 45: ...item from the menu and press Enter The next time the server starts it returns to the startup sequence that is set in the Setup utility Starting the backup server firmware Use this information to start the backup server firmware The system board contains a backup copy area for the server firmware This is a secondary copy of the server firmware that you update only during the process of updating the...

Page 46: ...orts the following basic systems management features Active Energy Manager Alerts in band and out of band alerting PET traps IPMI style SNMP e mail Auto Boot Failure Recovery ABR Automatic microprocessor disable on failure and restart in a two microprocessor configuration when one microprocessor signals an internal error When one of the microprocessors fail the server will disable the failing micr...

Page 47: ...tart the server identify the server and perform other management functions Any standard Telnet client application can access the SOL connection For more information about IMM see the Integrated Management Module II User s Guide at http www 947 ibm com support entry portal docdisplay lndocid migr 5086346 Using the remote presence and blue screen capture features The remote presence and blue screen ...

Page 48: ...mal for example 5E F3FCFFFE5EAAD0 Obtaining the IP address for the IMM Use this information to obtain the IP address for the IMM To access the web interface to use the remote presence feature you need the IP address or host name of the IMM You can obtain the IMM IP address through the Setup utility and you can obtain the IMM host name from the IMM network access tag The server comes with a default...

Page 49: ...d above in both the Web and CLI interfaces Using the embedded hypervisor The VMware ESXi embedded hypervisor software is available on the optional IBM USB flash device with embedded hypervisor The USB flash device can be installed in USB connectors on the system board see Internal cable routing and connectors on page 175 for the location of the connectors Hypervisor is virtualization software that...

Page 50: ... to http www ibm com supportportal Enabling Features on Demand Ethernet software Use this information to enable Features on Demand Ethernet software You can activate the Features on Demand FoD software upgrade key for Fibre Channel over Ethernet FCoE and iSCSI storage protocols that is integrated in the integrated management module For more information and instructions for activating the Features ...

Page 51: ...n the file as a script The ASU program supports scripting environments through a batch processing mode For more information and to download the ASU program go to http www ibm com support entry portal docdisplay lndocid TOOL ASU Updating IBM Systems Director Use this information to update the IBM Systems Director If you plan to use IBM Systems Director to manage the server you must check for the la...

Page 52: ...nterface and click View updates Step 11 Select the updates that you want to install and click Install to start the installation wizard Updating the Universal Unique Identifier UUID The Universal Unique Identifier UUID must be updated when the system board is replaced Use the Advanced Settings Utility to update the UUID in the UEFI based server The ASU is an online tool that supports several operat...

Page 53: ... a zero 0 not an O Note If you do not specify any of these parameters ASU will use the default values When the default values are used and ASU is unable to access the IMM using the online authenticated LAN access method ASU will automatically use the unauthenticated KCS access method The following commands are examples of using the userid and password default values and not using the default value...

Page 54: ...nd password default values and not using the default values Example that does not use the userid and password default values asu set SYSTEM_PROD_DATA SYsInfoUUID uuid_value host imm_ip user user_id password password Example that does use the userid and password default values asu set SYSTEM_PROD_DATA SysInfoUUID uuid_value host imm_ip Bootable media You can also build a bootable media using the ap...

Page 55: ...ing systems ibm_rndis_server_os inf device cat For Linux based operating systems cdc_interface sh Step 4 After you install ASU Type the following commands to set the DMI asu set SYSTEM_PROD_DATA SysInfoProdName m t_model access_method asu set SYSTEM_PROD_DATA SysInfoSerialNum s n access_method asu set SYSTEM_PROD_DATA SysEncloseAssetTag asset_tag access_method Where m t_model The server machine ty...

Page 56: ...ve the IPMI driver installed by default ASU provides the corresponding mapping layer To download the Advanced Settings Utility Users Guide complete the following steps Note Changes are made periodically to the IBM website The actual procedure might vary slightly from what is described in this document 1 Go to http www ibm com supportportal 2 Click on the Downloads tab at the top of the panel 3 Und...

Page 57: ...word asu set SYSTEM_PROD_DATA SysEncloseAssetTag asset_tag ho user imm_user_id password imm_password Examples that do use the userid and password default values asu set SYSTEM_PROD_DATA SysInfoProdName m t_model host imm_ip asu set SYSTEM_PROD_DATA SysInfoSerialNum s n host imm_ip asu set SYSTEM_PROD_DATA SysEncloseAssetTag asset_tag host imm_ip Bootable media You can also build a bootable media u...

Page 58: ...44 IBM NeXtScale nx360 M4 Installation and Service Guide ...

Page 59: ...d before the problem occurred if possible reverse those changes This might include any of the following items Hardware components Device drivers and firmware System software UEFI firmware System input power or network connections Step 2 View the light path diagnostics LEDs and event logs The server is designed for ease of diagnosis of hardware and software problems Light path diagnostics LEDs See ...

Page 60: ...ning preboot diagnostics For more information about UpdateXpress System Packs see http www ibm com support entry portal docdisplay lndocid SERV XPRESSand Updating the firmware on page 21 For more information about the Bootable Media Creator see http www ibm com support entry portal docdisplay lndocid TOOL BOMC Be sure to separately install any listed critical updates that have release dates that a...

Page 61: ...ocument known problems and suggested solutions To search for troubleshooting procedures and RETAIN tips go to http www ibm com supportportal Step 8 Use the troubleshooting tables See Troubleshooting by symptom on page 59 to find a solution to a problem that has identifiable symptoms A single problem might cause multiple symptoms Follow the troubleshooting procedure for the most obvious symptom If ...

Page 62: ...oprocessor socket See Microprocessor problems on page 63 for information about diagnosing microprocessor problems Before you run DSA you must determine whether the failing server is part of a shared hard disk drive cluster two or more servers sharing external storage devices If it is part of a cluster you can run all diagnostic programs except the ones that test the storage unit that is a hard dis...

Page 63: ...ing results Successful completion of POST see POST on page 56 for more information Successful completion of startup which is indicated by a readable display of the operating system desktop Step 3 Is there a readable image on the monitor screen No Find the failure symptom in Troubleshooting by symptom on page 59 if necessary see Solving undetermined problems on page 73 Yes Run DSA see Running DSA P...

Page 64: ...diagnosing server problems DSA Portable runs on the server operating system and collects the following information about the server Drive health information Event logs for ServeRAID controllers and service processors Installed hardware including PCI and USB information Installed applications and hot fixes Kernel modules Light path diagnostics status Microprocessor input out hub and UEFI error logs...

Page 65: ...ices CD or DVD 7 SAS or SATA drives See Running DSA Preboot diagnostic programs on page 57 for more information on running the DSA Preboot program on the server Troubleshooting by symptom These tables list problem symptoms and actions to correct the problems See Troubleshooting by symptom on page 59 for more information Power supply LEDs The following minimum configuration is required for the serv...

Page 66: ... power to the server or a problem with the ac power source 1 Check the ac power to the server 2 Make sure that the power cord is connected to a functioning power source 3 Restart the server If the error remains check the power supply LEDs 4 If the problem remains replace the power supply This is a normal condition when no ac power is present Off Off On The power supply has failed Replace the power...

Page 67: ...ower off sequencing 1 If the LED blinks at 1Hz it is functioning properly and no action is necessary 2 If the LED is not blinking trained technician only replace the system board IMM2 heartbeat IMM2 heartbeat boot process The following steps describe the different stages of the IMM2 heartbeat sequencing process 1 When this LED is blinking fast approximately 4Hz this indicates that the IMM2 code is...

Page 68: ...ce on page 34 You can also view the IMM event log through the Dynamic System Analysis DSA program as the ASM event log For more information about IMM error messages see Appendix A Integrated Management Module II IMM2 error messages on page 179 DSA event log This log is generated by the Dynamic System Analysis DSA program and it is a chronologically ordered merge of the system event log as the IPMI...

Page 69: ...olled network ports Run DSA Portable to view the diagnostic event log requires IPMI driver or create an output file that you can send to IBM service and support using ftp or local copy Use IPMItool to view the system event log requires IPMI driver Use the web browser interface to the IMM to view the system event log locally requires RNDIS USB LAN driver The server is not hung and the integrated ma...

Page 70: ...ontrollers and service processors Hardware inventory including PCI and USB information Installed applications and hot fixes available in DSA Portable only Kernel modules available in DSA Portable only Light path diagnostics status Network interfaces and settings Performance data and details about processes that are running RAID controller configuration Service processor integrated management modul...

Page 71: ...y It has a graphical user interface that you can use to specify which diagnostics to run and to view the diagnostic and data collection results DSA Preboot provides diagnostics for the following system components if they are installed Emulex network adapter Optical devices CD or DVD Tape drives SCSI SAS or SATA Memory Microprocessor Checkpoint panel I2C bus SAS and SATA drives If you are unable to...

Page 72: ...ncerning test failures is available in the extended diagnostic results for each test Viewing the test log results and transferring the DSA collection Use this information to view the test log results and transferring the DSA collection To view the test log for the results when the tests are completed click the Success link in the Status column if you are running the DSA graphical user interface or...

Page 73: ...he troubleshooting tables to find solutions to problems that have identifiable symptoms If you cannot find a solution to the problem in these tables see Appendix C DSA diagnostic test results on page 549 for information about testing the server and Running DSA Preboot diagnostic programs on page 57 for additional information about running DSA Preboot program For additional information to help you ...

Page 74: ...y by a trained technician Go to the IBM support website at http www ibm com supportportal to check for technical information hints tips and new device drivers or to submit a request for information Symptom Action Not all drives are recognized by the hard disk drive diagnostic tests Remove the drive that is indicated by the diagnostic tests then run the hard disk drive diagnostic tests again If the...

Page 75: ...he problem is solved If an action step is preceded by Trained technician only that step must be performed only by a trained technician Go to the IBM support website at http www 947 ibm com support entry portal overview to check for technical information hints tips and new device drivers or to submit a request for information Symptom Action A problem occurs only occasionally and is difficult to dia...

Page 76: ...evice from the hub and connect it directly to the server 3 Replace the mouse or USB device Memory problems Use this information to solve memory problems Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved If an action step is preceded by Trained technician only that step must be performed only by a trained technician Go to the IBM sup...

Page 77: ... the problem is not the microprocessor or the DIMM connector 8 Trained technician only Replace the system board Multiple DIMMs in a channel are identified as failing Note Each time you install or remove a DIMM you must disconnect the server from the power source then wait 10 seconds before restarting the server 1 Reseat the DIMMs then restart the server 2 Remove the highest numbered DIMM of those ...

Page 78: ...that comes with the monitor for instructions for testing and adjusting the monitor If you cannot diagnose the problem call for service Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved If an action step is preceded by Trained technician only that step must be performed only by a trained technician Go to the IBM support website at ht...

Page 79: ... some application programs 1 Make sure that The application program is not setting a display mode that is higher than the capability of the monitor You installed the necessary device drivers for the application 2 Run video diagnostics see Running DSA Preboot diagnostic programs on page 57 If the server passes the video diagnostics the video is good see Solving undetermined problems on page 73 Trai...

Page 80: ...are listed in the Action column until the problem is solved If an action step is preceded by Trained technician only that step must be performed only by a trained technician Go to the IBM support website at http www 947 ibm com support entry portal overview to check for technical information hints tips and new device drivers or to submit a request for information Symptom Action Unable to wake the ...

Page 81: ... device are secure 2 If the device comes with test instructions use those instructions to test the device 3 If the failing device is a SCSI device make sure that The cables for all external SCSI devices are connected correctly The last device in each SCSI chain or the end of the SCSI cable is terminated correctly Any external SCSI device is turned on You must turn on an external SCSI device before...

Page 82: ...e of the same type Mixing different power supplies in the server will cause a system error the system error LED on the front panel turns on 4 Make sure that The power cords are correctly connected to the server and to a working electrical outlet The type of memory that is installed is correct The DIMMs are fully seated The LEDs on the power supply do not indicate a problem The microprocessors are ...

Page 83: ... technician Go to the IBM support website at http www 947 ibm com support entry portal overview to check for technical information hints tips and new device drivers or to submit a request for information Symptom Action The number of serial ports that are identified by the operating system is less than the number of installed serial ports 1 Make sure that Each port is assigned a unique address in t...

Page 84: ...either no logical drive is defined SCSI RAID servers or the ServerGuide System Partition is not present Run the ServerGuide program and make sure that setup is complete Software problems Use this information to solve software problems Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved If an action step is preceded by Trained technici...

Page 85: ...n overcurrent condition To diagnose a power problem use the following general procedure Step 1 Turn off the server and disconnect all power cords Step 2 Check for loose cables in the power subsystem Also check for short circuits for example if a loose screw is causing a short circuit on a circuit board Step 3 Check the lit LEDs on the operator information panel see Light path diagnostics on page S...

Page 86: ... Replace the identified component Step 5 Remove the adapters and disconnect the cables and power cords to all internal and external devices until the server is at the minimum configuration that is required for the server to start see Power supply LEDs on page 51 for the minimum configuration Step 6 Reconnect all power cords and turn on the server If the server starts successfully reseat the adapte...

Page 87: ...lled Check the LAN activity LED on the rear of the server The LAN activity LED is lit when data is active on the Ethernet network If the LAN activity LED is off make sure that the hub and network are operating and that the correct device drivers are installed Check for operating system specific causes of the problem Make sure that the device drivers on the client and server are using the same prot...

Page 88: ...0000 0000000 0000000 000000000000 000000000000 0000 0000 0000 0000 0000 0000000 0000000 0000000 000000000000 000000000000 0000 0000 0000 0000 0000 0000000 0000000 0000000 000000000000 000000000000 0000 0000 0000 0000 0000000 0000000 0000000 000000000000 000000000000 0000 0000 0000 0000 0000000 0000000 0000000 000000000000 000000000000 0000 0000 0000 0000 0000 0000000 0000000 0000000 000000000000 0...

Page 89: ...ge dccvi for information about calling IBM for service Recovering the server firmware UEFI update failure Use this information to recover the server firmware Important Some cluster solutions require specific code levels or coordinated code updates If the device is part of a cluster solution verify that the latest level of code is supported for the cluster solution before you update the code If the...

Page 90: ...ecovery method Use this information to recover the server firmware and restore the server operation to the primary bank To recover the server firmware and restore the server operation to the primary bank complete the following steps Step 1 Read the safety information that begins on Safety on page v and Installation guidelines on page 89 Step 2 Turn off the server and disconnect all power cords and...

Page 91: ...power cords Step 7 Restart the server The system begins the power on self test POST Step 8 Boot the server to an operating system that is supported by the firmware update package that you downloaded Step 9 Perform the firmware update by following the instructions that are in the firmware update package readme file Step 10 Turn off the server and disconnect all power cords and external cables and t...

Page 92: ... there is a log entry or Booting Backup Image is displayed on the firmware splash screen otherwise use the in band manual recovery method Step 1 Boot the server to an operating system that is supported by the firmware update package that you downloaded Step 2 Perform the firmware update by following the instructions that are in the firmware update package readme file Step 3 Restart the server Step...

Page 93: ...attempts automatic or manual the Nx boot failure feature causes the server to revert to the default UEFI configuration and start the Setup utility so that you can make the necessary corrections to the configuration and restart the server If the server is unable to successfully complete POST with the default configuration there might be a problem with the system board To specify the number of conse...

Page 94: ...80 IBM NeXtScale nx360 M4 Installation and Service Guide ...

Page 95: ... for the service See Structural parts on page 85 for the list of structural parts Tier 1 customer replaceable unit CRU Replacement of Tier 1 CRUs is your responsibility If IBM installs a Tier 1 CRU at your request you will be charged for the installation Tier 2 customer replaceable unit You may install a Tier 2 CRU yourself or request IBM to install it at no additional charge under the type of war...

Page 96: ... assembly hardware RAID 00AM453 4 2 5 inch HDD 2x cable vertical cable assembly hardware RAID 00FL148 4 2 5 inch HDD 2x cable right angle cable assembly software RAID 00FL149 7 Microprocessor Intel Xeon E5 2618L v2 2 0 GHz 15 MB 1333 MHz 50 W 6 core 00AE522 7 Microprocessor Intel Xeon E5 2648L v2 2 0 GHz 25 MB 1866 MHz 70 W 10 core 00AE523 7 Microprocessor Intel Xeon E5 2658 v2 2 4 GHz 25 MB 1866 ...

Page 97: ...2785 7 Microprocessor Intel Xeon E5 2680 v2 2 8 GHz 25 MB 1866 MHz 115 W 10 core 00Y2786 7 Microprocessor Intel Xeon E5 2690 v2 3 0 GHz 25 MB 1866 MHz 130 W 10 core 00Y2787 7 Microprocessor Intel Xeon E5 2637 v2 3 5 GHz 15 MB 1866 MHz 130 W 4 core 00Y2789 7 Microprocessor Intel Xeon E5 2643 v2 3 5 GHz 25 MB 1866 MHz 130 W 6 core 00Y2790 7 Microprocessor Intel Xeon E5 2667 v2 3 3 GHz 25 MB 1866 MHz...

Page 98: ...ps SATA non hot swap 00AD036 12 Hard disk drive 2 5 inch 1 TB 6 Gbps SATA non hot swap 00AD041 12 Hard disk drive 2 5 inch 146 GB 15K 6 Gbps SAS non hot swap 00AD046 12 Hard disk drive 2 5 inch 300 GB 15K 6 Gbps SAS non hot swap 00AD051 12 Hard disk drive 2 5 inch 300 GB 10K 6 Gbps SAS non hot swap 00AD056 12 Hard disk drive 2 5 inch 600 GB 10K 6 Gbps SAS non hot swap 00AD061 12 Hard disk drive 2 ...

Page 99: ...orx screwdriver provided on the back of the chassis 00FK488 Thermal grease kit 41Y9292 Alcohol wipe 59P4739 Structural parts Structural parts are not covered by the IBM Statement of Limited Warranty You can place an order on the structural parts from the IBM retail store The following structural parts are available for purchase from the retail store Table 7 Structural parts Type Type 5455 Index De...

Page 100: ...d States and Canada are listed by Underwriter s Laboratories UL and certified by the Canadian Standards Association CSA For units intended to be operated at 115 volts Use a UL listed and CSA certified cord set consisting of a minimum 18 AWG Type SVT or SJT three conductor cord a maximum of 15 feet in length and a parallel blade grounding type attachment plug rated 15 amperes 125 volts For units in...

Page 101: ...u Dhabi Bahrain Botswana Brunei Darussalam Channel Islands China Hong Kong S A R Cyprus Dominica Gambia Ghana Grenada Iraq Ireland Jordan Kenya Kuwait Liberia Malawi Malaysia Malta Myanmar Burma Nigeria Oman Polynesia Qatar Saint Kitts and Nevis Saint Lucia Saint Vincent and the Grenadines Seychelles Sierra Leone Singapore Sudan Tanzania United Republic of Trinidad and Tobago United Arab Emirates ...

Page 102: ...ord part number Used in these countries and regions 39M5226 India 39M5240 39M5241 Brazil 39M5375 39M5378 39M5509 Canada Germany United States of America 88 IBM NeXtScale nx360 M4 Installation and Service Guide ...

Page 103: ...nical assistance on page dccvi Installation tools The following tools are required to remove or replace parts on the IBM NeXtScale nx360 M4 Compute Node Phillips screwdriver T8 torx screwdriver part number 00FK488 provided on the back of the chassis Flat blade screwdriver Installing an optional device Some compute node components are available as both optional devices and replaceable components Th...

Page 104: ...e Never move suddenly or twist when you lift a heavy object To avoid straining the muscles in your back lift by standing or by pushing up with your leg muscles Make sure that you have an adequate number of properly grounded electrical outlets for the compute node monitor and other devices Back up all important data before you make changes to disk drives Have a small flat blade screwdriver a small ...

Page 105: ...sor and heat sink You have installed the fourth and sixth fans when you installed the second microprocessor option Handling static sensitive devices Use this information to handle static sensitive devices Attention Static electricity can damage the compute node and other electronic devices To avoid damage keep static sensitive devices in their static protective packages until you are ready to inst...

Page 106: ... how many microprocessors are installed For optimum performance you must upgrade the operating system to support SMP See your operating system documentation for additional information Removing a compute node from a chassis Use this information to remove a compute node from a NeXtScale nx360 M4 compute node Before you remove a compute node complete the following steps 1 Read Safety on page v and In...

Page 107: ...ou are installing a compute node model without an integrated Ethernet controller you must install a network interface adapter before you install the compute node in the chassis for management network communication For a list of supported optional devices for the compute node see http www ibm com systems info x86servers serverproven compat us The following tables provide an indication of the quanti...

Page 108: ... 5 8 1 12 12 7 9 130 2 10 8 4 7 Note 1 OVS Oversubscription of the power system allows for more efficient use of the available system power Table 9 Compute nodes supported low line AC input with 900 watt power supply x6 Microprocessor SKU W of microprocessor s Non redundant or N 1 with OVS1 N 5 N 1 redundant N 5 N N redundant N 3 N N redundant with OVS1 N 3 1 12 12 9 11 50 2 12 12 6 10 1 12 12 7 9...

Page 109: ... 2 12 12 12 12 1 12 12 12 12 95 2 12 12 10 12 1 12 12 12 12 115 2 12 12 8 12 1 12 12 12 12 130 2 12 12 7 11 Note 1 OVS Oversubscription of the power system allows for more efficient use of the available system power Table 11 Compute nodes two 130 watt2 GPUs supported high line AC input with 1300 watt power supply x6 Microprocessor SKU W of microprocessor s Non redundant or N 1 with OVS1 N 5 N 1 re...

Page 110: ...ble system power 2 The 130 watt GPU is IBM option part number 00J6160 Table 12 Compute nodes two 225 watt2 GPUs supported high line AC input with 1300 watt power supply x6 Microprocessor SKU W of microprocessor s Non redundant or N 1 with OVS1 N 5 N 1 redundant N 5 N N redundant N 3 N N redundant with OVS1 N 3 1 6 6 5 1 microprocessor node 6 50 2 6 6 5 6 1 6 6 5 6 60 2 6 6 4 1 microprocessor node ...

Page 111: ...allows for more efficient use of the available system power 2 The 225 watt GPUs include IBM option part numbers 00D4192 00J6161 00J6163 and 00J6165 Table 13 Compute nodes two 235 watt2 GPUs supported high line AC input with 1300 watt power supply x6 Microprocessor SKU W of microprocessor s Non redundant or N 1 with OVS1 N 5 N 1 redundant N 5 N N redundant N 3 N N redundant with OVS1 N 3 1 6 6 5 1 ...

Page 112: ...ion of the power system allows for more efficient use of the available system power 2 The 235 watt GPU is IBM option part number 00FL133 Table 14 Compute nodes two 300 watt2 GPUs supported high line AC input with 1300 watt power supply x6 Microprocessor SKU W of microprocessor s Non redundant or N 1 with OVS1 N 5 N 1 redundant N 5 N N redundant N 3 N N redundant with OVS1 N 3 1 6 6 4 2 microproces...

Page 113: ...re efficient use of the available system power 2 The 300 watt GPU is IBM option part number 00J6162 1300 watt power supply supportability The following table provides the 1300 watt power supply supportability to have better performance and power efficiency Table 15 1300 watt power supply supportability FPC power bank Quantity of 1300 watt power supplies Non redundant N 1 redundant N N redundant 2 ...

Page 114: ... in the compute node initializes and synchronizes with the Chassis Management Module This process takes approximately 90 seconds The power LED flashes rapidly and the power button on the compute node does not respond until this process is complete Step 5 Turn on the compute node see Turning on the compute node on page 14 for instructions Step 6 Make sure that the power LED on the compute node cont...

Page 115: ...he configuration cable the hardware RAID signal cable and the mini SAS cable from the storage tray Step 3 Press on the release latch and slide the storage tray toward the rear of the compute node 00 00 00 000 000 000 000 00000000000 00000000000 00000000000 00000000000 00000000000 00000000000 00000000000 00000000000 1 4 3 2 Figure 19 Removal of a storage tray Step 4 Pull the storage tray out of the...

Page 116: ... Blank If a hard disk drive fails it is recommended to keep the failed hard disk drive in the storage tray until installing a new hard disk drive or a filler Step 1 Carefully lay the storage tray on a flat static protective surface orienting the storage tray with the release latch near your right hand side Step 2 Connect the configuration cable the hardware RAID signal cable and the mini SAS cable...

Page 117: ...llowing steps 1 Read Safety on page v and Installation guidelines on page 89 2 If the compute node is operating shut down the operating system 3 Press the power button to turn off the compute node see Turning off the compute node on page 15 for more information To remove the GPU tray from a compute node complete the following steps Step 1 Remove the cover see Removing the compute node cover on pag...

Page 118: ...300 watt power supply unit with high line Vin AC 200 volt to 240 volt Before you install the compute node in a chassis read Safety on page v and Installation guidelines on page 89 To install the GPU tray to compute node complete the following steps Step 1 Carefully lay the GPU tray on a flat static protective surface orienting the GPU tray with the release latch near your right hand side Step 2 Co...

Page 119: ...s your responsibility If IBM installs a structural part at your request you will be charged for the installation The illustrations in this document might differ slightly from your hardware Removing the compute node cover Use this information to remove the cover from a compute node Before you remove the compute node cover complete the following steps 1 Read Safety on page v and Installation guideli...

Page 120: ...ward the rear of the compute node Step 2 Lift the cover away from the compute node 0 0 000 000 000 000 000000 000000 000000 Push point Cover Release latch Figure 23 Remove the compute node cover Attention Do not use any tools or sharp objects to press on the release latch Doing so might result in permanent damage to the release latch Step 3 Lay the cover flat or store it for future use If you are ...

Page 121: ...azardous energy is present when the compute node is connected to the power source Always replace the compute node cover before installing the compute node To install the compute node cover complete the following steps Step 1 Carefully lay the compute node on a flat static protective surface orienting the compute node with the bezel pointing toward you Step 2 Orient the cover so that the posts on t...

Page 122: ...eps Step 1 Read the safety information that begins on Safety on page v and Installation guidelines on page 89 Step 2 Turn off the compute node and peripheral devices and disconnect the power cords and all external cables see Turning off the compute node on page 15 Step 3 Remove the cover see Removing the compute node cover on page 105 Step 4 Grasp the air baffle disengage pins from pin holes then ...

Page 123: ... safety information that begins on Safety on page v and Installation guidelines on page 89 Step 2 Turn off the compute node and peripheral devices and disconnect the power cords and all external cables Step 3 Remove the cover see Removing the compute node cover on page 105 Step 4 Align the air baffle pins with the baffle pin holes on the left hand side of the chassis for the left air baffle then l...

Page 124: ...he peripheral devices and the compute node Removing a RAID adapter battery holder Use this information to remove a RAID adapter battery holder If a RAID adapter battery is installed remotely near the fan cage and you need to replace it complete the following steps Step 1 Read the safety information that begins on Safety on page v and Installation guidelines on page 89 Step 2 Turn off the server an...

Page 125: ...e this information to install a RAID adapter battery holder To install a RAID adapter battery holder complete the following steps Step 1 Read the safety information that begins on Safety on page v and Installation guidelines on page 89 Step 2 Turn off the server and peripheral devices and disconnect all power cords and external devices then remove the cover see Removing the compute node cover on p...

Page 126: ...tep 5 Remove the PCI riser filler from the compute node and set it aside Attention For proper cooling and airflow replace the PCI riser filler before you turn on the compute node Operating the compute node with the PCI riser filler removed might damage compute node components Replacing the PCI riser filler Use this information to install the PCI riser filler To install the PCI riser filler complet...

Page 127: ...ay Use this information to remove the filler from the GPU tray To remove the filler from the GPU tray complete the following steps Step 1 Read the safety information that begins on Safety on page v and Installation guidelines on page 89 Step 2 Turn off the compute node and peripheral devices and disconnect the power cords and all external cables see Turning off the compute node on page 15 Step 3 R...

Page 128: ...o install the filler on to the GPU tray complete the following steps Step 1 Read the safety information that begins on Safety on page v and Installation guidelines on page 89 Step 2 Turn off the compute node and peripheral devices and disconnect the power cords and all external cables Step 3 Remove the cover see Removing the compute node cover on page 105 Step 4 Align the filler with the bracket o...

Page 129: ...nt handle Before you remove the front handle complete the following steps 1 Read Safety on page v and Installation guidelines on page 89 2 If the compute node is installed in an IBM NeXtScale n1200 Enclosure remove it see Removing a compute node from a chassis on page 92 for instructions 3 Carefully lay the compute node on a flat static protective surface with the cover side down orienting the com...

Page 130: ...lied to you Installing the front handle Use this information to install the front handle Before you install the front handle complete the following steps 1 Read Safety on page v and Installation guidelines on page 89 2 If the compute node is installed in an IBM NeXtScale n1200 Enclosure remove it see Removing a compute node from a chassis on page 92 for instructions 3 Carefully lay the compute nod...

Page 131: ...chassis see Installing a compute node in a chassis on page 93 for instructions Removing the hard disk drive cage Use this information to remove the hard disk drive cage Before you remove the hard disk drive cage complete the following steps 1 Read Safety on page v and Installation guidelines on page 89 2 If the compute node is installed in an IBM NeXtScale n1200 Enclosure remove it see Removing a ...

Page 132: ... inch 0 0 0 0 0 0 0 0 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0 0 0 0 2 5 inch Hard disk drive cage Screws Pin Pin hole Back of chassis T8 torx screwdriver Figure 38 Removing a hard disk drive cage 2 5 inch 118 IBM NeXtScale nx360 M4 Installation and Service Guide ...

Page 133: ...ree for 1 8 inch hard disk drive cage from the cage and rotate the cage from under the bezel then remove the cage from the compute node at an angle If you are instructed to return the hard disk drive cage follow all packaging instructions and use any packaging materials for shipping that are supplied to you Installing the hard disk drive cage Use this information to install the hard disk drive cag...

Page 134: ... inch 0 0 0 0 0 0 0 0 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0 0 0 0 2 5 inch Hard disk drive cage Screws Pin Pin hole Back of chassis T8 torx screwdriver Figure 41 Installing a hard disk drive cage 2 5 inch 120 IBM NeXtScale nx360 M4 Installation and Service Guide ...

Page 135: ...he compute node Step 5 Install the hard disk drive backplate see Installing the hard disk drive backplate on page 140 Step 6 Insert the easy swap hard disk drives and hard disk drive bay fillers see Removing and installing drives on page 142 After you install the hard disk drive cage complete the following steps 1 Install the cover onto the compute node see Installing the compute node cover on pag...

Page 136: ...00 0000 0000 0000 0000 0000 0000 0 0 0 0 T8 torx screwdriver Back of chassis Screw Figure 43 Screw removal Step 2 Pull out the connector from the system board 000 000 000 00000 00000 00000 00000 00000 00000 000000000 000000000 000000000 000000000 000000000 Figure 44 Connector pull out Step 3 Carefully pull the operator information panel outward a little to make a space for removal 000 000 000 0000...

Page 137: ...tion panel Use this information to install the operator information panel Before you install the operator information panel read Safety on page v and Installation guidelines on page 89 To install the operator information panel complete the following steps Step 1 Position the operator information panel on the front of the compute node 000 000 000 00000 00000 00000 00000 00000 00000 000000000 000000...

Page 138: ...tor of an operator information panel Step 4 Install the screw of the operator information panel 000000000000 000000000000 000000000000 000000000000 000000000000 000000000000 0 0 0 0 0 0 0 0 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0000 0 0 0 0 T8 torx screwdriver Back of chassis Screw Figure 50 Screw installation Removing the power paddle card from the GPU tray Use this information t...

Page 139: ...re 51 Power paddle card removal Step 8 If you are instructed to return the power paddle card follow all packaging instructions and use any packaging materials for shipping that are supplied to you Replacing the power paddle card on to the GPU tray Use this information to install the power paddle card on to the GPU tray To install the power paddle card on to the GPU tray complete the following step...

Page 140: ... 800 IBM 4333 for information about battery disposal If you replace the original lithium battery with a heavy metal battery or a battery with heavy metal components be aware of the following environmental consideration Batteries and accumulators that contain heavy metals must not be disposed of with normal domestic waste They will be taken back free of charge by the manufacturer distributor or rep...

Page 141: ...over on the battery holder use your fingers to lift the battery cover from the battery connector b Use one finger to tilt the battery horizontally out of its socket pushing it away from the socket Attention Neither tilt nor push the battery by using excessive force Figure 53 System battery removal c Use your thumb and index finger to lift the battery from the socket Attention Do not lift the batte...

Page 142: ...epair or disassemble Dispose of the battery as required by local ordinances or regulations To install the replacement system battery complete the following steps Step 1 Follow any special handling and installation instructions that come with the replacement battery Step 2 Read the safety information that begins on Safety on page v and Installation guidelines on page 89 Step 3 Turn off the server a...

Page 143: ...onfigure the server See Using the Setup utility on page 25 for details Removing a memory module Use this information to remove a memory module To remove a dual inline memory module DIMM complete the following steps Step 1 Read the safety information that begins on Safety on page v and Installation guidelines on page 89 Step 2 Turn off the server and peripheral devices and disconnect all power cord...

Page 144: ...ard double data rate 3 DDR3 1066 1333 1600 or 1866 MHz PC3 8500 PC3 10600 PC3 12800 or PC3 14900 registered or unbuffered synchronous dynamic random access memory SDRAM dual inline memory modules DIMMs with error correcting code ECC See http www ibm com systems info x86servers serverproven compat us for a list of supported memory modules for the compute node The specifications of a DDR3 DIMM are o...

Page 145: ...sign of the DIMM Note To determine the type of a DIMM see the label on the DIMM The information on the label is in the format xxxxx nRxxx PC3v xxxxxx xx xx xxx The numeral in the sixth numerical position indicates whether the DIMM is single rank n 1 dual rank n 2 or quad rank n 4 The following rules apply to DDR3 RDIMM speed as it relates to the number of RDIMMs in a channel When you install 1 RDI...

Page 146: ... and a maximum of 128 GB of system memory using RDIMMs A minimum of one DIMM must be installed for each microprocessor For example you must install a minimum of two DIMMs if the compute node has two microprocessors installed However to improve system performance install a minimum of four DIMMs for each microprocessor DIMMs in the compute node must be the same type RDIMM or UDIMM to ensure that the...

Page 147: ...or DIMM 5 DIMM 6 Figure 56 DIMM connectors location DIMM installation sequence Depending on the server model the server may come with a minimum of one 4 GB DIMM installed in slot 4 When you install additional DIMMs install them in the order shown in the following table to optimize system performance In general all channels on the memory interface for each microprocessor can be populated in any ord...

Page 148: ...to half of the installed memory when memory mirrored channel is enabled For example if you install 8 GB of memory using RDIMMs only 4 GB of addressable memory is available when you use memory mirrored channel The following table shows the installation sequence for memory mirrored channel mode Table 19 Memory mirrored channel mode DIMM population sequence Number of installed microprocessor DIMM con...

Page 149: ...the air baffle see Replacing the air baffle on page 109 Note Close all the retaining clips even for slots without DIMMs installed before replacing the air baffle Step 11 Replace the cover see Installing the compute node cover on page 107 Step 12 Reconnect the power cords and any cables that you removed Step 13 Turn on the peripheral devices and the server Removing the optional 3 5 inch hard disk d...

Page 150: ... inch hard disk drive hardware RAID cage Step 1 Remove the cover see Removing the compute node cover on page 105 Step 2 Disconnect the power cable and the mini SAS cable from the system board and the storage tray respectively see Cabling hard disk drive with ServeRAID SAS SATA controller on page 176 136 IBM NeXtScale nx360 M4 Installation and Service Guide ...

Page 151: ...any packaging materials for shipping that are supplied to you Installing the optional 3 5 inch hard disk drive hardware RAID cage Use this information to install the optional 3 5 inch hard disk drive hardware RAID cage Before you install the hard disk drive cage complete the following steps 1 Read Safety on page v and Installation guidelines on page 89 2 If the compute node is installed in an IBM ...

Page 152: ...ep 3 Align the cage with the screw holes on the system board Step 4 Using a Phillips screwdriver insert the 4 screws and secure the cage in the compute node Step 5 Install the easy swap hard disk drive 7 see Installing a 3 5 inch hard disk drive on page 143 Step 6 Connect the power cable and the mini SAS cable on the system board and the storage tray respectively see Cabling hard disk drive with S...

Page 153: ...assis on page 93 for instructions Removing the hard disk drive backplate Use this information to remove the hard disk drive backplate Before you remove the hard disk drive backplate complete the following steps 1 Read Safety on page v and Installation guidelines on page 89 2 If the compute node is installed in an IBM NeXtScale n1200 Enclosure remove it see Removing a compute node from a chassis on...

Page 154: ...age the drive or filler Step 3 Unlatch the release latch and lift out the hard disk drive backplate If you are instructed to return the hard disk drive backplate follow all packaging instructions and use any packaging materials for shipping that are supplied to you Installing the hard disk drive backplate Use this information to install the hard disk drive backplate Before you install the hard dis...

Page 155: ...plate complete the following steps 2 5 inch Hard disk drive backplate Figure 63 Installing backplate for 2 5 inch 1 8 inch solid state drive backplate Figure 64 Installing backplate for 1 8 inch Step 1 Remove the cover see Removing the compute node cover on page 105 Step 2 Align the backplate with the hard disk drive cage and the connector on the system board and press the backplate into position ...

Page 156: ... the bay in which you want to install the drive Check the instructions that come with the drive to determine whether you have to set any switches or jumpers on the drive If you are installing a SAS or SATA hard disk drive be sure to set the SAS or SATA ID for that device The compute node supports up to one 3 5 inch two 2 5 inch or four 1 8 inch easy swap SAS or SATA hard disk drives For a complete...

Page 157: ... complete the following steps Step 1 Read Safety on page v and Installation guidelines on page 89 Step 2 If the compute node is installed in an IBM NeXtScale n1200 Enclosure remove it see Removing a compute node from a chassis on page 92 for instructions Step 3 Carefully lay the compute node on a flat static protective surface orienting the compute node with the bezel pointing toward you Step 4 Re...

Page 158: ...plate complete the following steps 1 Install the cover onto the compute node see Installing the compute node cover on page 107 for instructions 2 Install the compute node into the chassis see Installing a compute node in a chassis on page 93 for instructions Removing a 2 5 inch hard disk drive Use this information to remove a 2 5 inch hard disk drive Attention Static electricity that is released t...

Page 159: ... and rotate the cage upward 1 2 2 5 inch Hard disk drive cage Figure 67 Lift the 2 5 inch hard disk drive cage upward Step 6 Push this latch gently outward a little to let the screw un hold by the latch hole Then remove the hard disk drive 1 2 Release latch Figure 68 2 5 inch hard disk drive removal Step 7 Pull the plunger of the 2 5 inch hard disk drive cage outward and rotate the cage downward u...

Page 160: ...g using drives with different speed ratings might cause all drives to operate at the speed of the slowest drive You must turn off the compute node when you perform any steps that involve installing or removing cables Attention Static electricity that is released to internal server components when the server is powered on might cause the server to halt which might result in the loss of data To avoi...

Page 161: ...rive from the package Step 8 Align the drive with the bay of the hard disk drive cage then carefully slide the drive into the drive bay until the drive snaps into place 2 5 inch Hard disk drive Figure 71 2 5 inch hard disk drive installation Step 9 Pull the plunger of the 2 5 inch hard disk drive cage outward and rotate the cage downward until the cage snaps into place 1 2 Figure 72 Put the 2 5 in...

Page 162: ...ation to remove a 1 8 inch hard disk drive Attention Static electricity that is released to internal server components when the server is powered on might cause the server to halt which might result in the loss of data To avoid this potential problem always use an electrostatic discharge wrist strap or other grounding system when you work inside the server with the power on To remove a 1 8 inch ha...

Page 163: ...our 1 8 inch SAS SATA hard disk drives in the bays For a list of supported optional devices for the server see http www ibm com systems info x86servers serverproven compat us Inspect the drive and drive bay for signs of damage Make sure that the drive is correctly installed in the drive bay See the documentation for the ServeRAID adapter for instructions for installing a hard disk drive All drives...

Page 164: ...e compute node cover on page 105 Step 5 Pull the plunger of the 1 8 inch hard disk drive cage outward and rotate the cage upward 1 2 1 8 inch SSD cage Figure 76 Lift the 1 8 inch hard disk drive cage upward Step 6 Remove the filler panel if one is present Step 7 Touch the static protective package that contains the disk drive to any unpainted metal surface on the server then remove the disk drive ...

Page 165: ...nto the compute node see Installing the compute node cover on page 107 for instructions 2 Install the compute node into the chassis see Installing a compute node in a chassis on page 93 for instructions Removing a PCI riser cage assembly Note PCI riser cage brackets must be installed even if you do not install an adapter To remove a PCI riser cage assembly complete the following steps Step 1 Read ...

Page 166: ...owing steps Step 1 Read the safety information that begins on Safety on page v and Installation guidelines on page 89 Step 2 Turn off the server and peripheral devices and disconnect all power cords Step 3 Remove the cover see Removing the compute node cover on page 105 Step 4 Install the adapter in the new PCI riser cage assembly see Replacing an adapter GPU adapter on page 157 Step 5 Set any jum...

Page 167: ... Safety on page v and Installation guidelines on page 89 Step 2 Turn off the server and peripheral devices and disconnect the power cords and all external cables Step 3 Remove the GPU tray from the compute node see Removing a GPU tray from a compute node on page 103 Step 4 Remove the cover see Removing the compute node cover on page 105 Step 5 Remove the air baffle see Removing the air baffle on p...

Page 168: ...ent from the PCI riser cage assembly see Removing an adapter GPU adapter on page 156 Step 9 Set the GPU adapter and the PCI riser cage assembly aside Step 10 If you are instructed to return the PCI riser cage assembly follow all packaging instructions and use any packaging materials for shipping that are supplied to you Replacing a PCI riser cage assembly in the GPU tray Note PCI riser cage bracke...

Page 169: ...iser connector on the system board then grasp the rear side of the PCI riser cage touch point and the front PCI rise cage suitable location of the PCI riser cage assembly Step 7 Press down firmly until the PCI riser cage assembly is seated correctly in the connector on the system board 00 00 000 000 000 000 00 00 0000 0000 0000 0000 0000000 0000000 0000000 0000000 Guide pin Front PCI riser assembl...

Page 170: ...s on Safety on page v and Installation guidelines on page 89 Step 2 Turn off the server and peripheral devices and disconnect all power cords then remove the cover see Removing the compute node cover on page 105 Step 3 Grasp the PCI riser cage assembly at the blue tabs and lift to remove the PCI riser cage assembly Step 4 Disconnect any cables from the adapter GPU adapter Step 5 Place the PCI rise...

Page 171: ...addition to the instructions in this section For configuration information see the ServeRAID documentation at http www 947 ibm com support entry portal overview When you install the new GPU adapter you must update the GPU adapter with the latest firmware Make sure that you have the latest firmware before you proceed See Updating the firmware on page 21 for more information When you install any PCI...

Page 172: ...er GPU adapter Step 5 Insert the adapter GPU adapter into the PCI riser cage assembly aligning the edge connector on the adapter GPU adapter with the connector on the PCI riser cage assembly Press the edge of the connector firmly into the PCI riser cage assembly Make sure that the adapter GPU adapter snaps into the PCI riser cage assembly securely 00000000 00000000 00000000 00000000 PCIe adapter P...

Page 173: ...configuration tasks that are required for the adapter GPU adapter Step 9 Reinstall the cover see Installing the compute node cover on page 107 Step 10 Slide the server into the rack Step 11 Reconnect the power cords and any cables that you removed Step 12 Turn on the peripheral devices and the server Removing the USB flash drive Use this information to remove the USB flash drive Before you remove ...

Page 174: ...rive out of the connector If you are instructed to return the USB flash drive follow all packaging instructions and use any packaging materials for shipping that are supplied to you Installing the USB flash drive Use this information to install the USB flash drive Before you install the USB flash drive complete the following steps 1 Read Safety on page v and Installation guidelines on page 89 160 ...

Page 175: ... installed as an optional device or as a CRU The installation procedure is the same for the optional device and the CRU To install the USB flash drive complete the following steps USB hypervisor key Figure 92 Installing USB flash drive Step 1 Remove the cover see Removing the compute node cover on page 105 Step 2 Locate the USB connector on the system board see System board internal connectors on ...

Page 176: ...e microprocessor socket Do not use any tools or sharp objects to lift the locking levers on the microprocessor socket Doing so might result in permanent damage to the system board Each microprocessor socket must always contain either a socket cover or a microprocessor and heat sink Be sure to use only the installation tools provided with the new microprocessor to remove or install the microprocess...

Page 177: ...of the microprocessor retainer b Lift the heat sink out of the server After removal place the heat sink with the thermal grease side up on a clean flat surface Heat sink Figure 93 Heat sink removal Step 7 Open the microprocessor socket release levers and retainer Microprocessor Microprocessor release lever Microprocessor release lever Figure 94 Microprocessor socket levers and retainer disengageme...

Page 178: ...e following illustration of the installation tool shows the location of the interlock latch and counterclockwise rotation of the handle before loading the microprocessor H Figure 95 Installation tool handle adjustment b Align the installation tool with the screws as shown in the following graphic and lower the installation tool on the microprocessor The installation tool rests flush on the socket ...

Page 179: ...on the socket install the socket cover that you removed in step Step 8 on page 169 on the microprocessor socket Attention The pins on the socket are fragile Any damage to the pins may require replacing the system board If you are instructed to return the microprocessor follow all packaging instructions and use any packaging materials for shipping that are supplied to you Replacing a microprocessor...

Page 180: ...processor socket 1 on the system board When one microprocessor is installed the air baffle must be installed to provide proper system cooling Do not remove the first microprocessor from the system board when you install the second microprocessor When you install the second microprocessor you must also install additional memory the fourth and sixth fans See Installing a memory module on page 130 fo...

Page 181: ...tep 4 Remove the air baffle see Removing the air baffle on page 108 Step 5 Loosen the four screws on the corners of the microprocessor retainer Step 6 Open the microprocessor socket release levers and retainer Microprocessor release lever Microprocessor release lever Figure 99 Microprocessor socket levers and retainer disengagement a Identify which release lever is labeled as the first release lev...

Page 182: ...roperly aligned Installation tool Microprocessor Alignment pins Figure 101 Installation tool alignment d Twist the handle of the installation tool assembly counterclockwise until the microprocessor is inserted into the socket and lift the installation tool out of the socket The following illustration shows the tool handle in the open position 168 IBM NeXtScale nx360 M4 Installation and Service Gui...

Page 183: ...and aligned correctly in the socket before you try to close the microprocessor retainer Do not touch the thermal material on the bottom of the heat sink or on top of the microprocessor Touching the thermal material will contaminate it Step 8 Remove the microprocessor socket cover tape or label from the surface of the microprocessor socket if one is present Store the socket cover in a safe place Ch...

Page 184: ...which release lever is labeled as the first release lever to close and close it c Close the second release lever on the microprocessor socket Attention If you are installing a new heat sink do not set down the heat sink after you remove the plastic cover Do not touch the thermal grease on the bottom of the heat sink Touching the thermal grease will contaminate it Step 10 Install the heat sink Atte...

Page 185: ...he microprocessor in the retention bracket thermal material side down d Press firmly on the heat sink e Tighten the four screws on the corners of the microprocessor retainer Step 11 Reinstall the air baffle see Replacing the air baffle on page 109 Step 12 Install the cover see Installing the compute node cover on page 107 Step 13 Slide the server into the rack Step 14 Reconnect the power cords and...

Page 186: ...following steps Step 1 Place the heat sink on a clean work surface Step 2 Remove the cleaning pad from its package and unfold it completely Step 3 Use the cleaning pad to wipe the thermal grease from the bottom of the heat sink Note Make sure that all of the thermal grease is removed Step 4 Use a clean area of the cleaning pad to wipe the thermal grease from the microprocessor then dispose of the ...

Page 187: ...cale n1200 Enclosure remove it see Removing a compute node from a chassis on page 92 for instructions 3 Carefully lay the compute node on a flat static protective surface orienting the compute node with the bezel pointing toward you 4 Obtain the following for use during the replacement procedure see Chapter 4 Parts listing IBM NeXtScale nx360 M4 Compute Node Type 5455 on page 81 Alcohol wipes part...

Page 188: ...ed only if the compute node came with a RFID tag attached to the bezel T8 torx screwdriver part number 00FK488 provided on the back of the chassis Thermal grease kit part number 41Y9292 Important When you replace the system board you must update the compute node with the latest firmware or restore the preexisting firmware Make sure that you have the latest firmware or a copy of the preexisting fir...

Page 189: ...or restore the preexisting firmware see Updating the firmware on page 21 for more information Internal cable routing and connectors This section provides information about routing the cables when you install some components in the IBM NeXtScale nx360 M4 Compute Node For more information about the requirements for cables and connecting devices see the documentation that comes with these devices Cab...

Page 190: ...inch hard disk drive with software RAID signal cable connection Cabling hard disk drive with ServeRAID SAS SATA controller The internal routing and connectors for the hard disk drive with ServeRAID SAS SATA controller The following illustrations show the internal routing and connectors for the 2 5 inch and 1 8 inch hard disk drive models with ServeRAID SAS SATA controller Note Make sure the releva...

Page 191: ... 5 inch hard disk drive with ServeRAID SAS SATA controller cable connection 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 Power cable Mini SAS cable Configuration cable 1 8 inch solid state drive cage Figure 114 1 8 inch hard disk drive with ServeRAID SAS SATA controller cable connection Chapter 5 Removing and replacing components 177 ...

Page 192: ...178 IBM NeXtScale nx360 M4 Installation and Service Guide ...

Page 193: ...ion of the level of concern for the condition In the system event log severity is abbreviated to the first character The following severities can be displayed Info The event was recorded for audit purposes usually a user action or a change of states that is normal behavior Warning The event is not as severe as an error but if possible the condition should be corrected before it becomes an error It...

Page 194: ...des and messages that might not apply to this machine type and model The following is the list of Integrated Management Module II IMM2 error messages and suggested actions to correct the detected server problems For more information about Integrated Management Module II IMM2 see the Integrated Management Module II User s Guide at http www 947 ibm com support entry portal docdisplay lndocid migr 50...

Page 195: ...sure that the certificate that you are importing is correct and properly generated 40000003 00000000 Ethernet Data Rate modified from arg1 to arg2 by user arg3 This message is for the use case where a user modifies the Ethernet Port data rate May also be shown as 4000000300000000 or 0x4000000300000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM...

Page 196: ...rg2 by user arg3 This message is for the use case where a user modifies the Ethernet Port MTU setting May also be shown as 4000000500000000 or 0x4000000500000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0005 User Response Information only no action is required 40000006 00000000 Ethernet locally administered MAC addr...

Page 197: ...s or disabled the ethernet interface May also be shown as 4000000700000000 or 0x4000000700000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0007 User Response Information only no action is required 40000008 00000000 Hostname set to arg1 by user arg2 This message is for the use case where user modifies the Hostname of ...

Page 198: ...May also be shown as 4000000900000000 or 0x4000000900000000 Severity Info Serviceable No Automatically notify support No Alert Category System IMM Network event SNMP Trap ID 37 CIM Information Prefix IMM ID 0009 User Response Information only no action is required 4000000a 00000000 IP subnet mask of network interface modified from arg1 to arg2 by user arg3 This message is for the use case where a ...

Page 199: ...t Controller May also be shown as 4000000b00000000 or 0x4000000b00000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0011 User Response Information only no action is required 4000000c 00000000 OS Watchdog response arg1 by arg2 This message is for the use case where an OS Watchdog has been enabled or disabled by a user ...

Page 200: ... notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0013 User Response Complete the following steps until the problem is solved 1 Make sure that the IMM network cable is connected 2 Make sure that there is a DHCP server on the network that can assign an IP address to the IMM 4000000e 00000000 Remote Login Successful Login ID arg1 from arg2 at IP address arg3 This mess...

Page 201: ...0000f00000000 or 0x4000000f00000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0015 User Response Information only no action is required 40000010 00000000 Security Userid arg1 had arg2 login failures from WEB client at IP address arg3 This message is for the use case where a user has failed to log in to a Management C...

Page 202: ...0 or 0x4000001100000000 Severity Warning Serviceable No Automatically notify support No Alert Category System Remote Login SNMP Trap ID 30 CIM Information Prefix IMM ID 0017 User Response Complete the following steps until the problem is solved 1 Make sure that the correct login ID and password are being used 2 Have the system administrator reset the login ID or password 40000012 00000000 Remote a...

Page 203: ...to a Management Controller from a telnet session May also be shown as 4000001300000000 or 0x4000001300000000 Severity Info Serviceable No Automatically notify support No Alert Category System Remote Login SNMP Trap ID 30 CIM Information Prefix IMM ID 0019 User Response Make sure that the correct login ID and password are being used 40000014 00000000 The arg1 on system arg2 cleared by user arg3 Thi...

Page 204: ...e shown as 4000001500000000 or 0x4000001500000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0021 User Response Information only no action is required 40000016 00000000 ENET arg1 DHCP HSTN arg2 DN arg3 IP arg4 SN arg5 GW arg6 DNS1 arg7 This message is for the use case where a Management Controller IP address and confi...

Page 205: ...tatically using user data May also be shown as 4000001700000000 or 0x4000001700000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0023 User Response Information only no action is required 40000018 00000000 LAN Ethernet arg1 interface is no longer active This message is for the use case where a Management Controller eth...

Page 206: ...active May also be shown as 4000001900000000 or 0x4000001900000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0025 User Response Information only no action is required 4000001a 00000000 DHCP setting changed to arg1 by user arg2 This message is for the use case where a user changes the DHCP setting May also be shown as...

Page 207: ...y also be shown as 4000001b00000000 or 0x4000001b00000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0027 User Response Information only no action is required 4000001c 00000000 Watchdog arg1 Screen Capture Occurred This message is for the use case where an operating system error has occurred and the screen was capture...

Page 208: ...en capture failed May also be shown as 4000001d00000000 or 0x4000001d00000000 Severity Error Serviceable No Automatically notify support No Alert Category System other SNMP Trap ID 22 CIM Information Prefix IMM ID 0029 User Response Complete the following steps until the problem is solved 1 Reconfigure the watchdog timer to a higher value 2 Make sure that the IMM Ethernet over USB interface is ena...

Page 209: ...e updates If the device is part of a cluster solution verify that the latest level of code is supported for the cluster solution before you update the code 4000001f 00000000 Please ensure that the Management Controller arg1 is flashed with the correct firmware The Management Controller is unable to match its firmware to the server This message is for the use case where a Management Controller firm...

Page 210: ...o default values May also be shown as 4000002000000000 or 0x4000002000000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0032 User Response Information only no action is required 40000021 00000000 Management Controller arg1 clock has been set from NTP server arg2 This message is for the use case where a Management Cont...

Page 211: ...D 0034 User Response Complete the following steps until the problem is solved 1 Make sure that the certificate that you are importing is correct 2 Try to import the certificate again 40000023 00000000 Flash of arg1 from arg2 succeeded for user arg3 This message is for the use case where a user has successfully flashed the firmware component MC Main Application MC Boot ROM BIOS Diagnostics System P...

Page 212: ...y Info Serviceable No Automatically notify support No Alert Category System other SNMP Trap ID 22 CIM Information Prefix IMM ID 0036 User Response Information only no action is required 40000025 00000000 The arg1 on system arg2 is 75 full This message is for the use case where a Management Controller Event Log on a system is 75 full May also be shown as 4000002500000000 or 0x4000002500000000 Sever...

Page 213: ...og 75 full SNMP Trap ID 35 CIM Information Prefix IMM ID 0038 User Response To avoid losing older log entries save the log as a text file and clear the log 40000027 00000000 Platform Watchdog Timer expired for arg1 This message is for the use case when an implementation has detected a Platform Watchdog Timer Expired May also be shown as 4000002700000000 or 0x4000002700000000 Severity Error Service...

Page 214: ...Alert May also be shown as 4000002800000000 or 0x4000002800000000 Severity Info Serviceable No Automatically notify support No Alert Category System other SNMP Trap ID 22 CIM Information Prefix IMM ID 0040 User Response Information only no action is required 40000029 00000000 Security Userid arg1 had arg2 login failures from an SSH client at IP address arg3 This message is for the use case where a...

Page 215: ... where a specific type of firmware mismatch has been detected May also be shown as 4000002a00000000 or 0x4000002a00000000 Severity Error Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID 22 CIM Information Prefix IMM ID 0042 User Response Reflash the IMM firmware to the latest version 4000002b 00000000 Domain name set to arg1 Domain name set by user May also b...

Page 216: ...o Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0044 User Response Information only no action is required 4000002d 00000000 DDNS setting changed to arg1 by user arg2 DDNS setting changed by user May also be shown as 4000002d00000000 or 0x4000002d00000000 Severity Info Serviceable No Automatically notify support No Alert Category none ...

Page 217: ...le No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0046 User Response Information only no action is required 4000002f 00000000 IPv6 enabled by user arg1 IPv6 protocol is enabled by user May also be shown as 4000002f00000000 or 0x4000002f00000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Inform...

Page 218: ...one SNMP Trap ID CIM Information Prefix IMM ID 0048 User Response Information only no action is required 40000031 00000000 IPv6 static IP configuration enabled by user arg1 IPv6 static address assignment method is enabled by user May also be shown as 4000003100000000 or 0x4000003100000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information ...

Page 219: ...refix IMM ID 0050 User Response Information only no action is required 40000033 00000000 IPv6 stateless auto configuration enabled by user arg1 IPv6 statless auto assignment method is enabled by user May also be shown as 4000003300000000 or 0x4000003300000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0051 User Respon...

Page 220: ...sponse Information only no action is required 40000035 00000000 IPv6 DHCP disabled by user arg1 IPv6 DHCP assignment method is disabled by user May also be shown as 4000003500000000 or 0x4000003500000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0053 User Response Information only no action is required 40000036 00000...

Page 221: ...00000 ENET arg1 IPv6 LinkLocal HstName arg2 IP arg3 Pref arg4 IPv6 Link Local address is active May also be shown as 4000003700000000 or 0x4000003700000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0055 User Response Information only no action is required 40000038 00000000 ENET arg1 IPv6 Static HstName arg2 IP arg3 P...

Page 222: ...ss is active May also be shown as 4000003900000000 or 0x4000003900000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0057 User Response Information only no action is required 4000003a 00000000 IPv6 static address of network interface modified from arg1 to arg2 by user arg3 A user modifies the IPv6 static address of a M...

Page 223: ...003b00000000 Severity Warning Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0059 User Response Complete the following steps until the problem is solved 1 Make sure that the IMM network cable is connected 2 Make sure that there is a DHCPv6 server on the network that can assign an IP address to the IMM 4000003c 00000000 Platform Watchdo...

Page 224: ... RNDIS or cdc_ether device driver for the operating system 4 Disable the watchdog 5 Check the integrity of the installed operating system 4000003d 00000000 Telnet port number changed from arg1 to arg2 by user arg3 A user has modified the telnet port number May also be shown as 4000003d00000000 or 0x4000003d00000000 Severity Info Serviceable No Automatically notify support No Alert Category none SN...

Page 225: ...y no action is required 4000003f 00000000 Web HTTP port number changed from arg1 to arg2 by user arg3 A user has modified the Web HTTP port number May also be shown as 4000003f00000000 or 0x4000003f00000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0063 User Response Information only no action is required 40000040 00...

Page 226: ... arg2 by user arg3 A user has modified the CIM HTTP port number May also be shown as 4000004100000000 or 0x4000004100000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0065 User Response Information only no action is required 40000042 00000000 CIM XML HTTPS port number changed from arg1 to arg2 by user arg3 A user has ...

Page 227: ...er May also be shown as 4000004300000000 or 0x4000004300000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0067 User Response Information only no action is required 40000044 00000000 SNMP Traps port number changed from arg1 to arg2 by user arg3 A user has modified the SNMP Traps port number May also be shown as 4000004...

Page 228: ...e shown as 4000004500000000 or 0x4000004500000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0069 User Response Information only no action is required 40000046 00000000 Remote Presence port number changed from arg1 to arg2 by user arg3 A user has modified the Remote Presence port number May also be shown as 4000004600...

Page 229: ...000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0071 User Response Information only no action is required 40000048 00000000 Inventory data changed for device arg1 new device data hash arg2 new master data hash arg3 Something has caused the physical inventory to change May also be shown as 4000004800000000 or 0x40000...

Page 230: ...000 or 0x4000004900000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0073 User Response Information only no action is required 4000004a 00000000 SNMP arg1 disabled by user arg2 A user disabled SNMPv1 or SNMPv3 or Traps May also be shown as 4000004a00000000 or 0x4000004a00000000 Severity Info Serviceable No Automatical...

Page 231: ...tically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0075 User Response Information only no action is required 4000004c 00000000 LDAP Server configuration set by user arg1 SelectionMethod arg2 DomainName arg3 Server1 arg4 Server2 arg5 Server3 arg6 Server4 arg7 A user changed the LDAP server configuration May also be shown as 4000004c00000000 or 0x4000004c0000000...

Page 232: ...000 or 0x4000004d00000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0077 User Response Information only no action is required 4000004e 00000000 Serial Redirection set by user arg1 Mode arg2 BaudRate arg3 StopBits arg4 Parity arg5 SessionTerminateSequence arg6 A user configured the Serial Port mode May also be shown a...

Page 233: ... No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0079 User Response Information only no action is required 40000050 00000000 Server General Settings set by user arg1 Name arg2 Contact arg3 Location arg4 Room arg5 RackID arg6 Rack U position arg7 A user configured the Location setting May also be shown as 4000005000000000 or 0x4000005000000000 Sever...

Page 234: ...e No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0081 User Response Information only no action is required 40000052 00000000 Server arg1 scheduled for arg2 at arg3 by user arg4 A user configured a Server Power action at a specific time May also be shown as 4000005200000000 or 0x4000005200000000 Severity Info Serviceable No Automatically notify sup...

Page 235: ...o Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0083 User Response Information only no action is required 40000054 00000000 Server arg1 arg2 cleared by user arg3 A user cleared a Server Power Action May also be shown as 4000005400000000 or 0x4000005400000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap I...

Page 236: ...tify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0085 User Response Information only no action is required 40000056 00000000 SMTP Server set by user arg1 to arg2 arg3 A user configured the SMTP server May also be shown as 4000005600000000 or 0x4000005600000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information...

Page 237: ...0058 00000000 DNS servers set by user arg1 UseAdditionalServers arg2 PreferredDNStype arg3 IPv4Server1 arg4 IPv4Server2 arg5 IPv4Server3 arg6 IPv6Server1 arg7 IPv6Server2 arg8 IPv6Server3 arg9 A user configures the DNS servers May also be shown as 4000005800000000 or 0x4000005800000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Pre...

Page 238: ...equired 4000005a 00000000 LAN over USB Port Forwarding set by user arg1 ExternalPort arg2 USB LAN port arg3 A user configured USB LAN port forwarding May also be shown as 4000005a00000000 or 0x4000005a00000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0090 User Response Information only no action is required 4000005b...

Page 239: ...g1 by user arg2 A user enables or disables Secure CIM XML services May also be shown as 4000005c00000000 or 0x4000005c00000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0092 User Response Information only no action is required 4000005d 00000000 Secure LDAP arg1 by user arg2 A user enables or disables Secure LDAP serv...

Page 240: ...00 or 0x4000005e00000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0094 User Response Information only no action is required 4000005f 00000000 Server timeouts set by user arg1 EnableOSWatchdog arg2 OSWatchdogTimout arg3 EnableLoaderWatchdog arg4 LoaderTimeout arg5 A user configures Server Timeouts May also be shown a...

Page 241: ...ay also be shown as 4000006000000000 or 0x4000006000000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0096 User Response Information only no action is required 40000061 00000000 License key for arg1 removed by user arg2 A user removes a License Key May also be shown as 4000006100000000 or 0x4000006100000000 Severity I...

Page 242: ...rt No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0098 User Response Information only no action is required 40000063 00000000 Global Login Account Security set by user arg1 PasswordRequired arg2 PasswordExpirationPeriod arg3 MinimumPasswordReuseCycle arg4 MinimumPasswordLength arg5 MinimumPasswordChangeInterval arg6 MaxmumLoginFailures arg7 LockoutAfterMaxFailures arg8 MinimumDi...

Page 243: ... May also be shown as 4000006400000000 or 0x4000006400000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0100 User Response Information only no action is required 40000065 00000000 User arg1 removed A user account was deleted May also be shown as 4000006500000000 or 0x4000006500000000 Severity Info Appendix A Integrate...

Page 244: ...e shown as 4000006600000000 or 0x4000006600000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0102 User Response Information only no action is required 40000067 00000000 User arg1 role set to arg2 A user account role assigned May also be shown as 4000006700000000 or 0x4000006700000000 Severity Info Serviceable No 230 I...

Page 245: ...erviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0104 User Response Information only no action is required 40000069 00000000 User arg1 for SNMPv3 set AuthenticationProtocol arg2 PrivacyProtocol arg3 AccessType arg4 HostforTraps arg5 User account SNMPv3 settings changed May also be shown as 4000006900000000 or 0x4000006900000000 Severity I...

Page 246: ...verity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0106 User Response Information only no action is required 4000006b 00000000 SSH Client key imported for user arg1 from arg2 User imported an SSH Client key May also be shown as 4000006b00000000 or 0x4000006b00000000 Severity Info Serviceable No Automatically notify support No A...

Page 247: ...upport No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0108 User Response Information only no action is required 4000006d 00000000 Management Controller arg1 Configuration saved to a file by user arg2 A user saves a Management Controller configuration to a file May also be shown as 4000006d00000000 or 0x4000006d00000000 Severity Info Serviceable No Automatically notify support No...

Page 248: ...lly notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0110 User Response Information only no action is required 4000006f 00000000 Alert Recipient Number arg1 updated Name arg2 DeliveryMethod arg3 Address arg4 IncludeLog arg5 Enabled arg6 EnabledAlerts arg7 AllowedFilters arg8 A user adds or updates an Alert Recipient May also be shown as 4000006f00000000 or 0x4000006...

Page 249: ...ify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0112 User Response Information only no action is required 40000071 00000000 The power cap value changed from arg1 watts to arg2 watts by user arg3 Power Cap values changed by user May also be shown as 4000007100000000 or 0x4000007100000000 Severity Info Serviceable No Automatically notify support No Alert Category none S...

Page 250: ...MP Trap ID CIM Information Prefix IMM ID 0114 User Response Information only no action is required 40000073 00000000 The maximum power cap value changed from arg1 watts to arg2 watts Maximum Power Cap value changed May also be shown as 4000007300000000 or 0x4000007300000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0...

Page 251: ...nformation only no action is required 40000075 00000000 The measured power value exceeded the power cap value Power exceeded cap May also be shown as 4000007500000000 or 0x4000007500000000 Severity Warning Serviceable No Automatically notify support No Alert Category Warning Power SNMP Trap ID 164 CIM Information Prefix IMM ID 0117 User Response Information only no action is required 40000076 0000...

Page 252: ...apping was activated by user arg1 Power capping activated by user May also be shown as 4000007700000000 or 0x4000007700000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0119 User Response Information only no action is required 40000078 00000000 Power capping was deactivated by user arg1 Power capping deactivated by us...

Page 253: ...r May also be shown as 4000007900000000 or 0x4000007900000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0121 User Response Information only no action is required 4000007a 00000000 Static Power Savings mode has been turned off by user arg1 Static Power Savings mode turned off by user May also be shown as 4000007a00000...

Page 254: ...so be shown as 4000007b00000000 or 0x4000007b00000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0123 User Response Information only no action is required 4000007c 00000000 Dynamic Power Savings mode has been turned off by user arg1 Dynamic Power Savings mode turned off by user May also be shown as 4000007c00000000 or...

Page 255: ... 4000007d00000000 or 0x4000007d00000000 Severity Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0125 User Response Information only no action is required 4000007e 00000000 External throttling occurred External throttling occurred May also be shown as 4000007e00000000 or 0x4000007e00000000 Severity Info Serviceable No Automatically...

Page 256: ... Info Serviceable No Automatically notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0127 User Response Information only no action is required 40000080 00000000 Remote Control session started by user arg1 in arg2 mode Remote Control session started May also be shown as 4000008000000000 or 0x4000008000000000 Severity Info Serviceable No Automatically notify support No...

Page 257: ...notify support No Alert Category none SNMP Trap ID CIM Information Prefix IMM ID 0129 User Response Information only no action is required 40000082 00000000 The measured power value has returned below the power cap value Power exceeded cap recovered May also be shown as 4000008200000000 or 0x4000008200000000 Severity Info Serviceable No Automatically notify support No Alert Category Warning Power ...

Page 258: ...rt Category Warning Power SNMP Trap ID 164 CIM Information Prefix IMM ID 0131 User Response Information only no action is required 40000084 00000000 IMM firmware mismatch between nodes arg1 and arg2 Please attempt to flash the IMM firmware to the same level on all nodes A mismatch of IMM firmware has been detected between nodes May also be shown as 4000008400000000 or 0x4000008400000000 Severity E...

Page 259: ...00000 Severity Error Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID 22 CIM Information Prefix IMM ID 0133 User Response Attempt to flash the FPGA firmware to the same level on all nodes 40000086 00000000 Test Call Home Generated by user arg1 Test Call Home generated by user May also be shown as 4000008600000000 or 0x4000008600000000 Severity Info Serviceabl...

Page 260: ...ponse IBM Support will address the problem 40000088 00000000 Management Controller arg1 Configuration restoration from a file by user arg2 completed This message is for the use case where a user restores a Management Controller configuration from a file and it completes May also be shown as 4000008800000000 or 0x4000008800000000 Severity Info Serviceable No Automatically notify support No Alert Ca...

Page 261: ...esponse 1 Turn off the server and disconnect it from the power source You must disconnect the server from ac power to reset the IMM 2 After 45 seconds reconnect the server to the power source and turn on the server 3 Retry the operation 4000008a 00000000 Management Controller arg1 Configuration restoration from a file by user arg2 failed to start This message is for the use case where a user resto...

Page 262: ...s changed May also be shown as 4000008b00000000 or 0x4000008b00000000 Severity Info Serviceable No Automatically notify support No Alert Category System IMM Network event SNMP Trap ID 37 CIM Information Prefix IMM ID 0139 User Response Information only no action is required 80010002 0701ffff Numeric sensor NumericSensorElementName going low lower non critical has asserted CMOS Battery This message...

Page 263: ... User Response If the specified sensor is CMOS battery replace the system battery If the specified sensor is Planar 3 3V or Planar 5V trained technician only replace the system board If the specified sensor is Planar 12V complete the following steps until the problem is solved 1 Check power supply n LED 2 Remove the failing power supply 3 Follow actions in Power Problems and Solving Power Problems...

Page 264: ...s to the airflow both front and rear of the server 4 Reduce the Ambient temperature The system must be operating within the specifications see Features and specifications for more information 80010701 0702ffff Numeric sensor NumericSensorElementName going high upper non critical has asserted DIMM AB Temp This message is for the use case when an implementation has detected an Upper Non critical sen...

Page 265: ... or 0x800107010703ffff Severity Warning Serviceable Yes Automatically notify support No Alert Category Warning Temperature SNMP Trap ID 12 CIM Information Prefix PLAT ID 0490 User Response 1 Make sure there is a node filler correctly installed for the empty node slot 2 Make sure the air baffles are placed and correctly installed and make sure the node cover is installed and completely closed 3 Mak...

Page 266: ...to the airflow both front and rear of the server 4 Reduce the Ambient temperature The system must be operating within the specifications see Features and specifications for more information 80010701 1001ffff Numeric sensor NumericSensorElementName going high upper non critical has asserted PCI Riser 1 Temp This message is for the use case when an implementation has detected an Upper Non critical s...

Page 267: ...or 0x800107011002ffff Severity Warning Serviceable Yes Automatically notify support No Alert Category Warning Temperature SNMP Trap ID 12 CIM Information Prefix PLAT ID 0490 User Response 1 Make sure there is a node filler correctly installed for the empty node slot 2 Make sure the air baffles are placed and correctly installed and make sure the node cover is installed and completely closed 3 Make...

Page 268: ...to the airflow both front and rear of the server 4 Reduce the Ambient temperature The system must be operating within the specifications see Features and specifications for more information 80010701 1502ffff Numeric sensor NumericSensorElementName going high upper non critical has asserted GPU Outlet Temp This message is for the use case when an implementation has detected an Upper Non critical se...

Page 269: ...or 0x800107011a01ffff Severity Warning Serviceable Yes Automatically notify support No Alert Category Warning Temperature SNMP Trap ID 12 CIM Information Prefix PLAT ID 0490 User Response 1 Make sure there is a node filler correctly installed for the empty node slot 2 Make sure the air baffles are placed and correctly installed and make sure the node cover is installed and completely closed 3 Make...

Page 270: ...ons to the airflow both front and rear of the server 4 Reduce the Ambient temperature The system must be operating within the specifications see Features and specifications for more information 80010701 2d01ffff Numeric sensor NumericSensorElementName going high upper non critical has asserted PCH Temp This message is for the use case when an implementation has detected an Upper Non critical senso...

Page 271: ... or 0x800109010701ffff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0494 User Response 1 Make sure there is a node filler correctly installed for the empty node slot 2 Make sure the air baffles are placed and correctly installed and make sure the node cover is installed and completely closed 3 Make ...

Page 272: ...ons to the airflow both front and rear of the server 4 Reduce the Ambient temperature The system must be operating within the specifications see Features and specifications for more information 80010901 0703ffff Numeric sensor NumericSensorElementName going high upper critical has asserted CPU1 VR Temp VCO This message is for the use case when an implementation has detected an Upper Critical senso...

Page 273: ...or 0x800109010704ffff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0494 User Response 1 Make sure there is a node filler correctly installed for the empty node slot 2 Make sure the air baffles are placed and correctly installed and make sure the node cover is installed and completely closed 3 Make s...

Page 274: ...ons to the airflow both front and rear of the server 4 Reduce the Ambient temperature The system must be operating within the specifications see Features and specifications for more information 80010901 1002ffff Numeric sensor NumericSensorElementName going high upper critical has asserted PCI Riser 2 Temp This message is for the use case when an implementation has detected an Upper Critical senso...

Page 275: ... or 0x800109011501ffff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0494 User Response 1 Make sure there is a node filler correctly installed for the empty node slot 2 Make sure the air baffles are placed and correctly installed and make sure the node cover is installed and completely closed 3 Make ...

Page 276: ...ons to the airflow both front and rear of the server 4 Reduce the Ambient temperature The system must be operating within the specifications see Features and specifications for more information 80010901 1a01ffff Numeric sensor NumericSensorElementName going high upper critical has asserted HDD Outlet Temp This message is for the use case when an implementation has detected an Upper Critical sensor...

Page 277: ...fff or 0x800109012c01ffff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0494 User Response 1 Make sure there is a node filler correctly installed for the empty node slot 2 Make sure the air baffles are placed and correctly installed and make sure the node cover is installed and completely closed 3 Ma...

Page 278: ... obstructions to the airflow both front and rear of the server 4 Reduce the Ambient temperature The system must be operating within the specifications see Features and specifications for more information 80010902 0701ffff Numeric sensor NumericSensorElementName going high upper critical has asserted This message is for the use case when an implementation has detected an Upper Critical sensor going...

Page 279: ...gory Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0498 User Response 1 Make sure there is a node filler correctly installed for the empty node slot 2 Make sure the air baffles are placed and correctly installed and make sure the node cover is installed and completely closed 3 Make sure the fans are operating and there are no obstructions to the airflow both front and rear of ...

Page 280: ...e The system must be operating within the specifications see Features and specifications for more information 80010b01 0703ffff Numeric sensor NumericSensorElementName going high upper non recoverable has asserted CPU1 VR Temp VCO This message is for the use case when an implementation has detected an Upper Non recoverable sensor going high has asserted May also be shown as 80010b010703ffff or 0x8...

Page 281: ...port No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0498 User Response 1 Make sure there is a node filler correctly installed for the empty node slot 2 Make sure the air baffles are placed and correctly installed and make sure the node cover is installed and completely closed 3 Make sure the fans are operating and there are no obstructions to the airflow both ...

Page 282: ...e The system must be operating within the specifications see Features and specifications for more information 80010b01 1002ffff Numeric sensor NumericSensorElementName going high upper non recoverable has asserted PCI Riser 2 Temp This message is for the use case when an implementation has detected an Upper Non recoverable sensor going high has asserted May also be shown as 80010b011002ffff or 0x8...

Page 283: ...upport No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0498 User Response 1 Make sure there is a node filler correctly installed for the empty node slot 2 Make sure the air baffles are placed and correctly installed and make sure the node cover is installed and completely closed 3 Make sure the fans are operating and there are no obstructions to the airflow bot...

Page 284: ...re The system must be operating within the specifications see Features and specifications for more information 80010b01 1a01ffff Numeric sensor NumericSensorElementName going high upper non recoverable has asserted HDD Outlet Temp This message is for the use case when an implementation has detected an Upper Non recoverable sensor going high has asserted May also be shown as 80010b011a01ffff or 0x8...

Page 285: ... support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0498 User Response 1 Make sure there is a node filler correctly installed for the empty node slot 2 Make sure the air baffles are placed and correctly installed and make sure the node cover is installed and completely closed 3 Make sure the fans are operating and there are no obstructions to the airflow b...

Page 286: ... 4 Reduce the Ambient temperature The system must be operating within the specifications see Features and specifications for more information 80030006 2101ffff Sensor SensorElementName has deasserted Sig Verify Fail This message is for the use case when an implementation has detected a Sensor has deasserted May also be shown as 800300062101ffff or 0x800300062101ffff Severity Info Serviceable No Au...

Page 287: ...ssage is for the use case when an implementation has detected a Sensor has deasserted May also be shown as 800300122301ffff or 0x800300122301ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0509 User Response No action information only 8007010f 2201ffff Sensor SensorElementName has transitioned from normal to ...

Page 288: ...rupt disk 8007010f 2582ffff Sensor SensorElementName has transitioned from normal to non critical state I O Resources This message is for the use case when an implementation has detected a Sensor transitioned to non critical from normal May also be shown as 8007010f2582ffff or 0x8007010f2582ffff Severity Warning Serviceable Yes Automatically notify support No Alert Category Warning Other SNMP Trap...

Page 289: ... Information Prefix PLAT ID 0520 User Response 1 Complete the administrative tasks that require the TPM physical presence switch to be in the ON position 2 Restore the physical presence switch to the OFF position 3 Reboot the system 4 Trained technician only If the error continues replace the planar 80070128 2e01ffff Sensor SensorElementName has transitioned from normal to non critical state ME Re...

Page 290: ...ity Error Serviceable Yes Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0522 User Response 1 Make sure there is a node filler correctly installed for the empty node slot 2 Make sure the air baffles are placed and correctly installed and make sure the node cover is installed and completely closed 3 Make sure the fans are operating ...

Page 291: ...alled and make sure the node cover is installed and completely closed 3 Make sure the fans are operating and there are no obstructions to the airflow both front and rear of the server 4 Reduce the Ambient temperature The system must be operating within the specifications see Features and specifications for more information 5 Make sure the PCI adapter is supported by the server To confirm see the I...

Page 292: ...ications see Features and specifications for more information 5 Make sure the PCI adapter is supported by the server To confirm see the IBM ServerProven website 6 Replace the PCI adapter and make sure the PCI adapter is functioning normally 80070201 1102ffff Sensor SensorElementName has transitioned to critical from a less severe state PCI 2 Temp This message is for the use case when an implementa...

Page 293: ...ation has detected a Sensor transitioned to critical from less severe May also be shown as 800702011103ffff or 0x800702011103ffff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0522 User Response 1 Make sure there is a node filler correctly installed for the empty node slot 2 Make sure the air baffles...

Page 294: ... closed 3 Make sure the fans are operating and there are no obstructions to the airflow both front and rear of the server 4 Reduce the Ambient temperature The system must be operating within the specifications see Features and specifications for more information 5 Make sure the PCI adapter is supported by the server To confirm see the IBM ServerProven website 6 Replace the PCI adapter and make sur...

Page 295: ...1ffff Sensor SensorElementName has transitioned to critical from a less severe state PIB Fault This message is for the use case when an implementation has detected a Sensor transitioned to critical from less severe May also be shown as 800702021501ffff or 0x800702021501ffff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical Voltage SNMP Trap ID 1 CIM Information...

Page 296: ...t of a cluster solution verify that the latest level of code is supported for the cluster solution before you update the code 5 Trained technician only Replace the system board 8007020f 2201ffff Sensor SensorElementName has transitioned to critical from a less severe state TXT ACM Module This message is for the use case when an implementation has detected a Sensor transitioned to critical from les...

Page 297: ...0x8007020f2582ffff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0522 User Response Complete the following step to solve PCI I O resource errors 1 Go to F1 Setup 2 System Settings 3 Device and I O ports 4 PCI 64 bit Resource and choose enable 80070214 2201ffff Sensor SensorElementName has transitioned to ...

Page 298: ...00702190701ffff or 0x800702190701ffff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0522 User Response 1 Check for an error LED on the system board 2 Check the system event log 3 Check for the system firmware version and update to the latest version Important Some cluster solutions require specific code l...

Page 299: ...alled microprocessors are compatible 3 Make sure the microprocessor 2 expansion board is installed correctly see Installing the microprocessor 2 expansion board 4 Trained technician only Replace microprocessor 2 5 Trained technician only Replace microprocessor 2 expansion board 8007021b 0302ffff Sensor SensorElementName has transitioned to critical from a less severe state CPU 2 QPILinkErr This me...

Page 300: ...n implementation has detected a Sensor transitioned to critical from less severe May also be shown as 800702282e01ffff or 0x800702282e01ffff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0522 User Response If the specified sensor is IPMB IO Error Me Error or ME Flash Error complete the following steps unt...

Page 301: ...tructions to the airflow both front and rear of the server 4 Reduce the Ambient temperature The system must be operating within the specifications see Features and specifications for more information 5 Make sure the PCI adapter is supported by the server To confirm see the IBM ServerProven website 6 Replace the PCI adapter and make sure the PCI adapter is functioning normally 80070301 0302ffff Sen...

Page 302: ...e IBM ServerProven website 6 Replace the PCI adapter and make sure the PCI adapter is functioning normally 80070301 1101ffff Sensor SensorElementName has transitioned to non recoverable from a less severe state PCI 1 Temp This message is for the use case when an implementation has detected a Sensor transitioned to non recoverable from less severe May also be shown as 800703011101ffff or 0x80070301...

Page 303: ...le Yes Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0524 User Response 1 Make sure there is a node filler correctly installed for the empty node slot 2 Make sure the air baffles are placed and correctly installed and make sure the node cover is installed and completely closed 3 Make sure the fans are operating and there are no ob...

Page 304: ...no obstructions to the airflow both front and rear of the server 4 Reduce the Ambient temperature The system must be operating within the specifications see Features and specifications for more information 5 Make sure the PCI adapter is supported by the server To confirm see the IBM ServerProven website 6 Replace the PCI adapter and make sure the PCI adapter is functioning normally 80070301 1104ff...

Page 305: ... To confirm see the IBM ServerProven website 6 Replace the PCI adapter and make sure the PCI adapter is functioning normally 80070614 2201ffff Sensor SensorElementName has transitioned to non recoverable TPM Phy Pres Set This message is for the use case when an implementation has detected a Sensor transitioned to non recoverable May also be shown as 800706142201ffff or 0x800706142201ffff Severity ...

Page 306: ... No action information only 80080128 2101ffff Device LogicalDeviceElementName has been added Low Security Jmp This message is for the use case when an implementation has detected a Device was inserted May also be shown as 800801282101ffff or 0x800801282101ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0536 U...

Page 307: ...g in the Setup utility 800b030c 2581ffff Non redundant Sufficient Resources from Redundancy Degraded or Fully Redundant for RedundancySetElementName has asserted Backup Memory This message is for the use case when a Redundancy Set has transitioned from Redundancy Degraded or Fully Redundant to Non redundant Sufficient May also be shown as 800b030c2581ffff or 0x800b030c2581ffff Severity Warning Ser...

Page 308: ...ceable Yes Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0810 User Response 1 Check the system event log for DIMM failure events uncorrectable or PFA and correct the failures 2 Re enable mirroring in the Setup utility 806f0007 0301ffff ProcessorElementName has Failed with IERR CPU 1 This message is for the use case when an implementat...

Page 309: ...or number 806f0007 0302ffff ProcessorElementName has Failed with IERR CPU 2 This message is for the use case when an implementation has detected a Processor Failed IERR Condition May also be shown as 806f00070302ffff or 0x806f00070302ffff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical CPU SNMP Trap ID 40 CIM Information Prefix PLAT ID 0042 User Response 1 Ma...

Page 310: ...091301ffff Severity Info Serviceable No Automatically notify support No Alert Category System Power Off SNMP Trap ID 23 CIM Information Prefix PLAT ID 0106 User Response No action information only 806f000f 220101ff The System ComputerSystemElementName has detected no memory in the system ABR Status This message is for the use case when an implementation has detected that memory was detected in the...

Page 311: ...eable Yes Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0132 User Response This is a UEFI detected event The UEFI POST error code for this event can be found in the logged IMM message text Please refer to the UEFI POST error code in the UEFI POST error code section of the Information Center for the appropriate user response Firmware E...

Page 312: ... as 806f000f220104ff or 0x806f000f220104ff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0795 User Response This is a UEFI detected event The UEFI diagnostic code for this event can be found in the logged IMM message text Please refer to the UEFI POST error code in the UEFI POST error code section of the ...

Page 313: ...etected that System Firmware Error No video device detected has occurred May also be shown as 806f000f22010aff or 0x806f000f22010aff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0766 User Response This is a UEFI detected event The UEFI POST error for this event can be found in the logged IMM message text...

Page 314: ...e updates If the device is part of a cluster solution verify that the latest level of code is supported for the cluster solution before you update the code 4 Remove components one at a time restarting the server each time to see if the problem goes away 5 If the problem remains trained service technician replace the system board Firmware Error Sys Boot Status 806f000f 22010cff CPU voltage mismatch...

Page 315: ...ically notify support No Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0184 User Response This is a UEFI detected event The UEFI POST error code for this event can be found in the logged IMM message text Please refer to the UEFI POST error code in the UEFI POST error code section of the Information Center for the appropriate user response Firmware Error Sys Boot Stat...

Page 316: ...06f00212201ffff or 0x806f00212201ffff Severity Error Serviceable Yes Automatically notify support Yes Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0330 User Response 1 Check the PCI LED 2 Reseat the affected adapters and riser card 3 Update the server firmware UEFI and IMM and adapter firmware Important Some cluster solutions require specific code levels or coordina...

Page 317: ...re Important Some cluster solutions require specific code levels or coordinated code updates If the device is part of a cluster solution verify that the latest level of code is supported for the cluster solution before you update the code 4 Remove both adapters 5 Replace the riser card 6 Trained service technicians only Replace the system board 806f0021 2c01ffff Fault in slot PhysicalConnectorSyst...

Page 318: ...onnectorSystemElementName on system ComputerSystemElementName PCI 1 This message is for the use case when an implementation has detected a Fault in a slot May also be shown as 806f00213001ffff or 0x806f00213001ffff Severity Error Serviceable Yes Automatically notify support Yes Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0330 User Response 1 Check the PCI LED 2 Res...

Page 319: ...adapters and riser card 3 Update the server firmware UEFI and IMM and adapter firmware Important Some cluster solutions require specific code levels or coordinated code updates If the device is part of a cluster solution verify that the latest level of code is supported for the cluster solution before you update the code 4 Remove both adapters 5 Replace the riser card 6 Trained service technicians...

Page 320: ...ed for the cluster solution before you update the code 4 Remove both adapters 5 Replace the riser card 6 Trained service technicians only Replace the system board 806f0021 3004ffff Fault in slot PhysicalConnectorSystemElementName on system ComputerSystemElementName PCI 4 This message is for the use case when an implementation has detected a Fault in a slot May also be shown as 806f00213004ffff or ...

Page 321: ... for the use case when an implementation has detected a Watchdog Timer Expired May also be shown as 806f00232101ffff or 0x806f00232101ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0368 User Response No action information only 806f0028 2101ffff Sensor SensorElementName is unavailable or degraded on managemen...

Page 322: ...s Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0036 User Response 1 Make sure that the fans are operating There are no obstructions to the airflow front and rear of the server the air baffles are in place and correctly installed and the server cover is installed and completely closed 2 Make sure that the heat sink for microproces...

Page 323: ...ed 2 Make sure that the heat sink for microprocessor n is installed correctly 3 Trained technician only Replace microprocessor n n microprocessor number 806f0109 1301ffff PowerSupplyElementName has been Power Cycled Host Power This message is for the use case when an implementation has detected a Power Unit that has been power cycled May also be shown as 806f01091301ffff or 0x806f01091301ffff Seve...

Page 324: ...the same DIMM connector check the DIMM connector If the connector contains any foreign material or is damaged replace the system board 5 Trained technician only Remove the affected microprocessor and check the microprocessor socket pins for any damaged pins If a damage is found replace the system board 6 Trained technician only Replace the affected microprocessor 7 Manually re enable all affected ...

Page 325: ...and check the microprocessor socket pins for any damaged pins If a damage is found replace the system board 6 Trained technician only Replace the affected microprocessor 7 Manually re enable all affected DIMMs if the server firmware version is older than UEFI v1 10 If the server firmware version is UEFI v1 10 or newer disconnect and reconnect the server to the power source and restart the server 8...

Page 326: ... only Replace the affected microprocessor 7 Manually re enable all affected DIMMs if the server firmware version is older than UEFI v1 10 If the server firmware version is UEFI v1 10 or newer disconnect and reconnect the server to the power source and restart the server 8 Trained Service technician only Replace the affected microprocessor 806f010c 2004ffff Uncorrectable error detected for Physical...

Page 327: ...table error detected for PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 5 This message is for the use case when an implementation has detected a Memory uncorrectable error May also be shown as 806f010c2005ffff or 0x806f010c2005ffff Severity Error Serviceable Yes Automatically notify support Yes Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0138 User R...

Page 328: ...ble retain tip or firmware update that applies to this memory error 2 Swap the affected DIMMs as indicated by the error LEDs on the system board or the event logs to a different memory channel or microprocessor 3 If the problem follows the DIMM replace the failing DIMM 4 Trained technician only If the problem occurs on the same DIMM connector check the DIMM connector If the connector contains any ...

Page 329: ... DIMM connector check the DIMM connector If the connector contains any foreign material or is damaged replace the system board 5 Trained technician only Remove the affected microprocessor and check the microprocessor socket pins for any damaged pins If a damage is found replace the system board 6 Trained technician only Replace the affected microprocessor 7 Manually re enable all affected DIMMs if...

Page 330: ...nd check the microprocessor socket pins for any damaged pins If a damage is found replace the system board 6 Trained technician only Replace the affected microprocessor 7 Manually re enable all affected DIMMs if the server firmware version is older than UEFI v1 10 If the server firmware version is UEFI v1 10 or newer disconnect and reconnect the server to the power source and restart the server 8 ...

Page 331: ... enable all affected DIMMs if the server firmware version is older than UEFI v1 10 If the server firmware version is UEFI v1 10 or newer disconnect and reconnect the server to the power source and restart the server 8 Trained Service technician only Replace the affected microprocessor 806f010d 0401ffff The Drive StorageVolumeElementName has been disabled due to a detected fault Computer HDD0 This ...

Page 332: ... ID 0164 User Response 1 Run the hard disk drive diagnostic test on drive n 2 Reseat the following components a Hard disk drive wait 1 minute or more before reinstalling the drive b Cable from the system board to the backplane 3 Replace the following components one at a time in the order shown restarting the server each time a Hard disk drive b Cable from the system board to the backplane c Hard d...

Page 333: ...ter HDD3 This message is for the use case when an implementation has detected a Drive was Disabled due to fault May also be shown as 806f010d0404ffff or 0x806f010d0404ffff Severity Error Serviceable Yes Automatically notify support Yes Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0164 User Response 1 Run the hard disk drive diagnostic test on drive n 2 Rese...

Page 334: ...inute or more before reinstalling the drive b Cable from the system board to the backplane 3 Replace the following components one at a time in the order shown restarting the server each time a Hard disk drive b Cable from the system board to the backplane c Hard disk drive backplane n hard disk drive number 806f010d 0406ffff The Drive StorageVolumeElementName has been disabled due to a detected fa...

Page 335: ...407ffff Severity Error Serviceable Yes Automatically notify support Yes Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0164 User Response 1 Run the hard disk drive diagnostic test on drive n 2 Reseat the following components a Hard disk drive wait 1 minute or more before reinstalling the drive b Cable from the system board to the backplane 3 Replace the follo...

Page 336: ...e a Hard disk drive b Cable from the system board to the backplane c Hard disk drive backplane n hard disk drive number 806f010d 0409ffff The Drive StorageVolumeElementName has been disabled due to a detected fault 1U Storage HDD4 This message is for the use case when an implementation has detected a Drive was Disabled due to fault May also be shown as 806f010d0409ffff or 0x806f010d0409ffff Severi...

Page 337: ... Yes Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0164 User Response 1 Run the hard disk drive diagnostic test on drive n 2 Reseat the following components a Hard disk drive wait 1 minute or more before reinstalling the drive b Cable from the system board to the backplane 3 Replace the following components one at a time in the order shown restarting the ser...

Page 338: ...detected fault 1U Storage HDD7 This message is for the use case when an implementation has detected a Drive was Disabled due to fault May also be shown as 806f010d040cffff or 0x806f010d040cffff Severity Error Serviceable Yes Automatically notify support Yes Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0164 User Response 1 Run the hard disk drive diagnostic ...

Page 339: ...rmware on the primary page Important Some cluster solutions require specific code levels or coordinated code updates If the device is part of a cluster solution verify that the latest level of code is supported for the cluster solution before you update the code 3 Trained technician only Replace the system board 806f0113 0301ffff A bus timeout has occurred on system ComputerSystemElementName CPU 1...

Page 340: ... Automatically notify support No Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0224 User Response 1 Reseat the microprocessor and then restart the server 2 Replace microprocessor n n microprocessor number 806f0123 2101ffff Reboot of system ComputerSystemElementName initiated by WatchdogElementName IPMI Watchdog This message is for the use case when an implementation ...

Page 341: ...y Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0392 User Response No action information only 806f0125 1002ffff ManagedElementName detected as absent PCI Riser 2 This message is for the use case when an implementation has detected a Managed Element is Absent May also be shown as 806f01251002ffff or 0x806f01251002ffff Sev...

Page 342: ...onse If there is no GPU storage tray installed in the system then the log event is a normal condition If there is a GPU storage tray installed in the system then check the following two portions 1 PDB Power Distribution Board cable is correctly connected from riser card to PDB 2 Replace another PDB cable 806f0125 2c01ffff ManagedElementName detected as absent Mezz Card This message is for the use ...

Page 343: ...nd standard devices such as Ethernet SCSI and SAS Important Some cluster solutions require specific code levels or coordinated code updates If the device is part of a cluster solution verify that the latest level of code is supported for the cluster solution before you update the code 2 Update the firmware UEFI and IMM to the latest level Updating the firmware 3 Run the DSA program 4 Reseat the ad...

Page 344: ...upported for the cluster solution before you update the code 2 Update the firmware UEFI and IMM to the latest level Updating the firmware 3 Run the DSA program 4 Reseat the adapter 5 Replace the adapter 6 Trained technician only Replace microprocessor n 7 Trained technician only Replace the system board n microprocessor number 806f0207 2584ffff ProcessorElementName has Failed with FRB1 BIST condit...

Page 345: ...or number 806f020d 0401ffff Failure Predicted on drive StorageVolumeElementName for array ComputerSystemElementName Computer HDD0 This message is for the use case when an implementation has detected an Array Failure is Predicted May also be shown as 806f020d0401ffff or 0x806f020d0401ffff Severity Warning Serviceable Yes Automatically notify support Yes Alert Category System Predicted Failure SNMP ...

Page 346: ...a Hard disk drive b Cable from the system board to the backplane 3 Replace the following components one at a time in the order shown restarting the server each time a Hard disk drive b Cable from the system board to the backplane c Hard disk drive backplane n hard disk drive number 806f020d 0403ffff Failure Predicted on drive StorageVolumeElementName for array ComputerSystemElementName Computer HD...

Page 347: ...4ffff Severity Warning Serviceable Yes Automatically notify support Yes Alert Category System Predicted Failure SNMP Trap ID 27 CIM Information Prefix PLAT ID 0168 User Response 1 Run the hard disk drive diagnostic test on drive n 2 Reseat the following components a Hard disk drive b Cable from the system board to the backplane 3 Replace the following components one at a time in the order shown re...

Page 348: ...om the system board to the backplane c Hard disk drive backplane n hard disk drive number 806f020d 0406ffff Failure Predicted on drive StorageVolumeElementName for array ComputerSystemElementName 1U Storage HDD1 This message is for the use case when an implementation has detected an Array Failure is Predicted May also be shown as 806f020d0406ffff or 0x806f020d0406ffff Severity Warning Serviceable ...

Page 349: ... Yes Alert Category System Predicted Failure SNMP Trap ID 27 CIM Information Prefix PLAT ID 0168 User Response 1 Run the hard disk drive diagnostic test on drive n 2 Reseat the following components a Hard disk drive b Cable from the system board to the backplane 3 Replace the following components one at a time in the order shown restarting the server each time a Hard disk drive b Cable from the sy...

Page 350: ...terSystemElementName 1U Storage HDD4 This message is for the use case when an implementation has detected an Array Failure is Predicted May also be shown as 806f020d0409ffff or 0x806f020d0409ffff Severity Warning Serviceable Yes Automatically notify support Yes Alert Category System Predicted Failure SNMP Trap ID 27 CIM Information Prefix PLAT ID 0168 User Response 1 Run the hard disk drive diagno...

Page 351: ...d disk drive b Cable from the system board to the backplane 3 Replace the following components one at a time in the order shown restarting the server each time a Hard disk drive b Cable from the system board to the backplane c Hard disk drive backplane n hard disk drive number 806f020d 040bffff Failure Predicted on drive StorageVolumeElementName for array ComputerSystemElementName 1U Storage HDD6 ...

Page 352: ...f020d040cffff Severity Warning Serviceable Yes Automatically notify support Yes Alert Category System Predicted Failure SNMP Trap ID 27 CIM Information Prefix PLAT ID 0168 User Response 1 Run the hard disk drive diagnostic test on drive n 2 Reseat the following components a Hard disk drive b Cable from the system board to the backplane 3 Replace the following components one at a time in the order ...

Page 353: ...ically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0136 User Response Note Each time you install or remove a DIMM you must disconnect the server from the power source then wait 10 seconds before restarting the server 1 Check the IBM support website for an applicable retain tip or firmware update that applies to this memory error 2 Make sure that ...

Page 354: ... or 0x806f030c2002ffff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0136 User Response Note Each time you install or remove a DIMM you must disconnect the server from the power source then wait 10 seconds before restarting the server 1 Check the IBM support website for an applicable retain tip or firmwa...

Page 355: ...he IBM support website for an applicable retain tip or firmware update that applies to this memory error 2 Make sure that the DIMMs are firmly seated and no foreign material is found in the DIMM connector Then retry with the same DIMM 3 If the problem is related to a DIMM replace the failing DIMM indicated by the error LEDs 4 If the problem occurs on the same DIMM connector swap the affected DIMMs...

Page 356: ... DIMM replace the failing DIMM indicated by the error LEDs 4 If the problem occurs on the same DIMM connector swap the affected DIMMs as indicated by the error LEDs on the system board or the event logs to a different memory channel or microprocessor 5 Trained technician only If the problem occurs on the same DIMM connector check the DIMM connector If the connector contains any foreign material or...

Page 357: ...e system board or the event logs to a different memory channel or microprocessor 5 Trained technician only If the problem occurs on the same DIMM connector check the DIMM connector If the connector contains any foreign material or is damaged replace the system board 6 Trained service technician only Remove the affected microprocessor and check the microprocessor socket pins for any damaged pins If...

Page 358: ...ssor 5 Trained technician only If the problem occurs on the same DIMM connector check the DIMM connector If the connector contains any foreign material or is damaged replace the system board 6 Trained service technician only Remove the affected microprocessor and check the microprocessor socket pins for any damaged pins If a damage is found replace the system board 7 Trained service technician onl...

Page 359: ...or socket pins for any damaged pins If a damage is found replace the system board 7 Trained service technician only If the problem is related to microprocessor socket pins replace the system board 806f030c 2008ffff Scrub Failure for PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 8 This message is for the use case when an implementation has detected a Memory Scrub failure May also be...

Page 360: ...f0313 1701ffff A software NMI has occurred on system ComputerSystemElementName NMI State This message is for the use case when an implementation has detected a Software NMI May also be shown as 806f03131701ffff or 0x806f03131701ffff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0228 User Response 1 Check ...

Page 361: ...upport No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0131 User Response 1 Make sure the DIMM is installed correctly 2 If the DIMM was disabled because of a memory fault memory uncorrectable error or memory logging limit reached follow the suggested actions for that error event and restart the server 3 Check the IBM support website for an applicable retain tip or firmwa...

Page 362: ...ow the suggested actions for that error event and restart the server 3 Check the IBM support website for an applicable retain tip or firmware update that applies to this memory event If no memory fault is recorded in the logs and no DIMM connector error LED is lit you can re enable the DIMM through the Setup utility or the Advanced Settings Utility ASU 806f040c 2003ffff PhysicalMemoryElementName D...

Page 363: ...806f040c2004ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0131 User Response 1 Make sure the DIMM is installed correctly 2 If the DIMM was disabled because of a memory fault memory uncorrectable error or memory logging limit reached follow the suggested actions for that error event and restart the server 3 ...

Page 364: ...mory event If no memory fault is recorded in the logs and no DIMM connector error LED is lit you can re enable the DIMM through the Setup utility or the Advanced Settings Utility ASU 806f040c 2006ffff PhysicalMemoryElementName Disabled on Subsystem MemoryElementName DIMM 6 This message is for the use case when an implementation has detected that Memory has been Disabled May also be shown as 806f04...

Page 365: ...tify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0131 User Response 1 Make sure the DIMM is installed correctly 2 If the DIMM was disabled because of a memory fault memory uncorrectable error or memory logging limit reached follow the suggested actions for that error event and restart the server 3 Check the IBM support website for an applicable retain tip or ...

Page 366: ...oryElementName All DIMMS This message is for the use case when an implementation has detected that Memory has been Disabled May also be shown as 806f040c2581ffff or 0x806f040c2581ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0131 User Response 1 Make sure the DIMM is installed correctly 2 If the DIMM was di...

Page 367: ...are Important Some cluster solutions require specific code levels or coordinated code updates If the device is part of a cluster solution verify that the latest level of code is supported for the cluster solution before you update the code 4 Remove both adapters 5 Replace the PCIe adapters 6 Replace the riser card 806f0507 0301ffff ProcessorElementName has a Configuration Mismatch CPU 1 This messa...

Page 368: ...r the use case when an implementation has detected a Processor Configuration Mismatch has occurred May also be shown as 806f05070302ffff or 0x806f05070302ffff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical CPU SNMP Trap ID 40 CIM Information Prefix PLAT ID 0062 User Response 1 Check the CPU LED See more information about the CPU LED in Light path diagnostics...

Page 369: ...nated code updates If the device is part of a cluster solution verify that the latest level of code is supported for the cluster solution before you update the code 3 Make sure that the installed microprocessors are compatible with each other 4 Trained technician only Reseat microprocessor n 5 Trained technician only Replace microprocessor n n microprocessor number 806f050c 2001ffff Memory Logging...

Page 370: ... for any damaged pins If a damage is found replace the system board 6 Trained technician only Replace the affected microprocessor 806f050c 2002ffff Memory Logging Limit Reached for PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 2 This message is for the use case when an implementation has detected that the Memory Logging Limit has been Reached May also be shown as 806f050c2002ffff o...

Page 371: ...support Yes Alert Category Warning Memory SNMP Trap ID 43 CIM Information Prefix PLAT ID 0144 User Response 1 Check the IBM support website for an applicable retain tip or firmware update that applies to this memory error 2 Swap the affected DIMMs as indicated by the error LEDs on the system board or the event logs to a different memory channel or microprocessor 3 If the error still occurs on the ...

Page 372: ...the affected DIMM 4 Trained technician only If the problem occurs on the same DIMM connector check the DIMM connector If the connector contains any foreign material or is damaged replace the system board 5 Trained technician only Remove the affected microprocessor and check the microprocessor socket pins for any damaged pins If a damage is found replace the system board 6 Trained technician only R...

Page 373: ...Remove the affected microprocessor and check the microprocessor socket pins for any damaged pins If a damage is found replace the system board 6 Trained technician only Replace the affected microprocessor 806f050c 2006ffff Memory Logging Limit Reached for PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 6 This message is for the use case when an implementation has detected that the Me...

Page 374: ...it has been Reached May also be shown as 806f050c2007ffff or 0x806f050c2007ffff Severity Warning Serviceable Yes Automatically notify support Yes Alert Category Warning Memory SNMP Trap ID 43 CIM Information Prefix PLAT ID 0144 User Response 1 Check the IBM support website for an applicable retain tip or firmware update that applies to this memory error 2 Swap the affected DIMMs as indicated by th...

Page 375: ... microprocessor 3 If the error still occurs on the same DIMM replace the affected DIMM 4 Trained technician only If the problem occurs on the same DIMM connector check the DIMM connector If the connector contains any foreign material or is damaged replace the system board 5 Trained technician only Remove the affected microprocessor and check the microprocessor socket pins for any damaged pins If a...

Page 376: ...rained technician only Remove the affected microprocessor and check the microprocessor socket pins for any damaged pins If a damage is found replace the system board 6 Trained technician only Replace the affected microprocessor 806f050d 0401ffff Array ComputerSystemElementName is in critical condition Computer HDD0 This message is for the use case when an implementation has detected that an Array ...

Page 377: ... Information Prefix PLAT ID 0174 User Response 1 Make sure that the RAID adapter firmware and hard disk drive firmware are at the latest level 2 Make sure that the SAS cable is connected correctly 3 Replace the SAS cable 4 Check backplane cable connection 5 Replace the RAID adapter 6 Replace the hard disk drive that is indicated by a lit status LED 806f050d 0403ffff Array ComputerSystemElementName...

Page 378: ...ter HDD3 This message is for the use case when an implementation has detected that an Array is Critical May also be shown as 806f050d0404ffff or 0x806f050d0404ffff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0174 User Response 1 Make sure that the RAID adapter firmware and hard disk drive firmw...

Page 379: ...rive firmware are at the latest level 2 Make sure that the SAS cable is connected correctly 3 Replace the SAS cable 4 Check backplane cable connection 5 Replace the RAID adapter 6 Replace the hard disk drive that is indicated by a lit status LED 806f050d 0406ffff Array ComputerSystemElementName is in critical condition 1U Storage HDD1 This message is for the use case when an implementation has det...

Page 380: ...ected that an Array is Critical May also be shown as 806f050d0407ffff or 0x806f050d0407ffff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0174 User Response 1 Make sure that the RAID adapter firmware and hard disk drive firmware are at the latest level 2 Make sure that the SAS cable is connected ...

Page 381: ...the SAS cable is connected correctly 3 Replace the SAS cable 4 Check backplane cable connection 5 Replace the RAID adapter 6 Replace the hard disk drive that is indicated by a lit status LED 806f050d 0409ffff Array ComputerSystemElementName is in critical condition 1U Storage HDD4 This message is for the use case when an implementation has detected that an Array is Critical May also be shown as 80...

Page 382: ...ffff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0174 User Response 1 Make sure that the RAID adapter firmware and hard disk drive firmware are at the latest level 2 Make sure that the SAS cable is connected correctly 3 Replace the SAS cable 4 Check backplane cable connection 5 Replace the RAID...

Page 383: ...n 5 Replace the RAID adapter 6 Replace the hard disk drive that is indicated by a lit status LED 806f050d 040cffff Array ComputerSystemElementName is in critical condition 1U Storage HDD7 This message is for the use case when an implementation has detected that an Array is Critical May also be shown as 806f050d040cffff or 0x806f050d040cffff Severity Error Serviceable Yes Automatically notify suppo...

Page 384: ...eck the PCI LED 2 Reseat the affected adapters and riser card 3 Update the server firmware UEFI and IMM and adapter firmware Important Some cluster solutions require specific code levels or coordinated code updates If the device is part of a cluster solution verify that the latest level of code is supported for the cluster solution before you update the code 4 Make sure that the adapter is support...

Page 385: ...ce is part of a cluster solution verify that the latest level of code is supported for the cluster solution before you update the code 4 Remove components one at a time restarting the server each time to see if the problem goes away 5 If the problem remains trained service technician replace the system board 806f0607 0301ffff An SM BIOS Uncorrectable CPU complex error for ProcessorElementName has ...

Page 386: ...rity Error Serviceable Yes Automatically notify support No Alert Category Critical CPU SNMP Trap ID 40 CIM Information Prefix PLAT ID 0816 User Response 1 Make sure that the installed microprocessors are compatible with each other see Installing a microprocessor and heat sink for information about microprocessor requirements 2 Update the server firmware to the latest level see Updating the firmwar...

Page 387: ...mentName has failed Computer HDD0 This message is for the use case when an implementation has detected that an Array Failed May also be shown as 806f060d0401ffff or 0x806f060d0401ffff Severity Error Serviceable Yes Automatically notify support Yes Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0176 User Response 1 Make sure that the RAID adapter firmware and ...

Page 388: ...k drive firmware are at the latest level 2 Make sure that the SAS cable is connected correctly 3 Replace the SAS cable 4 Replace the RAID adapter 5 Replace the hard disk drive that is indicated by a lit status LED 806f060d 0403ffff Array ComputerSystemElementName has failed Computer HDD2 This message is for the use case when an implementation has detected that an Array Failed May also be shown as ...

Page 389: ...le Yes Automatically notify support Yes Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0176 User Response 1 Make sure that the RAID adapter firmware and hard disk drive firmware are at the latest level 2 Make sure that the SAS cable is connected correctly 3 Replace the SAS cable 4 Replace the RAID adapter 5 Replace the hard disk drive that is indicated by a l...

Page 390: ...ailed 1U Storage HDD1 This message is for the use case when an implementation has detected that an Array Failed May also be shown as 806f060d0406ffff or 0x806f060d0406ffff Severity Error Serviceable Yes Automatically notify support Yes Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0176 User Response 1 Make sure that the RAID adapter firmware and hard disk dr...

Page 391: ...rd disk drive firmware are at the latest level 2 Make sure that the SAS cable is connected correctly 3 Replace the SAS cable 4 Replace the RAID adapter 5 Replace the hard disk drive that is indicated by a lit status LED 806f060d 0408ffff Array ComputerSystemElementName has failed 1U Storage HDD3 This message is for the use case when an implementation has detected that an Array Failed May also be s...

Page 392: ...erity Error Serviceable Yes Automatically notify support Yes Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0176 User Response 1 Make sure that the RAID adapter firmware and hard disk drive firmware are at the latest level 2 Make sure that the SAS cable is connected correctly 3 Replace the SAS cable 4 Replace the RAID adapter 5 Replace the hard disk drive tha...

Page 393: ... failed 1U Storage HDD6 This message is for the use case when an implementation has detected that an Array Failed May also be shown as 806f060d040bffff or 0x806f060d040bffff Severity Error Serviceable Yes Automatically notify support Yes Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0176 User Response 1 Make sure that the RAID adapter firmware and hard disk ...

Page 394: ...firmware are at the latest level 2 Make sure that the SAS cable is connected correctly 3 Replace the SAS cable 4 Replace the RAID adapter 5 Replace the hard disk drive that is indicated by a lit status LED 806f070c 2001ffff Configuration Error for PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 1 This message is for the use case when an implementation has detected a Memory DIMM confi...

Page 395: ...omatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0126 User Response Make sure that DIMMs are installed in the correct sequence and have the same size type speed and technology 806f070c 2003ffff Configuration Error for PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 3 This message is for the use case when an implementation has ...

Page 396: ...ty Error Serviceable Yes Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0126 User Response Make sure that DIMMs are installed in the correct sequence and have the same size type speed and technology 806f070c 2005ffff Configuration Error for PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 5 This message is for the use case...

Page 397: ...r 0x806f070c2006ffff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0126 User Response Make sure that DIMMs are installed in the correct sequence and have the same size type speed and technology 806f070c 2007ffff Configuration Error for PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 7 This ...

Page 398: ...shown as 806f070c2008ffff or 0x806f070c2008ffff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0126 User Response Make sure that DIMMs are installed in the correct sequence and have the same size type speed and technology 806f070c 2581ffff Configuration Error for PhysicalMemoryElementName on Subsystem Mem...

Page 399: ...on has detected that an Array Rebuild is in Progress May also be shown as 806f070d0401ffff or 0x806f070d0401ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0178 User Response No action information only 806f070d 0402ffff Rebuild in progress for Array in system ComputerSystemElementName Computer HDD1 This messa...

Page 400: ...in Progress May also be shown as 806f070d0403ffff or 0x806f070d0403ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0178 User Response No action information only 806f070d 0404ffff Rebuild in progress for Array in system ComputerSystemElementName Computer HDD3 This message is for the use case when an implementa...

Page 401: ...rogress May also be shown as 806f070d0405ffff or 0x806f070d0405ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0178 User Response No action information only 806f070d 0406ffff Rebuild in progress for Array in system ComputerSystemElementName 1U Storage HDD1 This message is for the use case when an implementati...

Page 402: ...in Progress May also be shown as 806f070d0407ffff or 0x806f070d0407ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0178 User Response No action information only 806f070d 0408ffff Rebuild in progress for Array in system ComputerSystemElementName 1U Storage HDD3 This message is for the use case when an implemen...

Page 403: ...rogress May also be shown as 806f070d0409ffff or 0x806f070d0409ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0178 User Response No action information only 806f070d 040affff Rebuild in progress for Array in system ComputerSystemElementName 1U Storage HDD5 This message is for the use case when an implementati...

Page 404: ...in Progress May also be shown as 806f070d040bffff or 0x806f070d040bffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0178 User Response No action information only 806f070d 040cffff Rebuild in progress for Array in system ComputerSystemElementName 1U Storage HDD7 This message is for the use case when an implemen...

Page 405: ...mware Change May also be shown as 806f072b2101ffff or 0x806f072b2101ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0450 User Response No action information only 806f072b 2201ffff A successful software or firmware change was detected on system ComputerSystemElementName Bkup Auto Update This message is for the...

Page 406: ...en Disabled May also be shown as 806f08070301ffff or 0x806f08070301ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0061 User Response No action information only 806f0807 0302ffff ProcessorElementName has been Disabled CPU 2 This message is for the use case when an implementation has detected a Processor has b...

Page 407: ... or 0x806f08072584ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0061 User Response No action information only One of the CPUs 806f0813 2581ffff A Uncorrectable Bus Error has occurred on system ComputerSystemElementName DIMMs This message is for the use case when an implementation has detected a Bus Uncorrec...

Page 408: ...e code 5 Make sure that the installed DIMMs are supported and configured correctly 6 Trained technician only Replace the system board 806f0813 2582ffff A Uncorrectable Bus Error has occurred on system ComputerSystemElementName PCIs This message is for the use case when an implementation has detected a Bus Uncorrectable Error May also be shown as 806f08132582ffff or 0x806f08132582ffff Severity Erro...

Page 409: ... Prefix PLAT ID 0240 User Response 1 Check the system event log 2 Trained technician only Remove the failing microprocessor from the system board see Removing a microprocessor and heat sink 3 Check for a server firmware update Important Some cluster solutions require specific code levels or coordinated code updates If the device is part of a cluster solution verify that the latest level of code is...

Page 410: ...6f090c2001ffff or 0x806f090c2001ffff Severity Warning Serviceable Yes Automatically notify support No Alert Category System Other SNMP Trap ID 22 CIM Information Prefix PLAT ID 0142 User Response 1 Reseat the DIMM and then restart the server 2 Replace DIMM n n DIMM number 806f090c 2002ffff PhysicalMemoryElementName on Subsystem MemoryElementName Throttled DIMM 2 This message is for the use case wh...

Page 411: ...ge is for the use case when an implementation has detected Memory has been Throttled May also be shown as 806f090c2003ffff or 0x806f090c2003ffff Severity Warning Serviceable Yes Automatically notify support No Alert Category System Other SNMP Trap ID 22 CIM Information Prefix PLAT ID 0142 User Response 1 Reseat the DIMM and then restart the server 2 Replace DIMM n n DIMM number 806f090c 2004ffff P...

Page 412: ...lace DIMM n n DIMM number 806f090c 2005ffff PhysicalMemoryElementName on Subsystem MemoryElementName Throttled DIMM 5 This message is for the use case when an implementation has detected Memory has been Throttled May also be shown as 806f090c2005ffff or 0x806f090c2005ffff Severity Warning Serviceable Yes Automatically notify support No Alert Category System Other SNMP Trap ID 22 CIM Information Pr...

Page 413: ...ix PLAT ID 0142 User Response 1 Reseat the DIMM and then restart the server 2 Replace DIMM n n DIMM number 806f090c 2007ffff PhysicalMemoryElementName on Subsystem MemoryElementName Throttled DIMM 7 This message is for the use case when an implementation has detected Memory has been Throttled May also be shown as 806f090c2007ffff or 0x806f090c2007ffff Severity Warning Serviceable Yes Automatically...

Page 414: ...No Alert Category System Other SNMP Trap ID 22 CIM Information Prefix PLAT ID 0142 User Response 1 Reseat the DIMM and then restart the server 2 Replace DIMM n n DIMM number 806f0a07 0301ffff ProcessorElementName is operating in a Degraded State CPU 1 This message is for the use case when an implementation has detected a Processor is running in the Degraded state May also be shown as 806f0a070301f...

Page 415: ...2ffff Severity Warning Serviceable Yes Automatically notify support No Alert Category Warning CPU SNMP Trap ID 42 CIM Information Prefix PLAT ID 0038 User Response 1 Make sure that the fans are operating that there are no obstructions to the airflow front and rear of the server that the air baffles are in place and correctly installed and that the server cover is installed and completely closed 2 ...

Page 416: ...ture is within the specifications 3 If a fan has failed complete the action for a fan failure 4 Replace DIMM n n DIMM number 806f0a0c 2002ffff An Over Temperature Condition has been detected on the PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 2 This message is for the use case when an implementation has detected an Over Temperature Condition for Memory that has been Detected May a...

Page 417: ...y Error Serviceable Yes Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0146 User Response 1 Make sure that the fans are operating that there are no obstructions to the airflow that the air baffles are in place and correctly installed and that the server cover is installed and completely closed 2 Make sure that ambient temperature i...

Page 418: ...f a fan has failed complete the action for a fan failure 4 Replace DIMM n n DIMM number 806f0a0c 2005ffff An Over Temperature Condition has been detected on the PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 5 This message is for the use case when an implementation has detected an Over Temperature Condition for Memory that has been Detected May also be shown as 806f0a0c2005ffff or 0...

Page 419: ...iceable Yes Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0146 User Response 1 Make sure that the fans are operating that there are no obstructions to the airflow that the air baffles are in place and correctly installed and that the server cover is installed and completely closed 2 Make sure that ambient temperature is within the...

Page 420: ...n Over Temperature Condition has been detected on the PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 8 This message is for the use case when an implementation has detected an Over Temperature Condition for Memory that has been Detected May also be shown as 806f0a0c2008ffff or 0x806f0a0c2008ffff Severity Error Serviceable Yes Automatically notify support No Alert Category Critical Te...

Page 421: ...ally notify support No Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0244 User Response 1 Reseat the microprocessor and then restart the server 2 Replace microprocessor n n microprocessor number 806f0a13 0302ffff A Fatal Bus Error has occurred on system ComputerSystemElementName CPU 2 PECI This message is for the use case when an implementation has detected a Bus Fat...

Page 422: ...020701ffff or 0x810100020701ffff Severity Info Serviceable No Automatically notify support No Alert Category Warning Voltage SNMP Trap ID 13 CIM Information Prefix PLAT ID 0477 User Response No action information only 81010202 0701ffff Numeric sensor NumericSensorElementName going low lower critical has deasserted CMOS Battery This message is for the use case when an implementation has detected a ...

Page 423: ... or 0x810107010701ffff Severity Info Serviceable No Automatically notify support No Alert Category Warning Temperature SNMP Trap ID 12 CIM Information Prefix PLAT ID 0491 User Response No action information only 81010701 0702ffff Numeric sensor NumericSensorElementName going high upper non critical has deasserted DIMM AB Temp This message is for the use case when an implementation has detected an ...

Page 424: ...703ffff or 0x810107010703ffff Severity Info Serviceable No Automatically notify support No Alert Category Warning Temperature SNMP Trap ID 12 CIM Information Prefix PLAT ID 0491 User Response No action information only 81010701 0704ffff Numeric sensor NumericSensorElementName going high upper non critical has deasserted HDD Inlet Temp This message is for the use case when an implementation has det...

Page 425: ... 810107011001ffff or 0x810107011001ffff Severity Info Serviceable No Automatically notify support No Alert Category Warning Temperature SNMP Trap ID 12 CIM Information Prefix PLAT ID 0491 User Response No action information only 81010701 1002ffff Numeric sensor NumericSensorElementName going high upper non critical has deasserted PCI Riser 2 Temp This message is for the use case when an implementa...

Page 426: ...o be shown as 810107011501ffff or 0x810107011501ffff Severity Info Serviceable No Automatically notify support No Alert Category Warning Temperature SNMP Trap ID 12 CIM Information Prefix PLAT ID 0491 User Response No action information only 81010701 1502ffff Numeric sensor NumericSensorElementName going high upper non critical has deasserted GPU Outlet Temp This message is for the use case when a...

Page 427: ... also be shown as 810107011a01ffff or 0x810107011a01ffff Severity Info Serviceable No Automatically notify support No Alert Category Warning Temperature SNMP Trap ID 12 CIM Information Prefix PLAT ID 0491 User Response No action information only 81010701 2c01ffff Numeric sensor NumericSensorElementName going high upper non critical has deasserted Mezz Card Temp This message is for the use case whe...

Page 428: ...easserted May also be shown as 810107012d01ffff or 0x810107012d01ffff Severity Info Serviceable No Automatically notify support No Alert Category Warning Temperature SNMP Trap ID 12 CIM Information Prefix PLAT ID 0491 User Response No action information only 81010901 0701ffff Numeric sensor NumericSensorElementName going high upper critical has deasserted Ambient Temp This message is for the use c...

Page 429: ...erted May also be shown as 810109010702ffff or 0x810109010702ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0495 User Response No action information only 81010901 0703ffff Numeric sensor NumericSensorElementName going high upper critical has deasserted CPU1 VR Temp VCO This message is for the use c...

Page 430: ...deasserted May also be shown as 810109010704ffff or 0x810109010704ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0495 User Response No action information only 81010901 1001ffff Numeric sensor NumericSensorElementName going high upper critical has deasserted PCI Riser 1 Temp This message is for the ...

Page 431: ...sserted May also be shown as 810109011002ffff or 0x810109011002ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0495 User Response No action information only 81010901 1501ffff Numeric sensor NumericSensorElementName going high upper critical has deasserted PIB Ambient Temp This message is for the use...

Page 432: ... deasserted May also be shown as 810109011502ffff or 0x810109011502ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0495 User Response No action information only 81010901 1a01ffff Numeric sensor NumericSensorElementName going high upper critical has deasserted HDD Outlet Temp This message is for the ...

Page 433: ...deasserted May also be shown as 810109012c01ffff or 0x810109012c01ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0495 User Response No action information only 81010901 2d01ffff Numeric sensor NumericSensorElementName going high upper critical has deasserted PCH Temp This message is for the use case...

Page 434: ...lso be shown as 810109020701ffff or 0x810109020701ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Voltage SNMP Trap ID 1 CIM Information Prefix PLAT ID 0495 User Response No action information only SysBrd 3 3V SysBrd 5V 81010b01 0701ffff Numeric sensor NumericSensorElementName going high upper non recoverable has deasserted Ambient Temp This message is for...

Page 435: ...erted May also be shown as 81010b010702ffff or 0x81010b010702ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0499 User Response No action information only 81010b01 0703ffff Numeric sensor NumericSensorElementName going high upper non recoverable has deasserted CPU1 VR Temp VCO This message is for th...

Page 436: ...deasserted May also be shown as 81010b010704ffff or 0x81010b010704ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0499 User Response No action information only 81010b01 1001ffff Numeric sensor NumericSensorElementName going high upper non recoverable has deasserted PCI Riser 1 Temp This message is f...

Page 437: ...sserted May also be shown as 81010b011002ffff or 0x81010b011002ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0499 User Response No action information only 81010b01 1501ffff Numeric sensor NumericSensorElementName going high upper non recoverable has deasserted PIB Ambient Temp This message is for ...

Page 438: ... deasserted May also be shown as 81010b011502ffff or 0x81010b011502ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0499 User Response No action information only 81010b01 1a01ffff Numeric sensor NumericSensorElementName going high upper non recoverable has deasserted HDD Outlet Temp This message is f...

Page 439: ...deasserted May also be shown as 81010b012c01ffff or 0x81010b012c01ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0499 User Response No action information only 81010b01 2d01ffff Numeric sensor NumericSensorElementName going high upper non recoverable has deasserted PCH Temp This message is for the u...

Page 440: ...sserted May also be shown as 810300062101ffff or 0x810300062101ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0508 User Response No action information only 81030012 0601ffff Sensor SensorElementName has asserted SMM Mode SMM Monitor This message is for the use case when an implementation has detected a Senso...

Page 441: ... Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0508 User Response No action information only 8107010f 2201ffff Sensor SensorElementName has deasserted the transition from normal to non critical state GPT Status This message is for the use case when an implementation has detected that a Sensor has deasserted a tr...

Page 442: ...o be shown as 8107010f2582ffff or 0x8107010f2582ffff Severity Info Serviceable No Automatically notify support No Alert Category Warning Other SNMP Trap ID 60 CIM Information Prefix PLAT ID 0521 User Response No action information only 81070128 2e01ffff Sensor SensorElementName has deasserted the transition from normal to non critical state ME Recovery This message is for the use case when an impl...

Page 443: ...o be shown as 810702010301ffff or 0x810702010301ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0523 User Response No action information only 81070201 0302ffff Sensor SensorElementName has transitioned to a less severe state from critical CPU 2 OverTemp This message is for the use case when an imple...

Page 444: ...ritical May also be shown as 810702011101ffff or 0x810702011101ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0523 User Response No action information only 81070201 1102ffff Sensor SensorElementName has transitioned to a less severe state from critical PCI 2 Temp This message is for the use case wh...

Page 445: ...ritical May also be shown as 810702011103ffff or 0x810702011103ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0523 User Response No action information only 81070201 1104ffff Sensor SensorElementName has transitioned to a less severe state from critical PCI 4 Temp This message is for the use case wh...

Page 446: ...ere from critical May also be shown as 810702020701ffff or 0x810702020701ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Voltage SNMP Trap ID 1 CIM Information Prefix PLAT ID 0523 User Response No action information only 81070202 1501ffff Sensor SensorElementName has transitioned to a less severe state from critical PIB Fault This message is for the use ca...

Page 447: ...ical May also be shown as 810702021502ffff or 0x810702021502ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Voltage SNMP Trap ID 1 CIM Information Prefix PLAT ID 0523 User Response No action information only 8107020f 2201ffff Sensor SensorElementName has transitioned to a less severe state from critical TXT ACM Module This message is for the use case when ...

Page 448: ... from critical May also be shown as 8107020f2582ffff or 0x8107020f2582ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0523 User Response No action information only 81070214 2201ffff Sensor SensorElementName has transitioned to a less severe state from critical TPM Lock This message is for the use case wh...

Page 449: ...tical May also be shown as 810702190701ffff or 0x810702190701ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0523 User Response No action information only 8107021b 0301ffff Sensor SensorElementName has transitioned to a less severe state from critical CPU 1 QPILinkErr This message is for the use case whe...

Page 450: ...from critical May also be shown as 8107021b0302ffff or 0x8107021b0302ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0523 User Response No action information only 81070228 2e01ffff Sensor SensorElementName has transitioned to a less severe state from critical IPMB IO Error This message is for the use cas...

Page 451: ...e has deasserted May also be shown as 810703010301ffff or 0x810703010301ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0525 User Response No action information only 81070301 0302ffff Sensor SensorElementName has deasserted the transition to non recoverable from a less severe state CPU 2 OverTemp Th...

Page 452: ... deasserted May also be shown as 810703011101ffff or 0x810703011101ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0525 User Response No action information only 81070301 1102ffff Sensor SensorElementName has deasserted the transition to non recoverable from a less severe state PCI 2 Temp This messag...

Page 453: ...sserted May also be shown as 810703011103ffff or 0x810703011103ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0525 User Response No action information only 81070301 1104ffff Sensor SensorElementName has deasserted the transition to non recoverable from a less severe state PCI 4 Temp This message is...

Page 454: ...r 0x810b010c2581ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0803 User Response No action information only 810b030c 2581ffff Non redundant Sufficient Resources from Redundancy Degraded or Fully Redundant for RedundancySetElementName has deasserted Backup Memory This message is for the use case when a...

Page 455: ...redundant Insufficient Resources May also be shown as 810b050c2581ffff or 0x810b050c2581ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0811 User Response No action information only 816f0007 0301ffff ProcessorElementName has Recovered from IERR CPU 1 This message is for the use case when an implementati...

Page 456: ...ndition May also be shown as 816f00070302ffff or 0x816f00070302ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical CPU SNMP Trap ID 40 CIM Information Prefix PLAT ID 0043 User Response No action information only 816f0009 1301ffff PowerSupplyElementName has been turned on Host Power This message is for the use case when an implementation has detected a Power Un...

Page 457: ...f2201ffff or 0x816f000f2201ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0185 User Response No action information only Firmware Error Sys Boot Status 816f0013 1701ffff System ComputerSystemElementName has recovered from a diagnostic interrupt NMI State This message is for the use case when an implement...

Page 458: ... removed May also be shown as 816f00212201ffff or 0x816f00212201ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0331 User Response No action information only 816f0021 2582ffff Fault condition removed on slot PhysicalConnectorElementName on system ComputerSystemElementName All PCI Error This message is fo...

Page 459: ...t has been removed May also be shown as 816f00212c01ffff or 0x816f00212c01ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0331 User Response No action information only 816f0021 3001ffff Fault condition removed on slot PhysicalConnectorElementName on system ComputerSystemElementName PCI 1 This message is ...

Page 460: ...een removed May also be shown as 816f00213002ffff or 0x816f00213002ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0331 User Response No action information only 816f0021 3003ffff Fault condition removed on slot PhysicalConnectorElementName on system ComputerSystemElementName PCI 3 This message is for the...

Page 461: ...May also be shown as 816f00213004ffff or 0x816f00213004ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0331 User Response No action information only 816f0028 2101ffff Sensor SensorElementName has returned to normal on management system ComputerSystemElementName TPM Cmd Failures This message is for the us...

Page 462: ...rocessor May also be shown as 816f01070301ffff or 0x816f01070301ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0037 User Response No action information only 816f0107 0302ffff An Over Temperature Condition has been removed on ProcessorElementName CPU 2 This message is for the use case when an implem...

Page 463: ... recovery May also be shown as 816f010c2001ffff or 0x816f010c2001ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0139 User Response No action information only 816f010c 2002ffff Uncorrectable error recovery detected for PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 2 This message is for t...

Page 464: ...or recovery May also be shown as 816f010c2003ffff or 0x816f010c2003ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0139 User Response No action information only 816f010c 2004ffff Uncorrectable error recovery detected for PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 4 This message is for...

Page 465: ...ecovery May also be shown as 816f010c2005ffff or 0x816f010c2005ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0139 User Response No action information only 816f010c 2006ffff Uncorrectable error recovery detected for PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 6 This message is for the...

Page 466: ...or recovery May also be shown as 816f010c2007ffff or 0x816f010c2007ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0139 User Response No action information only 816f010c 2008ffff Uncorrectable error recovery detected for PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 8 This message is for...

Page 467: ...rrectable error recovery May also be shown as 816f010c2581ffff or 0x816f010c2581ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0139 User Response No action information only One of the DIMMs 816f010d 0401ffff The Drive StorageVolumeElementName has been enabled Computer HDD0 This message is for the use c...

Page 468: ...lso be shown as 816f010d0402ffff or 0x816f010d0402ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0167 User Response No action information only 816f010d 0403ffff The Drive StorageVolumeElementName has been enabled Computer HDD2 This message is for the use case when an implementation has detected...

Page 469: ...ff or 0x816f010d0404ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0167 User Response No action information only 816f010d 0405ffff The Drive StorageVolumeElementName has been enabled 1U Storage HDD0 This message is for the use case when an implementation has detected a Drive was Enabled May als...

Page 470: ...ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0167 User Response No action information only 816f010d 0407ffff The Drive StorageVolumeElementName has been enabled 1U Storage HDD2 This message is for the use case when an implementation has detected a Drive was Enabled May also be shown as 816f01...

Page 471: ...o Automatically notify support No Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0167 User Response No action information only 816f010d 0409ffff The Drive StorageVolumeElementName has been enabled 1U Storage HDD4 This message is for the use case when an implementation has detected a Drive was Enabled May also be shown as 816f010d0409ffff or 0x816f010d0409ffff...

Page 472: ...upport No Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0167 User Response No action information only 816f010d 040bffff The Drive StorageVolumeElementName has been enabled 1U Storage HDD6 This message is for the use case when an implementation has detected a Drive was Enabled May also be shown as 816f010d040bffff or 0x816f010d040bffff Severity Info Serviceab...

Page 473: ... Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0167 User Response No action information only 816f010f 2201ffff The System ComputerSystemElementName has recovered from a firmware hang Firmware Error This message is for the use case when an implementation has recovered from a System Firmware Hang May also be shown as 816f010f2201ffff or 0x816f010f2201ffff Severity Info Serviceable No...

Page 474: ... see Removing a microprocessor and heat sink and Replacing a microprocessor and heat sink 2 If the problem persists and there is no other CPU with the same error indication replace the system board 3 Trained technician only Replace the system board see Removing the system board and Installing the system board n microprocessor number 816f0113 0302ffff System ComputerSystemElementName has recovered ...

Page 475: ...01ffff ManagedElementName detected as present PCI Riser 1 This message is for the use case when an implementation has detected a Managed Element is now Present May also be shown as 816f01251001ffff or 0x816f01251001ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0390 User Response No action information only 8...

Page 476: ...Present May also be shown as 816f01251f01ffff or 0x816f01251f01ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0390 User Response No action information only 816f0125 2c01ffff ManagedElementName detected as present Mezz Card This message is for the use case when an implementation has detected a Managed Element...

Page 477: ...lso be shown as 816f02070301ffff or 0x816f02070301ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical CPU SNMP Trap ID 40 CIM Information Prefix PLAT ID 0045 User Response No action information only 816f0207 0302ffff ProcessorElementName has Recovered from FRB1 BIST condition CPU 2 This message is for the use case when an implementation has detected a Processo...

Page 478: ...584ffff or 0x816f02072584ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical CPU SNMP Trap ID 40 CIM Information Prefix PLAT ID 0045 User Response No action information only One of the CPUs 816f020d 0401ffff Failure no longer Predicted on drive StorageVolumeElementName for array ComputerSystemElementName Computer HDD0 This message is for the use case when an i...

Page 479: ...also be shown as 816f020d0402ffff or 0x816f020d0402ffff Severity Info Serviceable No Automatically notify support No Alert Category System Predicted Failure SNMP Trap ID 27 CIM Information Prefix PLAT ID 0169 User Response No action information only 816f020d 0403ffff Failure no longer Predicted on drive StorageVolumeElementName for array ComputerSystemElementName Computer HDD2 This message is for ...

Page 480: ...dicted May also be shown as 816f020d0404ffff or 0x816f020d0404ffff Severity Info Serviceable No Automatically notify support No Alert Category System Predicted Failure SNMP Trap ID 27 CIM Information Prefix PLAT ID 0169 User Response No action information only 816f020d 0405ffff Failure no longer Predicted on drive StorageVolumeElementName for array ComputerSystemElementName 1U Storage HDD0 This me...

Page 481: ...edicted May also be shown as 816f020d0406ffff or 0x816f020d0406ffff Severity Info Serviceable No Automatically notify support No Alert Category System Predicted Failure SNMP Trap ID 27 CIM Information Prefix PLAT ID 0169 User Response No action information only 816f020d 0407ffff Failure no longer Predicted on drive StorageVolumeElementName for array ComputerSystemElementName 1U Storage HDD2 This m...

Page 482: ...r Predicted May also be shown as 816f020d0408ffff or 0x816f020d0408ffff Severity Info Serviceable No Automatically notify support No Alert Category System Predicted Failure SNMP Trap ID 27 CIM Information Prefix PLAT ID 0169 User Response No action information only 816f020d 0409ffff Failure no longer Predicted on drive StorageVolumeElementName for array ComputerSystemElementName 1U Storage HDD4 Th...

Page 483: ...edicted May also be shown as 816f020d040affff or 0x816f020d040affff Severity Info Serviceable No Automatically notify support No Alert Category System Predicted Failure SNMP Trap ID 27 CIM Information Prefix PLAT ID 0169 User Response No action information only 816f020d 040bffff Failure no longer Predicted on drive StorageVolumeElementName for array ComputerSystemElementName 1U Storage HDD6 This m...

Page 484: ...e is no longer Predicted May also be shown as 816f020d040cffff or 0x816f020d040cffff Severity Info Serviceable No Automatically notify support No Alert Category System Predicted Failure SNMP Trap ID 27 CIM Information Prefix PLAT ID 0169 User Response No action information only 816f030c 2001ffff Scrub Failure for PhysicalMemoryElementName on Subsystem MemoryElementName has recovered DIMM 1 This me...

Page 485: ...ry May also be shown as 816f030c2002ffff or 0x816f030c2002ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0137 User Response No action information only 816f030c 2003ffff Scrub Failure for PhysicalMemoryElementName on Subsystem MemoryElementName has recovered DIMM 3 This message is for the use case when ...

Page 486: ... May also be shown as 816f030c2004ffff or 0x816f030c2004ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0137 User Response No action information only 816f030c 2005ffff Scrub Failure for PhysicalMemoryElementName on Subsystem MemoryElementName has recovered DIMM 5 This message is for the use case when an...

Page 487: ...shown as 816f030c2006ffff or 0x816f030c2006ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0137 User Response No action information only 816f030c 2007ffff Scrub Failure for PhysicalMemoryElementName on Subsystem MemoryElementName has recovered DIMM 7 This message is for the use case when an implementati...

Page 488: ...also be shown as 816f030c2008ffff or 0x816f030c2008ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0137 User Response No action information only 816f0313 1701ffff System ComputerSystemElementName has recovered from an NMI NMI State This message is for the use case when an implementation has detected a S...

Page 489: ... as 816f040c2001ffff or 0x816f040c2001ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0130 User Response No action information only 816f040c 2002ffff PhysicalMemoryElementName Enabled on Subsystem MemoryElementName DIMM 2 This message is for the use case when an implementation has detected that Memory has bee...

Page 490: ...03ffff or 0x816f040c2003ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0130 User Response No action information only 816f040c 2004ffff PhysicalMemoryElementName Enabled on Subsystem MemoryElementName DIMM 4 This message is for the use case when an implementation has detected that Memory has been Enabled May ...

Page 491: ...005ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0130 User Response No action information only 816f040c 2006ffff PhysicalMemoryElementName Enabled on Subsystem MemoryElementName DIMM 6 This message is for the use case when an implementation has detected that Memory has been Enabled May also be shown as 816f...

Page 492: ...fff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0130 User Response No action information only 816f040c 2008ffff PhysicalMemoryElementName Enabled on Subsystem MemoryElementName DIMM 8 This message is for the use case when an implementation has detected that Memory has been Enabled May also be shown as 816f040c...

Page 493: ...eable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0130 User Response No action information only One of the DIMMs 816f0413 2582ffff A PCI PERR recovery has occurred on system ComputerSystemElementName PCIs This message is for the use case when an implementation has detected a PCI PERR recovered May also be shown as 816f04132582ffff or 0...

Page 494: ...nfo Serviceable No Automatically notify support No Alert Category Critical CPU SNMP Trap ID 40 CIM Information Prefix PLAT ID 0063 User Response No action information only 816f0507 0302ffff ProcessorElementName has Recovered from a Configuration Mismatch CPU 2 This message is for the use case when an implementation has Recovered from a Processor Configuration Mismatch May also be shown as 816f0507...

Page 495: ...le No Automatically notify support No Alert Category Critical CPU SNMP Trap ID 40 CIM Information Prefix PLAT ID 0063 User Response No action information only One of the CPUs 816f050c 2001ffff Memory Logging Limit Removed for PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 1 This message is for the use case when an implementation has detected that the Memory Logging Limit has been Re...

Page 496: ...050c2002ffff Severity Info Serviceable No Automatically notify support No Alert Category Warning Memory SNMP Trap ID 43 CIM Information Prefix PLAT ID 0145 User Response No action information only 816f050c 2003ffff Memory Logging Limit Removed for PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 3 This message is for the use case when an implementation has detected that the Memory Log...

Page 497: ...f or 0x816f050c2004ffff Severity Info Serviceable No Automatically notify support No Alert Category Warning Memory SNMP Trap ID 43 CIM Information Prefix PLAT ID 0145 User Response No action information only 816f050c 2005ffff Memory Logging Limit Removed for PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 5 This message is for the use case when an implementation has detected that the...

Page 498: ...wn as 816f050c2006ffff or 0x816f050c2006ffff Severity Info Serviceable No Automatically notify support No Alert Category Warning Memory SNMP Trap ID 43 CIM Information Prefix PLAT ID 0145 User Response No action information only 816f050c 2007ffff Memory Logging Limit Removed for PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 7 This message is for the use case when an implementation ...

Page 499: ...own as 816f050c2008ffff or 0x816f050c2008ffff Severity Info Serviceable No Automatically notify support No Alert Category Warning Memory SNMP Trap ID 43 CIM Information Prefix PLAT ID 0145 User Response No action information only 816f050c 2581ffff Memory Logging Limit Removed for PhysicalMemoryElementName on Subsystem MemoryElementName All DIMMS This message is for the use case when an implementat...

Page 500: ...rted May also be shown as 816f050d0401ffff or 0x816f050d0401ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0175 User Response No action information only 816f050d 0402ffff Critical Array ComputerSystemElementName has deasserted Computer HDD1 This message is for the use case when an implementatio...

Page 501: ...ed May also be shown as 816f050d0403ffff or 0x816f050d0403ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0175 User Response No action information only 816f050d 0404ffff Critical Array ComputerSystemElementName has deasserted Computer HDD3 This message is for the use case when an implementation ...

Page 502: ... deasserted May also be shown as 816f050d0405ffff or 0x816f050d0405ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0175 User Response No action information only 816f050d 0406ffff Critical Array ComputerSystemElementName has deasserted 1U Storage HDD1 This message is for the use case when an impl...

Page 503: ...sserted May also be shown as 816f050d0407ffff or 0x816f050d0407ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0175 User Response No action information only 816f050d 0408ffff Critical Array ComputerSystemElementName has deasserted 1U Storage HDD3 This message is for the use case when an implemen...

Page 504: ... deasserted May also be shown as 816f050d0409ffff or 0x816f050d0409ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0175 User Response No action information only 816f050d 040affff Critical Array ComputerSystemElementName has deasserted 1U Storage HDD5 This message is for the use case when an impl...

Page 505: ...sserted May also be shown as 816f050d040bffff or 0x816f050d040bffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0175 User Response No action information only 816f050d 040cffff Critical Array ComputerSystemElementName has deasserted 1U Storage HDD7 This message is for the use case when an implemen...

Page 506: ...s deasserted May also be shown as 816f06070301ffff or 0x816f06070301ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical CPU SNMP Trap ID 40 CIM Information Prefix PLAT ID 0817 User Response No action information only 816f0607 0302ffff An SM BIOS Uncorrectable CPU complex error for ProcessorElementName has deasserted CPU 2 This message is for the use case when ...

Page 507: ...so be shown as 816f06072584ffff or 0x816f06072584ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical CPU SNMP Trap ID 40 CIM Information Prefix PLAT ID 0817 User Response No action information only One of the CPUs 816f060d 0401ffff Array in system ComputerSystemElementName has been restored Computer HDD0 This message is for the use case when an implementation ...

Page 508: ...estored May also be shown as 816f060d0402ffff or 0x816f060d0402ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0177 User Response No action information only 816f060d 0403ffff Array in system ComputerSystemElementName has been restored Computer HDD2 This message is for the use case when an implem...

Page 509: ...stored May also be shown as 816f060d0404ffff or 0x816f060d0404ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0177 User Response No action information only 816f060d 0405ffff Array in system ComputerSystemElementName has been restored 1U Storage HDD0 This message is for the use case when an imple...

Page 510: ...en Restored May also be shown as 816f060d0406ffff or 0x816f060d0406ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0177 User Response No action information only 816f060d 0407ffff Array in system ComputerSystemElementName has been restored 1U Storage HDD2 This message is for the use case when an ...

Page 511: ...estored May also be shown as 816f060d0408ffff or 0x816f060d0408ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0177 User Response No action information only 816f060d 0409ffff Array in system ComputerSystemElementName has been restored 1U Storage HDD4 This message is for the use case when an impl...

Page 512: ...en Restored May also be shown as 816f060d040affff or 0x816f060d040affff Severity Info Serviceable No Automatically notify support No Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0177 User Response No action information only 816f060d 040bffff Array in system ComputerSystemElementName has been restored 1U Storage HDD6 This message is for the use case when an ...

Page 513: ... shown as 816f060d040cffff or 0x816f060d040cffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Hard Disk drive SNMP Trap ID 5 CIM Information Prefix PLAT ID 0177 User Response No action information only 816f070c 2001ffff Configuration error for PhysicalMemoryElementName on Subsystem MemoryElementName has deasserted DIMM 1 This message is for the use case when...

Page 514: ... deasserted May also be shown as 816f070c2002ffff or 0x816f070c2002ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0127 User Response No action information only 816f070c 2003ffff Configuration error for PhysicalMemoryElementName on Subsystem MemoryElementName has deasserted DIMM 3 This message is for th...

Page 515: ...sserted May also be shown as 816f070c2004ffff or 0x816f070c2004ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0127 User Response No action information only 816f070c 2005ffff Configuration error for PhysicalMemoryElementName on Subsystem MemoryElementName has deasserted DIMM 5 This message is for the us...

Page 516: ... deasserted May also be shown as 816f070c2006ffff or 0x816f070c2006ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0127 User Response No action information only 816f070c 2007ffff Configuration error for PhysicalMemoryElementName on Subsystem MemoryElementName has deasserted DIMM 7 This message is for th...

Page 517: ...erted May also be shown as 816f070c2008ffff or 0x816f070c2008ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Memory SNMP Trap ID 41 CIM Information Prefix PLAT ID 0127 User Response No action information only 816f070c 2581ffff Configuration error for PhysicalMemoryElementName on Subsystem MemoryElementName has deasserted All DIMMS This message is for the u...

Page 518: ...y Rebuild has Completed May also be shown as 816f070d0401ffff or 0x816f070d0401ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0179 User Response No action information only 816f070d 0402ffff Rebuild completed for Array in system ComputerSystemElementName Computer HDD1 This message is for the use case when an ...

Page 519: ...mpleted May also be shown as 816f070d0403ffff or 0x816f070d0403ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0179 User Response No action information only 816f070d 0404ffff Rebuild completed for Array in system ComputerSystemElementName Computer HDD3 This message is for the use case when an implementation h...

Page 520: ...s Completed May also be shown as 816f070d0405ffff or 0x816f070d0405ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0179 User Response No action information only 816f070d 0406ffff Rebuild completed for Array in system ComputerSystemElementName 1U Storage HDD1 This message is for the use case when an implementa...

Page 521: ...mpleted May also be shown as 816f070d0407ffff or 0x816f070d0407ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0179 User Response No action information only 816f070d 0408ffff Rebuild completed for Array in system ComputerSystemElementName 1U Storage HDD3 This message is for the use case when an implementation...

Page 522: ...s Completed May also be shown as 816f070d0409ffff or 0x816f070d0409ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0179 User Response No action information only 816f070d 040affff Rebuild completed for Array in system ComputerSystemElementName 1U Storage HDD5 This message is for the use case when an implementa...

Page 523: ...mpleted May also be shown as 816f070d040bffff or 0x816f070d040bffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0179 User Response No action information only 816f070d 040cffff Rebuild completed for Array in system ComputerSystemElementName 1U Storage HDD7 This message is for the use case when an implementation...

Page 524: ...bled May also be shown as 816f08070301ffff or 0x816f08070301ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0060 User Response No action information only 816f0807 0302ffff ProcessorElementName has been Enabled CPU 2 This message is for the use case when an implementation has detected a Processor has been Enab...

Page 525: ...ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0060 User Response No action information only One of the CPUs 816f0813 2581ffff System ComputerSystemElementName has recovered from an Uncorrectable Bus Error DIMMs This message is for the use case when an implementation has detected a that a system has recovere...

Page 526: ... be shown as 816f08132582ffff or 0x816f08132582ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0241 User Response No action information only 816f0813 2584ffff System ComputerSystemElementName has recovered from an Uncorrectable Bus Error CPUs This message is for the use case when an implementation has de...

Page 527: ...e shown as 816f090c2001ffff or 0x816f090c2001ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0143 User Response No action information only 816f090c 2002ffff PhysicalMemoryElementName on Subsystem MemoryElementName is no longer Throttled DIMM 2 This message is for the use case when an implementation has detect...

Page 528: ...wn as 816f090c2003ffff or 0x816f090c2003ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0143 User Response No action information only 816f090c 2004ffff PhysicalMemoryElementName on Subsystem MemoryElementName is no longer Throttled DIMM 4 This message is for the use case when an implementation has detected Me...

Page 529: ...f or 0x816f090c2005ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0143 User Response No action information only 816f090c 2006ffff PhysicalMemoryElementName on Subsystem MemoryElementName is no longer Throttled DIMM 6 This message is for the use case when an implementation has detected Memory is no longer Thr...

Page 530: ...090c2007ffff Severity Info Serviceable No Automatically notify support No Alert Category System Other SNMP Trap ID CIM Information Prefix PLAT ID 0143 User Response No action information only 816f090c 2008ffff PhysicalMemoryElementName on Subsystem MemoryElementName is no longer Throttled DIMM 8 This message is for the use case when an implementation has detected Memory is no longer Throttled May ...

Page 531: ...01ffff Severity Info Serviceable No Automatically notify support No Alert Category Warning CPU SNMP Trap ID 42 CIM Information Prefix PLAT ID 0039 User Response No action information only 816f0a07 0302ffff The Processor ProcessorElementName is no longer operating in a Degraded State CPU 2 This message is for the use case when an implementation has detected a Processor is no longer running in the D...

Page 532: ...ff or 0x816f0a0c2001ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0147 User Response No action information only 816f0a0c 2002ffff An Over Temperature Condition has been removed on the PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 2 This message is for the use case when an implement...

Page 533: ... 816f0a0c2003ffff or 0x816f0a0c2003ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0147 User Response No action information only 816f0a0c 2004ffff An Over Temperature Condition has been removed on the PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 4 This message is for the use case wh...

Page 534: ... be shown as 816f0a0c2005ffff or 0x816f0a0c2005ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0147 User Response No action information only 816f0a0c 2006ffff An Over Temperature Condition has been removed on the PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 6 This message is for the...

Page 535: ...also be shown as 816f0a0c2007ffff or 0x816f0a0c2007ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Temperature SNMP Trap ID 0 CIM Information Prefix PLAT ID 0147 User Response No action information only 816f0a0c 2008ffff An Over Temperature Condition has been removed on the PhysicalMemoryElementName on Subsystem MemoryElementName DIMM 8 This message is for...

Page 536: ...0301ffff or 0x816f0a130301ffff Severity Info Serviceable No Automatically notify support No Alert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0245 User Response 1 Trained technician only Replace microprocessor n see Removing a microprocessor and heat sink and Replacing a microprocessor and heat sink 2 If the problem persists and there is no other CPU with the same error ...

Page 537: ...ert Category Critical Other SNMP Trap ID 50 CIM Information Prefix PLAT ID 0245 User Response 1 Trained technician only Replace microprocessor n see Removing a microprocessor and heat sink and Replacing a microprocessor and heat sink 2 If the problem persists and there is no other CPU with the same error indication replace the system board 3 Trained technician only Replace the system board see Rem...

Page 538: ...524 IBM NeXtScale nx360 M4 Installation and Service Guide ...

Page 539: ...oses usually a user action or a change of states that is normal behavior Warning A warning is not as severe as an error but if possible the condition should be corrected before it becomes an error It might also be a condition that requires additional monitoring or maintenance Error An error typically indicates a failure or critical condition that impairs service or an expected function User respon...

Page 540: ...s 1 If this is a newly installed option ensure that matching Processors are installed in the correct Processor sockets according to the service information for this product 2 Check IBM support site for an applicable service bulletin that applies to this Processor error 3 Trained Service technician only Replace Processor Inspect Processor socket and replace the system board first if socket is damag...

Page 541: ...se Complete the following steps 1 Verify that matching DIMMs are installed in the correct population sequence according to the service information for this product Add link to Memory chart Correct any configuration issues found 2 Trained Service technician only Replace associated Processor Inspect Processor socket and replace the system board first if socket is damaged I 18009 I 18009 A core speed...

Page 542: ...or more processor packages Explanation Processors have one or more cache levels with mismatched size Severity Error User Response Complete the following steps 1 Verify that matching processors are installed in the correct processor sockets according to the service information for this product Correct any mismatch found 2 Check IBM support site for an applicable service bulletin or firmware update ...

Page 543: ...s 1 Verify that matching Processors are installed in the correct Processor sockets according to the service information for this product 2 Check IBM support site for an applicable service bulletin or firmware update that applies to this Processor error 3 Trained Service technician only Replace the system board I 1800F I 1800F A processor family mismatch has been detected for one or more processor ...

Page 544: ...r ASU or using adapter manufacturer utilities so that adapter firmware can be updated 3 Move card to a different slot If slot not available or error re occurs replace adapter 4 Trained Service technician only If adapter was moved to a different slot and error did not re occur verify that this is not a system limitation and then replace the system board Also if this is not the initial installation ...

Page 545: ...munication is unavailable use F1 Setup to access System Event Logs Menu and Choose Clear IMM System Event Log and Restart Server I 3818001 I 3818001 The firmware image capsule signature for the currently booted flash bank is invalid Explanation Current Bank CRTM Capsule Update Signature Invalid Severity Info User Response Complete the following steps 1 Reboot system Will come up on backup UEFI ima...

Page 546: ...error recovery is complete and no additional action is required 3 If system fails to boot or if flash attempt fails Trained service technician only Replace the system board I 58015 I 58015 Memory spare copy initiated Explanation Spare Copy Started Severity Info User Response Complete the following steps 1 No user required for this event This is for informational purposes only I 580A4 I 580A4 Memor...

Page 547: ...upport site for an applicable service bulletin or firmware update that applies to this Processor error 2 Trained Service technician only Replace the Processor S 1100C S 1100C An uncorrectable error has been detected on processor Explanation Uncorrectable processor error detected Severity Error User Response Complete the following steps 1 Check IBM support site for an applicable service bulletin or...

Page 548: ...ng steps 1 If this node and or any attached cables were recently installed moved serviced or upgraded a Reseat Adapter and any attached cables b Reload Device Driver c If device is not recognized reconfiguring slot to Gen1 or Gen2 may be required Gen1 Gen2 settings can be configured via F1 Setup System Settings Devices and I O Ports PCIe Gen1 Gen2 Gen3 Speed Selection or the ASU Utility 2 Check IB...

Page 549: ...letin or firmware update that applies to this error 2 Reflash UEFI image 3 Trained service technician only Replace the system board S 3040007 S 3040007 A firmware fault has been detected in the UEFI image Explanation Internal UEFI Firmware Fault Detected System halted Severity Error User Response Complete the following steps 1 Check IBM support site for an applicable service bulletin or firmware u...

Page 550: ...r 30 seconds to clear CMOS contents Verify that the system boots Then re install options one at a time to locate the problem 4 Check IBM support site for an applicable service bulletin or firmware update that applies to this error 5 Reflash UEFI firmware 6 Remove and re install CMOS battery for 30 seconds to clear CMOS contents 7 Trained service technician only Replace the system board S 3060007 S...

Page 551: ...ing steps 1 Continue booting sytem If system does not reset manually reset the system 2 If the error is not reported on the subsequent boot no additional recovery action is required 3 If the error persists continue booting system and reflash UEFI image 4 Trained service technician only Replace the system board S 3818007 S 3818007 The firmware image capsules for both flash banks could not be verifi...

Page 552: ...same memory channel 4 Check IBM support site for an applicable service bulletin or firmware update that applies to this memory error 5 Trained Service technician only If problem re occurs on the same DIMM connector inspect connector for damage If found replace system board 6 Trained Service technician only Replace affected Processor S 51006 S 51006 A memory mismatch has been detected Please verify...

Page 553: ... and or event log entry 4 If problem re occurs on the same DIMM connector swap the other DIMMs on the same memory channel across channels one at a time to a different memory channel or Processor check service information for this product Install guide for population requirements for sparing paring modes If problem follows a moved DIMM to a different memory channel replace that DIMM 5 Check IBM sup...

Page 554: ... is found remove debris 3 If error recurs or socket damage is found replace the system board Trained Service technician only 4 Trained Service Technician Only Replace the processor S 680B9 S 680B9 External QPI Link Failure Detected Explanation External QPI Link Failure Detected Severity Error User Response Complete the following steps 1 Check IBM support site for an applicable service bulletin or ...

Page 555: ...licable service bulletin or firmware update that applies to this error 2 Reflash Primary UEFI image Refer to UEFI Recovery section of service information for this product 3 Trained service technician only Replace the system board W 305000A W 305000A An invalid date and time have been detected Explanation RTC Date and Time Incorrect Severity Warning User Response Complete the following steps 1 Chec...

Page 556: ...m board W 305800B W 305800B DRIVER HEALTH PROTOCOL Reports Reboot Required Controller Explanation DRIVER HEALTH PROTOCOL Reports Reboot Required Controller Severity Warning User Response Complete the following steps 1 No action required system will reboot at the end of POST 2 If problem persists switch to backup UEFI or reflash current UEFI image 3 Trained Service Technician Only Replace system bo...

Page 557: ...ng steps 1 Reboot the system 2 If problem persists switch to backup UEFI or reflash current UEFI image 3 Trained Service Technician Only Replace system board W 3808000 W 3808000 An IMM communication failure has occurred Explanation IMM Communication Failure Severity Warning User Response Complete the following steps 1 Reset the IMM from the FPC 2 Use FPC to remove AUX power from the node This will...

Page 558: ... from the node This will reboot the entire node 4 Check IBM support site for an applicable service bulletin or firmware update that applies to this error 5 Reflash IMM Firmware 6 Remove and re install CMOS battery for 30 seconds to clear CMOS contents 7 Trained Service technician only Replace the system board W 3818005 W 3818005 The CRTM flash driver could not successfully flash the staging area T...

Page 559: ...everity Info User Response Complete the following steps 1 If the DIMM was disabled because of a memory fault follow the procedure for that event 2 If no memory fault is recorded in the logs and no DIMM connector error LEDs are lit re enable the DIMM through the Setup utility or the Advanced Settings Utility ASU 3 If problem persists Power cycle the node from management console 4 Reset IMM to defau...

Page 560: ...oard 5 Trained service technician only Inspect processor socket for foreign debris or damage If debris is found remove debris 6 Trained service technician only Remove affected processor and inspect processor socket pins for damaged or mis aligned pins If damage is found on processor replace system board 7 Trained Service technician only Replace affected processor W 58007 W 58007 Invalid memory con...

Page 561: ...he service information for this product W 68002 W 68002 A CMOS battery error has been detected Explanation CMOS Battery Fault Severity Error User Response Complete the following steps 1 If the system was recently Installed Moved or Serviced make sure the battery is properly seated 2 Check IBM support site for an applicable service bulletin or firmware update that applies to this error 3 Replace CM...

Page 562: ...548 IBM NeXtScale nx360 M4 Installation and Service Guide ...

Page 563: ...an result when you run the DSA Broadcom network test 405 000 000 BRCM TestControlRegisters Test Passed The test passed Recoverable No Severity Event Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 405 001 000 BRCM TestMIIRegisters Test Passed The test passed Recoverable No Severity Event Serviceable No Automatically notif...

Page 564: ... 405 003 000 BRCM TestInternalMemory Test Passed The test passed Recoverable No Severity Event Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 405 004 000 BRCM TestInterrupt Test Passed The test passed Recoverable No Severity Event Serviceable No Automatically notify support No 550 IBM NeXtScale nx360 M4 Installation and ...

Page 565: ... No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 405 006 000 BRCM TestLoopbackPhysical Test Passed The test passed Recoverable No Severity Event Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 405 007 000 BRCM TestLEDs Test Passed The test passed Recoverable No Appendix C DSA diagnostic te...

Page 566: ...e No Severity Warning Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 405 801 000 BRCM TestMIIRegisters Test Aborted The MII register test was canceled Recoverable No Severity Warning Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 552 IBM NeXtSc...

Page 567: ...3 000 BRCM TestInternalMemory Test Aborted The internal memory test was canceled Recoverable No Severity Warning Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 405 804 000 BRCM TestInterrupt Test Aborted The interrupt test was canceled Recoverable No Severity Warning Serviceable No Automatically notify support No Appendi...

Page 568: ...upport website Latest level of DSA Latest level of BMC IMM 405 806 000 BRCM TestLoopbackPhysical Test Aborted Loopback testing at the physical layer was canceled Recoverable No Severity Warning Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 405 807 000 BRCM TestLEDs Test Aborted Verification of status LEDs was canceled R...

Page 569: ...nt firmware level and upgrade if necessary The installed firmware level can be found in the DSA Diagnostic Event Log within the Firmware VPD section for this component 2 Rerun the test 3 If failure remains refer to Troubleshooting by symptom in the system Installation and Service Guide for the next corrective action Related links IBM Support website Latest level of DSA Latest level of BMC IMM 405 ...

Page 570: ...ed while testing non volatile RAM Recoverable No Severity Error Serviceable Yes Automatically notify support No User Response Complete the following steps 1 Check component firmware level and upgrade if necessary The installed firmware level can be found in the DSA Diagnostic Event Log within the Firmware VPD section for this component 2 Rerun the test 3 If failure remains refer to Troubleshooting...

Page 571: ...TestInterrupt Test Failed A failure was detected while testing interrupts Recoverable No Severity Error Serviceable Yes Automatically notify support No User Response Complete the following steps 1 Check component firmware level and upgrade if necessary The installed firmware level can be found in the DSA Diagnostic Event Log within the Firmware VPD section for this component 2 Rerun the test 3 If ...

Page 572: ... of BMC IMM 405 906 000 BRCM TestLoopbackPhysical Test Failed A failure was detected during the loopback test at the physical layer Recoverable No Severity Error Serviceable Yes Automatically notify support No User Response Complete the following steps 1 Check component firmware level and upgrade if necessary The installed firmware level can be found in the DSA Diagnostic Event Log within the Firm...

Page 573: ...mponent 2 Rerun the test 3 If failure remains refer to Troubleshooting by symptom in the system Installation and Service Guide for the next corrective action Related links IBM Support website Latest level of DSA Latest level of BMC IMM DSA Brocade test results The following messages can result when you run the Brocade test Test results for the DSA Brocade test The following messages can result whe...

Page 574: ...ated links IBM Support website Latest level of DSA Latest level of BMC IMM 218 002 000 Brocade SerdesLoopbackTest Passed The test passed Recoverable No Severity Event Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 218 003 000 Brocade PCILoopbackTest Passed The test passed Recoverable No 560 IBM NeXtScale nx360 M4 Install...

Page 575: ...verable No Severity Event Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 218 005 000 Brocade SerdesEthLoopbackTest Passed The test passed Recoverable No Severity Event Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM Appendix C DSA diagnostic tes...

Page 576: ...8 800 000 Brocade MemoryTest Aborted The test was canceled Recoverable No Severity Warning Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 218 801 000 Brocade ExternalLoopbackTest Aborted The test was canceled Recoverable No Severity Warning Serviceable No Automatically notify support No 562 IBM NeXtScale nx360 M4 Install...

Page 577: ...lated links IBM Support website Latest level of DSA Latest level of BMC IMM 218 803 000 Brocade PCILoopbackTest Aborted The test was canceled Recoverable No Severity Warning Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 218 804 000 Brocade ExternalEthLoopbackTest Aborted The test was canceled Recoverable No Appendix C D...

Page 578: ...o Severity Warning Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 218 806 000 Brocade InternalLoopbackTest Aborted The test was canceled Recoverable No Severity Warning Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 564 IBM NeXtScale nx360 M4 I...

Page 579: ...port representative Related links IBM Support website Latest level of DSA Latest level of BMC IMM 218 901 000 Brocade ExternalLoopbackTest Failed A failure was detected during the Loopback test Recoverable No Severity Error Serviceable Yes Automatically notify support No User Response Complete the following steps 1 Check cable connections 2 Rerun the test 3 Verify whether the firmware is at proper...

Page 580: ... problem remains contact your IBM technical support representative Related links IBM Support website Latest level of DSA Latest level of BMC IMM 218 903 000 Brocade PCILoopbackTest Failed A failure was detected during the Loopback test Recoverable No Severity Error Serviceable Yes Automatically notify support No User Response Complete the following steps 1 Rerun the test 2 Verify whether the firmw...

Page 581: ...P cable 2 Rerun the test 3 Verify whether the firmware is at proper level 4 Rerun the test 5 If the problem remains contact your IBM technical support representative Related links IBM Support website Latest level of DSA Latest level of BMC IMM 218 905 000 Brocade SerdesEthLoopbackTest Failed A failure was detected during the Loopback test Recoverable No Severity Error Serviceable Yes Automatically...

Page 582: ... the following steps 1 Rerun the test 2 Verify whether the firmware is at proper level 3 Rerun the test 4 If the problem remains contact your IBM technical support representative Related links IBM Support website Latest level of DSA Latest level of BMC IMM DSA checkpoint panel test results The following messages can result when you run the checkpoint panel test Test results for the DSA checkpoint ...

Page 583: ... No User Response Complete the following steps 1 Inspect and reseat operator information panel cable at both ends 2 Verify that the Baseboard Management Controller BMC is working 3 Run the test again 4 If failure remains refer to Troubleshooting by symptom in the system Installation and Service Guide for the next corrective action Related links IBM Support website Latest level of DSA Latest level ...

Page 584: ...ymptom in the system Installation and Service Guide for the next corrective action Related links IBM Support website Latest level of DSA Latest level of BMC IMM DSA CPU stress test results The following messages can result when you run the CPU stress test Test results for the DSA CPU stress test The following messages can result when you run the DSA CPU stress test 089 000 000 CPU Stress Test Pass...

Page 585: ...ction for this component The latest level firmware for this component can be found in reference to this system type at the IBM Support website 5 Run the test again 6 If the system has stopped responding turn off and restart the system and then run the test again 7 If failure remains refer to Troubleshooting by symptom in the system Installation and Service Guide for the next corrective action Rela...

Page 586: ...ubleshooting by symptom in the system Installation and Service Guide for the next corrective action Related links IBM Support website Latest level of DSA Latest level of BMC IMM 089 803 000 CPU Stress Test Aborted CPU Stress Test Aborted Memory size is insufficient to run the test At least 1GB is required Recoverable No Severity Warning Serviceable Yes Automatically notify support No Related links...

Page 587: ...found in the DSA Diagnostic Event Log within the Firmware VPD section for this component 5 Run the test again 6 If the system has stopped responding turn off and restart the system and then run the test again 7 If failure remains refer to Troubleshooting by symptom in the system Installation and Service Guide for the next corrective action Related links IBM Support website Latest level of DSA Late...

Page 588: ...ackTest Passed The test passed Recoverable No Severity Event Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 516 002 000 ELXUCNA ELXUCNA NIC LED Beacon Test Passed The test passed Recoverable No Severity Event Serviceable No Automatically notify support No Related links 574 IBM NeXtScale nx360 M4 Installation and Service ...

Page 589: ... Latest level of DSA Latest level of BMC IMM 516 801 000 ELXUCNA NIC PHY LoopBackTest Aborted Loopback testing at the physical layer was canceled Recoverable No Severity Warning Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 516 802 000 ELXUCNA ELXUCNA NIC LED Beacon Test Aborted Verification of status LEDs was canceled ...

Page 590: ...ssary The installed firmware level can be found in the DSA Diagnostic Event Log within the Firmware VPD section for this component 2 Rerun the test 3 If failure remains refer to Troubleshooting by symptom in the system Installation and Service Guide for the next corrective action Related links IBM Support website Latest level of DSA Latest level of BMC IMM 516 901 000 ELXUCNA NIC PHY LoopBackTest ...

Page 591: ...able No Severity Error Serviceable Yes Automatically notify support No User Response Complete the following steps 1 Check component firmware level and upgrade if necessary The installed firmware level can be found in the DSA Diagnostic Event Log within the Firmware VPD section for this component 2 Rerun the test 3 If failure remains refer to Troubleshooting by symptom in the system Installation an...

Page 592: ...tify support No User Response Complete the following steps 1 Remove power cables wait for 45 seconds reconnect and rerun the test 2 Make sure that the scalability cable connections are as per specification 3 Make sure that DSA and BIOS uEFI are at the latest level 4 If the problem remains contact your technical service representative Related links IBM Support website Latest level of DSA Latest lev...

Page 593: ...st level of BMC IMM 401 901 001 EXA Port Ping Test Failed EXA Port Ping Test Failed Recoverable No Severity Error Serviceable Yes Automatically notify support No User Response Complete the following steps 1 Remove power cables wait for 45 seconds reconnect and rerun the test 2 Make sure that the scalability cable connections are as per specification 3 Check scalability cables for loose connections...

Page 594: ...port No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 217 800 000 HDD Test Aborted HDD Test Aborted The test was canceled Recoverable No Severity Warning Serviceable Yes Automatically notify support No User Response Complete the following steps 1 Check cable connections 2 Rerun the test 3 Verify that Hard drive supports self test and self test logging 4 If the probl...

Page 595: ...e is at the latest level 4 Rerun the test 5 If the problem remains contact your technical support representative Related links IBM Support website Latest level of DSA Latest level of BMC IMM DSA Intel network test results The following messages can result when you run the Intel network test Test results for the DSA Intel network test The following messages can result when you run the DSA Intel net...

Page 596: ...e No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 406 002 000 IANet FIFO Test Passed The test passed Recoverable No Severity Event Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 406 003 000 IANet Interrupts Test Passed 582 IBM NeXtScale nx360 M4 Installatio...

Page 597: ...pback Test Passed The test passed Recoverable No Severity Event Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 406 800 000 IANet Registers Test Aborted Registers test was canceled Recoverable No Severity Warning Serviceable No Automatically notify support No Related links Appendix C DSA diagnostic test results 583 ...

Page 598: ...ort website Latest level of DSA Latest level of BMC IMM 406 802 000 IANet FIFO Test Aborted FIFO test was canceled Recoverable No Severity Warning Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 406 803 000 IANet Interrupts Test Aborted Interrupt test was canceled Recoverable No Severity Warning 584 IBM NeXtScale nx360 M4...

Page 599: ... Registers Test Failed A failure was detected during the Registers test Recoverable No Severity Error Serviceable Yes Automatically notify support No User Response Complete the following steps 1 Check component firmware level and upgrade if necessary The installed firmware level can be found in the DSA Diagnostic Event Log within the Firmware VPD section for this component 2 Rerun the test 3 If fa...

Page 600: ... Rerun the test 3 If failure remains refer to Troubleshooting by symptom in the system Installation and Service Guide for the next corrective action Related links IBM Support website Latest level of DSA Latest level of BMC IMM 406 902 000 IANet FIFO Test Failed A failure was detected during the FIFO test Recoverable No Severity Error Serviceable Yes Automatically notify support No User Response Co...

Page 601: ...ound in the DSA Diagnostic Event Log within the Firmware VPD section for this component 2 Rerun the test 3 Check interrupt assignments in the PCI Hardware section of the DSA Diagnostic Log If the ethernet device is sharing interrupts if possible modify the interrupt assignments using F1 Setup to assign a unique interrupt to the device 4 Rerun the test 5 If failure remains refer to Troubleshooting ...

Page 602: ... next corrective action Related links IBM Support website Latest level of DSA Latest level of BMC IMM DSA LSI hard drive test results The following messages can result when you run the LSI hard drive test Test results for the DSA LSI hard driveoutputfilename DSA_LSI_hard_drive test The following messages can result when you run the DSA LSI hard driveoutputfilename DSA_LSI_hard_drive test 407 000 0...

Page 603: ... Failed The hard drive self test detected a failure Recoverable No Severity Error Serviceable Yes Automatically notify support No User Response Complete the following steps 1 Check cable connections 2 Rerun the test 3 Verify whether the firmware is at the latest level 4 Rerun the test 5 If the problem remains contact your IBM technical support representative Related links IBM Support website Lates...

Page 604: ...o Severity Event Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 408 001 000 MLNX MLNX_DiagnosticTestIBPort Test Passed Port Test Passed Recoverable No Severity Event Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 408 800 000 MLNX MLNX_Diagnosti...

Page 605: ...port No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 408 900 000 MLNX MLNX_DiagnosticTestEthernetPort Test Failed Port Test Failed Recoverable No Severity Error Serviceable Yes Automatically notify support No User Response Complete the following steps 1 Make sure that the physical link of the port under test in the active state 2 If these condition was met but the ...

Page 606: ...bric to which the port is attached 2 If these condition was met but the test keeps failing the port s adapter might be faulty 3 Try replacing the adapter and repeating the test Related links IBM Support website Latest level of DSA Latest level of BMC IMM DSA memory isolation test results The following messages can result when you run the memory isolation test Test results for the DSA memory isolat...

Page 607: ...erable No Severity Event Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 000 002 Standalone Memory Test Passed Quick Full Memory Test CPU 2 Passed Recoverable No Severity Event Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM Appendix C DSA di...

Page 608: ...00 004 Standalone Memory Test Passed Quick Full Memory Test CPU 4 Passed Recoverable No Severity Event Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 811 000 Standalone Memory Test Aborted Unable to Locate SMBIOS key _SM_ Recoverable No Severity Warning Serviceable No Automatically notify support No 594 IBM NeXtScale...

Page 609: ... Locate SMBIOS key _SM_ Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to p...

Page 610: ...ory Test Aborted Unable to Locate SMBIOS key _SM_ Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Resea...

Page 611: ...evel of DSA Latest level of BMC IMM 201 812 001 Standalone Memory Test Aborted Memory test is not supported for this system Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 ...

Page 612: ...FI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 812 003 Standalone Memory Test Aborted Memory test is not supported for this system Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If...

Page 613: ... the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 813 001 Standalone Memory Test Aborted Chipset Error Can not turn OFF ECC error reporting in CPU Recoverable No Severity Warning Serviceable No Automatically noti...

Page 614: ...e following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest lev...

Page 615: ...cubbing feature for CPU Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to p...

Page 616: ... Chipset Error Can not disable Scubbing feature for CPU Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds...

Page 617: ...of DSA Latest level of BMC IMM 201 815 000 Standalone Memory Test Aborted Program Error with Quick Memory Menu Option Selection Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representativ...

Page 618: ...FI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 815 002 Standalone Memory Test Aborted Program Error with Quick Memory Menu Option Selection Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each ac...

Page 619: ...f the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 816 000 Standalone Memory Test Aborted Program Error with Full Memory Menu Option Selection Recoverable No Severity Warning Serviceable No Automatically notify s...

Page 620: ...e following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest lev...

Page 621: ...Full Memory Menu Option Selection Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconne...

Page 622: ...ne Memory Test Aborted Unable to Locate SMBIOS key _SM_ Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds...

Page 623: ...test level of DSA Latest level of BMC IMM 201 818 003 Standalone Memory Test Aborted Unable to Locate SMBIOS key _SM_ Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn o...

Page 624: ...FI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 819 001 Standalone Memory Test Aborted The start end address ranges in the restricted area of the memory Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test a...

Page 625: ... the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 819 003 Standalone Memory Test Aborted The start end address ranges in the restricted area of the memory Recoverable No Severity Warning Serviceable No Automatica...

Page 626: ...e following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest lev...

Page 627: ... is less than 16 Mbytes Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to p...

Page 628: ...RR registers are larger than fixed range MTRR registers Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds...

Page 629: ...test level of BMC IMM 201 821 002 Standalone Memory Test Aborted Variable range MTRR registers are larger than fixed range MTRR registers Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service rep...

Page 630: ...e that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 822 000 Standalone Memory Test Aborted Invalid MTRR service request Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each actio...

Page 631: ... the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 822 002 Standalone Memory Test Aborted Invalid MTRR service request Recoverable No Severity Warning Serviceable No Automatically notify support No User Response C...

Page 632: ...mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 824 000 Standalone Mem...

Page 633: ...on and then re run the test Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it ...

Page 634: ...orted Node Interleave feature must be OFF Go to Setup and disable Node Interleave option and then re run the test Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off t...

Page 635: ...links IBM Support website Latest level of DSA Latest level of BMC IMM 201 826 001 Standalone Memory Test Aborted BIOS Memory Controller has been disabled Go to Setup and Enable Memory Controller Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 ...

Page 636: ...hat DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 826 003 Standalone Memory Test Aborted BIOS Memory Controller has been disabled Go to Setup and Enable Memory Controller Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mention...

Page 637: ...urn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 827 001 Standalone Memory Test Aborted BIOS ECC function has been disabled by BIOS Go to Setup and enable ECC generation Recoverable No Severity Warning Se...

Page 638: ...e following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest lev...

Page 639: ... control MASK registers Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to p...

Page 640: ...lem in masking MSR machine check control MASK registers Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds...

Page 641: ... Latest level of BMC IMM 201 845 000 Standalone Memory Test Aborted Chipset Error Problem clearing MSR machine check control registers Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service repres...

Page 642: ...FI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 845 002 Standalone Memory Test Aborted Chipset Error Problem clearing MSR machine check control registers Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test ...

Page 643: ...presentative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 859 000 Standalone Memory Test Aborted INVALID XSECSRAT type Recoverable No Severity Warning Serviceable No Automatically notify support No...

Page 644: ...e following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest lev...

Page 645: ...ed INVALID XSECSRAT type Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to ...

Page 646: ...001 Standalone Memory Test Aborted No OEM0 type 1 found Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds...

Page 647: ...ite Latest level of DSA Latest level of BMC IMM 201 860 003 Standalone Memory Test Aborted No OEM0 type 1 found Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the...

Page 648: ...FI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 861 001 Standalone Memory Test Aborted No SRAT type 1 found Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains con...

Page 649: ... the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 861 003 Standalone Memory Test Aborted No SRAT type 1 found Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete ...

Page 650: ...e following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest lev...

Page 651: ...No OEM1 structure found Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to p...

Page 652: ...e Memory Test Aborted No IBMERROR key in OEM1 structure Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds...

Page 653: ...est level of DSA Latest level of BMC IMM 201 863 002 Standalone Memory Test Aborted No IBMERROR key in OEM1 structure Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn o...

Page 654: ...OS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 864 000 Standalone Memory Test Aborted No GAS located in OEM1 Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem rema...

Page 655: ... the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 864 002 Standalone Memory Test Aborted No GAS located in OEM1 Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complet...

Page 656: ...owing steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest level of ...

Page 657: ...T key in OEM0 structure Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to p...

Page 658: ...Test Aborted No XSECSRAT key in OEM0 structure Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat D...

Page 659: ... of DSA Latest level of BMC IMM 201 866 001 Standalone Memory Test Aborted EFI SAL Invalid parameter from GetMemoryMap function Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representativ...

Page 660: ...FI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 866 003 Standalone Memory Test Aborted EFI SAL Invalid parameter from GetMemoryMap function Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each act...

Page 661: ... the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 867 001 Standalone Memory Test Aborted EFI SAL Buffer not allocated Recoverable No Severity Warning Serviceable No Automatically notify support No User Response C...

Page 662: ...e following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest lev...

Page 663: ... GetMemoryMap too small Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to p...

Page 664: ...rted EFI SAL Buffer allocated in GetMemoryMap too small Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds...

Page 665: ... of DSA Latest level of BMC IMM 201 869 000 Standalone Memory Test Aborted EFI SAL Invalid parameter from GetMemoryMap function Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representativ...

Page 666: ...FI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 869 002 Standalone Memory Test Aborted EFI SAL Invalid parameter from GetMemoryMap function Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each act...

Page 667: ...e 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 870 000 Standalone Memory Test Aborted CPU Doamin in ACPI not valid Recoverable No Severity Warning Serviceable No Automatically notify support No Use...

Page 668: ...e following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest lev...

Page 669: ...oamin in ACPI not valid Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to p...

Page 670: ...dalone Memory Test Aborted Data Mis compare encountered Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds...

Page 671: ...Latest level of DSA Latest level of BMC IMM 201 871 003 Standalone Memory Test Aborted Data Mis compare encountered Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off...

Page 672: ...FI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 877 001 Standalone Memory Test Aborted BIOS Sparing in Extended PCI reg must be OFF Go to setup and disable sparing Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try...

Page 673: ... the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 877 003 Standalone Memory Test Aborted BIOS Sparing in Extended PCI reg must be OFF Go to setup and disable sparing Recoverable No Severity Warning Serviceable No...

Page 674: ...e following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest lev...

Page 675: ...the sparing feature OFF Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to p...

Page 676: ...ster manipulation Can not write to memory without cache Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds...

Page 677: ... of BMC IMM 201 885 002 Standalone Memory Test Aborted Processor does not support MTRR register manipulation Can not write to memory without cache Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical se...

Page 678: ...sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 886 000 Standalone Memory Test Aborted Memory Upper limit is less than 16 Mbytes Recoverable No Severity Warning Serviceable No Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test ...

Page 679: ... the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 886 002 Standalone Memory Test Aborted Memory Upper limit is less than 16 Mbytes Recoverable No Severity Warning Serviceable No Automatically notify support No Us...

Page 680: ...he following steps 1 Perform the actions mentioned one at a time and try the test after each action 2 If the problem remains contact your technical service representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level Related links IBM Support website Latest level of DSA Latest le...

Page 681: ...links IBM Support website Latest level of DSA Latest level of BMC IMM 201 899 002 Standalone Memory Test Aborted Memory Diagnostics Test Aborted by user Recoverable No Severity Warning Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 899 003 Standalone Memory Test Aborted Memory Diagnostics Test Aborted by user Recover...

Page 682: ...e representative 3 Turn off the system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level 5 Replace any DIMMS s mentioned in error one by one 6 Make sure that all DIMMs are enabled in the Configuration Setup Utility program 7 If failure remains refer to Troubleshooting by symptom in the system Installation...

Page 683: ... Installation and Service Guide for the next corrective action Related links IBM Support website Latest level of DSA Latest level of BMC IMM 201 901 002 Standalone Memory Test Failed Memory Diagnostics Test Failed Recoverable No Severity Error Serviceable Yes Automatically notify support No User Response Complete the following steps 1 Perform the actions mentioned one at a time and try the test af...

Page 684: ... system and disconnect it from power Wait for 45 seconds Reseat DIMM s Reconnect it to power 4 Make sure that DSA and BIOS uEFI are at the latest level 5 Replace any DIMMS s mentioned in error one by one 6 Make sure that all DIMMs are enabled in the Configuration Setup Utility program 7 If failure remains refer to Troubleshooting by symptom in the system Installation and Service Guide for the next...

Page 685: ...ps 1 Turn off and restart the system 2 Make sure that the DSA Diagnostic code is at the latest level 3 Run the test again 4 If the system has stopped responding turn off and restart the system 5 Check the system firmware level and upgrade if necessary 6 Run the memory diagnostic to identify the specific failing DIMM 7 If the failure remains refer to Troubleshooting by symptom in the system Install...

Page 686: ...level of DSA Latest level of BMC IMM 202 803 000 MemStr Test Aborted User pressed Ctrl C Recoverable No Severity Warning Serviceable Yes Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 202 901 000 MemStr Test Failed Test Failed Recoverable No Severity Error Serviceable Yes Automatically notify support No 672 IBM NeXtScale nx360 M4 Insta...

Page 687: ...t Failed Memory size is insufficient to run the test Recoverable No Severity Error Serviceable Yes Automatically notify support No User Response Complete the following steps 1 Ensure that all memory is enabled by checking the Available System Memory in the Resource Utilization section of the DSA Diagnostic Event log 2 If necessary access the Configuration Setup Utility program by pressing F1 durin...

Page 688: ...ity Event Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 409 003 000 Nvidia DiagnosticServiceProvider Bandwidth Test Passed Nvidia GPU Bandwidth test passed Recoverable No Severity Event Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 409 004 00...

Page 689: ...trix Test Passed Nvidia GPU Matrix test passed Recoverable No Severity Event Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 409 006 000 Nvidia DiagnosticServiceProvider Binomial Test Passed Nvidia GPU Binomial test passed Recoverable No Severity Event Serviceable No Automatically notify support No Related links Appendix ...

Page 690: ... DSA Latest level of BMC IMM 409 803 000 Nvidia DiagnosticServiceProvider Bandwidth Test Aborted Nvidia GPU Bandwidth test was canceled Recoverable No Severity Warning Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 409 804 000 Nvidia DiagnosticServiceProvider Query Test Aborted Nvidia GPU Query test was canceled Recovera...

Page 691: ...ble No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 409 806 000 Nvidia DiagnosticServiceProvider Binomial Test Aborted Nvidia GPU Binomial test was canceled Recoverable No Severity Warning Serviceable No Automatically notify support No Related links IBM Support website Latest level of DSA Latest level of BMC IMM 409 900 000 NVIDIA Us...

Page 692: ... Related links IBM Support website Latest level of DSA Latest level of BMC IMM 409 903 000 Nvidia DiagnosticServiceProvider Bandwidth Test Failed Nvidia GPU Bandwidth Test Failed Recoverable No Severity Error Serviceable Yes Automatically notify support No User Response Complete the following steps 1 Verify that the GPU is seated in the PCIe slot correctly by reseating the GPU Then power cycle the...

Page 693: ...irmly Then power cycle the system 3 Run nvidia smi q In some cases this will report a poorly connected power cable 4 Rerun the diagnostics using the same GPU on system that is known to be working A variety of system issues can cause diagnostic failure 5 If the problem remains contact your IBM technical support representative Related links IBM Support website Latest level of DSA Latest level of BMC...

Page 694: ...st Failed Recoverable No Severity Error Serviceable Yes Automatically notify support No User Response Complete the following steps 1 Verify that the GPU is seated in the PCIe slot correctly by reseating the GPU Then power cycle the system 2 Verify that the power connectors to the GPU are connected firmly Then power cycle the system 3 Run nvidia smi q In some cases this will report a poorly connect...

Page 695: ...Recoverable No Severity Warning Serviceable Yes Automatically notify support No User Response Complete the following steps 1 Make sure that the DSA Diagnostic code is at the latest level 2 Run the test again 3 Check the drive cabling for loose or broken connections at both ends or damage to the cable Replace the cable if damage is present 4 Run the test again 5 Check system firmware level and upgr...

Page 696: ...ds or damage to the cable Replace the cable if damage is present 3 Run the test again 4 If failure remains refer to Troubleshooting by symptom in the system Installation and Service Guide for the next corrective action Related links IBM Support website Latest level of DSA Latest level of BMC IMM 215 803 000 Optical Drive Test Failed Optical Drive Test Failed Disk may be in use by the operating sys...

Page 697: ... a new CD or DVD into the drive and wait for 15 seconds for the media to be recognized Rerun the test 3 Check the drive cabling for loose or broken connections at both ends or damage to the cable Replace the cable if damage is present 4 Run the test again 5 If failure remains refer to Troubleshooting by symptom in the system Installation and Service Guide for the next corrective action Related lin...

Page 698: ...Optical Drive Test Failed Read miscompare Recoverable No Severity Error Serviceable Yes Automatically notify support No User Response Complete the following steps 1 Insert a new CD or DVD into the drive and wait for 15 seconds for the media to be recognized Rerun the test 2 Check the drive cabling for loose or broken connections at both ends or damage to the cable Replace the cable if damage is pr...

Page 699: ...ostic Event Log within the Firmware VPD section for this component 5 Run the test again 6 If failure remains refer to Troubleshooting by symptom in the system Installation and Service Guide for the next corrective action Related links IBM Support website Latest level of DSA Latest level of BMC IMM DSA system management test results The following messages can result when you run the system manageme...

Page 700: ... power 2 Make sure that DSA and BMC IMM are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 166 802 001 IMM I2C Test Aborted Test cannot be completed for unknown reason Recoverable No Severity Warning Serviceable Yes Automatically notify support No User Response Perform the actions mentioned one at a time and try the test after each action 1 Turn o...

Page 701: ... 2 Make sure that DSA and BMC IMM are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 166 804 001 IMM I2C Test Aborted Invalid Command Recoverable No Severity Warning Serviceable Yes Automatically notify support No User Response Perform the actions mentioned one at a time and try the test after each action 1 Turn off the system and disconnect it fr...

Page 702: ...e latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 166 806 001 IMM I2C Test Aborted Timeout while processing command Recoverable No Severity Warning Serviceable Yes Automatically notify support No User Response Perform the actions mentioned one at a time and try the test after each action 1 Turn off the system and disconnect it from power Wait for 45 secon...

Page 703: ...te Latest level of DSA Latest level of BMC IMM 166 808 001 IMM I2C Test Aborted Reservation Canceled or Invalid Reservation ID Recoverable No Severity Warning Serviceable Yes Automatically notify support No User Response Perform the actions mentioned one at a time and try the test after each action 1 Turn off the system and disconnect it from power Wait for 45 seconds Reconnect it to power 2 Make ...

Page 704: ...evel of BMC IMM 166 810 001 IMM I2C Test Aborted Request data length invalid Recoverable No Severity Warning Serviceable Yes Automatically notify support No User Response Perform the actions mentioned one at a time and try the test after each action 1 Turn off the system and disconnect it from power Wait for 45 seconds Reconnect it to power 2 Make sure that DSA and BMC IMM are at the latest level ...

Page 705: ... IMM 166 812 001 IMM I2C Test Aborted Parameter out of range Recoverable No Severity Warning Serviceable Yes Automatically notify support No User Response Perform the actions mentioned one at a time and try the test after each action 1 Turn off the system and disconnect it from power Wait for 45 seconds Reconnect it to power 2 Make sure that DSA and BMC IMM are at the latest level Related links IB...

Page 706: ...est Aborted Requested Sensor data or record not present Recoverable No Severity Warning Serviceable Yes Automatically notify support No User Response Perform the actions mentioned one at a time and try the test after each action 1 Turn off the system and disconnect it from power Wait for 45 seconds Reconnect it to power 2 Make sure that DSA and BMC IMM are at the latest level Related links IBM Sup...

Page 707: ...d illegal for specified sensor or record type Recoverable No Severity Warning Serviceable Yes Automatically notify support No User Response Perform the actions mentioned one at a time and try the test after each action 1 Turn off the system and disconnect it from power Wait for 45 seconds Reconnect it to power 2 Make sure that DSA and BMC IMM are at the latest level Related links IBM Support websi...

Page 708: ...le No Severity Warning Serviceable Yes Automatically notify support No User Response Perform the actions mentioned one at a time and try the test after each action 1 Turn off the system and disconnect it from power Wait for 45 seconds Reconnect it to power 2 Make sure that DSA and BMC IMM are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 166 819 ...

Page 709: ... Automatically notify support No User Response Perform the actions mentioned one at a time and try the test after each action 1 Turn off the system and disconnect it from power Wait for 45 seconds Reconnect it to power 2 Make sure that DSA and BMC IMM are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 166 821 001 IMM I2C Test Aborted Command respo...

Page 710: ...stem and disconnect it from power Wait for 45 seconds Reconnect it to power 2 Make sure that DSA and BMC IMM are at the latest level Related links IBM Support website Latest level of DSA Latest level of BMC IMM 166 823 001 IMM I2C Test Aborted Cannot execute command Insufficient privilege level Recoverable No Severity Warning Serviceable Yes Automatically notify support No User Response Perform th...

Page 711: ...ted links IBM Support website Latest level of DSA Latest level of BMC IMM 166 901 001 IMM I2C Test Failed IMM Indicates failure in RTMM bus BUS 0 Recoverable No Severity Error Serviceable Yes Automatically notify support No User Response Perform the actions mentioned one at a time and try the test after each action 1 Turn off the system and disconnect it from power Wait for 45 seconds Reconnect it...

Page 712: ... the test again 4 If failure remains refer to Troubleshooting by symptom in the system Installation and Service Guide for the next corrective action Related links IBM Support website Latest level of DSA Latest level of BMC IMM 166 904 001 IMM I2C Test Failed IMM Indicates failure in TMP75 bus BUS 3 Recoverable No Severity Error Serviceable Yes Automatically notify support No User Response Perform ...

Page 713: ...d one at a time and try the test after each action 1 Turn off the system and disconnect it from power Wait for 45 seconds Reconnect it to power 2 Make sure that DSA and BMC IMM are at the latest level 3 Run the test again 4 If failure remains refer to Troubleshooting by symptom in the system Installation and Service Guide for the next corrective action Related links IBM Support website Latest leve...

Page 714: ...upport website Latest level of DSA Latest level of BMC IMM DSA tape drive test results The following messages can result when you run the tape drive test Test results for the DSA tape drive test The following messages can result when you run the DSA tape drive test 264 000 000 Tape Test Passed Tape Test Passed Recoverable No Severity Event Serviceable No Automatically notify support No Related lin...

Page 715: ...ks IBM Support website Latest level of DSA Latest level of BMC IMM 264 902 000 Tape Test Failed Tape Test Failed Media is not detected Recoverable No Severity Error Serviceable Yes Automatically notify support No User Response Complete the following steps 1 Clean the tape drive using the appropriate cleaning media and install new media 2 Run the test again 3 Make sure that the drive firmware is at...

Page 716: ... refer to Troubleshooting by symptom in the system Installation and Service Guide for the next corrective action Related links IBM Support website Latest level of DSA Latest level of BMC IMM 264 904 000 Tape Test Failed Tape Test Failed Drive hardware error Recoverable No Severity Error Serviceable Yes Automatically notify support No User Response Complete the following steps 1 Check the tape driv...

Page 717: ... stopped responding turn off and restart the system 2 Check the system firmware level and upgrade if necessary The installed firmware level can be found in the DSA Diagnostic Event Log within the Firmware VPD section for this component 3 Run the test again 4 If the system has stopped responding turn off and restart the system 5 Make sure that the drive firmware is at the latest level 6 Run the tes...

Page 718: ...essary 8 Run the test again 9 If the failure remains refer to Troubleshooting by symptom in the system Installation and Service Guide for the next corrective action Related links IBM Support website Latest level of DSA Latest level of BMC IMM 264 907 000 Tape Test Failed An error was found in the block address somewhere Recoverable No Severity Error Serviceable Yes Automatically notify support No ...

Page 719: ... Yes Automatically notify support No User Response Complete the following steps 1 Make sure that medium is present 2 Clean the tape drive using the appropriate cleaning media and install new media Related links IBM Support website Latest level of DSA Latest level of BMC IMM Appendix C DSA diagnostic test results 705 ...

Page 720: ...re levels Other pertinent information such as error messages and logs Go to http www ibm com support entry portal Open_service_request to submit an Electronic Service Request Submitting an Electronic Service Request will start the process of determining a solution to your problem by making the pertinent information available to IBM Support quickly and efficiently IBM service technicians can start ...

Page 721: ... administrative services Software service and support Through IBM Support Line you can get telephone assistance for a fee with usage configuration and software problems with your IBM products For more information about Support Line and other IBM services see http www ibm com services or see http www ibm com planetwide for support telephone numbers In the U S and Canada call 1 800 IBM SERV 1 800 42...

Page 722: ......

Page 723: ...this statement may not apply to you This information could include technical inaccuracies or typographical errors Changes are periodically made to the information herein these changes will be incorporated in new editions of the publication IBM may make improvements and or changes in the product s and or the program s described in this publication at any time without notice Any references in this i...

Page 724: ...memory might require replacement of the standard memory with an optional memory module Each solid state memory cell has an intrinsic finite number of write cycles that the cell can incur Therefore a solid state device has a maximum number of write cycles that it can be subjected to expressed as total bytes written TBW A device that has exceeded this limit might fail to respond to system generated ...

Page 725: ...iculate air HEPA filters that meet MIL STD 282 The deliquescent relative humidity of the particulate contamination must be more than 60 2 The room must be free of conductive contamination such as zinc whiskers Gaseous Copper Class G1 as per ANSI ISA 71 04 19853 Silver Corrosion rate of less than 300 Å in 30 days 1 ASHRAE 52 2 2008 Method of Testing General Ventilation Air Cleaning Devices for Remo...

Page 726: ... and connectors or by unauthorized changes or modifications to this equipment Unauthorized changes or modifications could void the user s authority to operate the equipment This device complies with Part 15 of the FCC Rules Operation is subject to the following two conditions 1 this device may not cause harmful interference and 2 this device must accept any interference received including interfer...

Page 727: ... mit folgendem Warnhinweis versehen werden Warnung Dieses ist eine Einrichtung der Klasse A Diese Einrichtung kann im Wohnbereich Funk Störungen verursachen in diesem Fall kann vom Betreiber verlangt werden angemessene Maßnahmen zu ergreifen und dafür aufzukommen Deutschland Einhaltung des Gesetzes über die elektromagnetische Verträglichkeit von Geräten Dieses Produkt entspricht dem Gesetz über di...

Page 728: ...Electronics and Information Technology Industries Association JEITA statement Japan Electronics and Information Technology Industries Association JEITA Confirmed Harmonics Guidelines with Modifications products greater than 20 A per phase Korea Communications Commission KCC statement This is electromagnetic wave compatibility equipment for business Type A Sellers and users need to pay attention to...

Page 729: ...nt The product is not suitable for use with visual display work place devices according to clause 2 of the German Ordinance for Work with Visual Display Units Das Produkt ist nicht für den Einsatz an Bildschirmarbeitsplätzen im Sinne 2 der Bildschirmarbeitsverordnung geeignet ...

Page 730: ......

Page 731: ...statements 4 chassis management module 7 check log LED 12 checkout procedure 48 performing 49 China Class A electronic emission statement dccxiv Class A electronic emission notice dccxii collecting data 45 components illustrated 9 11 server 81 system board 15 compute node installing 93 174 removing 92 173 compute node cover installing 107 removing 105 configuration information 21 instructions 21 N...

Page 732: ...mptoms general 59 hard disk drive 60 hypervisor flash device 60 intermittent 61 keyboard 61 memory 62 microprocessor 63 monitor 64 mouse 61 network connection 66 optional devices 67 power 67 serial port 69 ServerGuide 69 software 70 USB port 70 USB device 61 video 64 71 errors format DSA code 58 Ethernet controller 72 Ethernet controller 7 Ethernet controller configuration 22 European Union EMC Di...

Page 733: ...y program overview 37 IBM Electronic Service Agent 59 IBM Systems Director updating 37 IBM Taiwan product service dccvii IMM host name 34 IMM web interface 34 IMM2 22 IMM2 heartbeat LED 53 important notices 4 dccx in band automated boot recovery method 78 manual recovery method 76 information center dccvi inspecting for unsafe conditions vi installation 1 guidelines 89 installation guidelines 89 i...

Page 734: ...e 25 notes 4 notes important dccx notices dccix electronic emission dccxii FCC Class A dccxii notices and statements 4 Nx boot failure 79 nx360 introduction 1 O obtaining 34 online documentation 1 online publications 3 operating system 2 operating system event log 54 operator information panel installing 123 operator information panelbezel removing 122 optional 3 5 inch hard disk drive hardware RA...

Page 735: ...GPU tray 103 153 hard disk drive 144 148 heat sink 162 memory module 129 microprocessor 162 operator information panel 122 paddle card from the GPU tray 124 PCI riser filler 112 PCI riser cage assembly 151 153 RAID adapter battery holder 110 storage tray 101 Replaceable server components 81 replacing adapter GPU adapter 157 air baffle 109 battery system 127 components 89 DIMM 134 filler on to the ...

Page 736: ...ystem reliability guidelines 91 system board assembly components 9 system event log 54 system event log assertion event 54 system event log deassertion event 54 systems management 7 chassis management module 7 T Taiwan Class A electronic emission statement dccxv telecommunication regulatory statement dccxii telephone numbers dccvii temperature 6 test log viewing 58 thermal grease 172 Tier 2 CRUs r...

Page 737: ......

Page 738: ...Part Number 00KC216 Printed in China 1P P N 00KC216 1P00KC216 ...

Reviews: