Node error code overview
Node error codes describe failure that relate to a specific node canister.
Because node errors are specific to a node, for example, memory has failed, the
errors are only reported on that node. However, some of the conditions that the
node detects relate to the shared components of the enclosure. In these cases both
node canisters in the enclosure report the error.
There are two types of node errors: critical node errors and noncritical node errors.
Critical errors
A critical error means that the node is not able to participate in a clustered system
until the issue that is preventing it from joining a clustered system is resolved. This
error occurs because part of the hardware has failed or the system detects that the
software is corrupt. If it is possible to communicate with the canister with a node
error, an alert that describes the error is logged in the event log. If the system
cannot communicate with the node canister, a
Node missing
alert is reported. If a
node has a critical node error, it is in service state, and the fault LED on the node
is on. The exception is when the node cannot connect to enough resources to form
a clustered system. It shows a critical node error but is in the starting state. The
range of errors that are reserved for critical errors are 500 - 699.
Noncritical errors
A noncritical error code is logged when there is a hardware or software failure that
is related to just one specific node. These errors do not stop the node from entering
active state and joining a clustered system. If the node is part of a clustered
system, there is also an alert that describes the error condition. The node error is
shown to make it clear which of the node canisters the alert refers to. The range of
errors that are reserved for noncritical errors are 800 - 899.
Clustered-system code overview
Recovery codes for clustered systems indicate that a critical software error has
occurred that might corrupt your system. Each error-code topic includes an error
code number, a description, action, and possible field-replaceable units (FRUs).
Error codes for recovering a clustered system
You must perform software problem analysis before you can perform further
operations to avoid the possibility of corrupting your configuration.
Error code range
This topic shows the number range for each message classification.
Table 27 lists the number range for each message classification.
Table 27. Message classification number range
Message classification
Range
Node errors
Critical node errors
500-699
Noncritical node errors
800-899
130
Storwize V7000: Troubleshooting, Recovery, and Maintenance Guide
Summary of Contents for Storwize V7000
Page 1: ...IBM Storwize V7000 Version 6 3 0 Troubleshooting Recovery and Maintenance Guide GC27 2291 02...
Page 6: ...vi Storwize V7000 Troubleshooting Recovery and Maintenance Guide...
Page 8: ...viii Storwize V7000 Troubleshooting Recovery and Maintenance Guide...
Page 10: ...x Storwize V7000 Troubleshooting Recovery and Maintenance Guide...
Page 34: ...18 Storwize V7000 Troubleshooting Recovery and Maintenance Guide...
Page 42: ...26 Storwize V7000 Troubleshooting Recovery and Maintenance Guide...
Page 80: ...64 Storwize V7000 Troubleshooting Recovery and Maintenance Guide...
Page 128: ...112 Storwize V7000 Troubleshooting Recovery and Maintenance Guide...
Page 156: ...140 Storwize V7000 Troubleshooting Recovery and Maintenance Guide...
Page 166: ...150 Storwize V7000 Troubleshooting Recovery and Maintenance Guide...
Page 171: ......
Page 172: ...Printed in USA GC27 2291 02...