9 Hard drive failures and faulted
LUNs
The purpose of fault-tolerant array con
fi
gurations is to protect against data loss due to hard drive failure.
Each RAID con
fi
guration has inherent limitations on the number of hard drive failures that it can tolerate.
If the fault-tolerance level of a particular LUN or array con
fi
guration is exceeded, the array will be locked
from any further I/O. This protection is designed to preserve the integrity of the local drive, but does
require manual intervention to recover or re-enable the LUN.
Although controller
fi
rmware is designed to protect against normal hard drive failure, it is imperative that
you perform the correct actions to recover from a hard drive failure without inadvertently introducing any
additional hard drive failures.
Included sections:
• Recognizing hard drive failure
• Compromised fault tolerance
•
Recovering from compromised fault tolerance (e
nabling failed LUNs
)
• Best practices when replacing hard drives
• Automatic data recovery
Recognizing hard drive failure
LEDs on the front of each hard drive are visible from the front of the external storage unit. When a hard
drive is con
fi
gured as a part of an array and attached to a powered-on controller, the status of the hard
drive can be determined from the illumination pattern of these LEDs.
For detailed descriptions of the various LED combinations, see
Hard drive LEDs
.
Other ways to determine that a hard drive has failed include the following:
•
LEDs on the storage system chassis illuminate amber if failed hard drives are inside. (However,
this LED also illuminates when other problems occur, such as when a fan fails, a redundant power
supply fails, or the system overheats.)
•
LEDs on the hard drives illuminate amber if a hard drive has failed or is a member of a faulted LUN.
•
Front-panel LCD display messages list faulted LUNs and failed hard drives whenever the system is
restarted, as long as the controller detects one or more good hard drives.
•
ACU represents faulted LUNs and failed drives with distinctive icons.
•
HP-SIM can detect failed hard drives.
•
ADU lists all failed hard drives.
For more information on troubleshooting hard drive problems, see the HP ProLiant Servers Troubleshooting
Guide.
Effects of hard drive failure
When a hard drive fails, all logical drives that are in the same array are affected. Each logical drive in
an array may be using a different fault-tolerance method, so each logical drive can be affected differently.
•
RAID 0 con
fi
gurations cannot tolerate hard drive failure. If any physical hard drive in the array
fails, all non-fault-tolerant (RAID 0) LUNs in the same array also are failed.
•
RAID 1 and RAID 1+0 con
fi
gurations can tolerate multiple hard drive failures, as long as none of
the failed hard drives are mirrored to one another.
•
RAID 5 con
fi
gurations can tolerate one hard drive failure.
maintenance and service guide
99
Summary of Contents for 201723-B21 - HP StorageWorks Modular SAN Array 1000 Hard Drive
Page 8: ...8 ...
Page 12: ...12 About this guide ...
Page 18: ...18 Specifications ...
Page 28: ...28 System components and LEDs ...
Page 58: ...58 LCD panel and message descriptions ...
Page 94: ...94 Customer replaceable components ...
Page 98: ...98 Capacity expansion and extension ...
Page 104: ...104 Hard drive failures and faulted LUNs ...
Page 110: ...110 SCSI hard drive firmware ...