Chapter 4. Understanding the medium errors and bad blocks
A storage system returns a medium error response to a hose when it is unable to
successfully read a block. The Storwize V7000 response to a host read follows this
behavior.
The volume virtualization that is provided extends the time when a medium error
is returned to a host. Because of this difference to non-virtualized systems, the
Storwize V7000 uses the term
bad blocks
rather than medium errors.
The Storwize V7000 allocates volumes from the extents that are on the managed
disks (MDisks). The MDisk can be a volume on an external storage controller or a
RAID array that is created from internal drives. In either case, depending on the
RAID level used, there is normally protection against a read error on a single
drive. However, it is still possible to ge a medium error on a read request if
multiple drives have errors or if the drives are rebuilding or are offline due to
other issues.
The Storwize V7000 provides migration facilities to move a volume from one
underlying set of physical storage to another or to replicate a volume that uses
FlashCopy or Metro Mirror or Global Mirror. In all these cases, the migrated
volume or the replicated volume returns a medium error to the host when the
logical block address on the original volume is read. The system maintains tables
of bad blocks to record where the logical block addresses that cannot be read are.
These tables are associated with the MDisks that are providing storage for the
volumes.
The
dumpmdiskbadblocks
command and the
dumpallmdiskbadblocks
command are
available to query the location of bad blocks.
It is possible that the tables that are used to record bad block locations can fill up.
The table can fill either on an MDisk or on the system as a whole. If a table does
fill up, the migration or replication that was creating the bad block fails because it
was not possible to create an exact image of the source volume.
The system creates alerts in the event log for the following situations:
v
When it detects medium errors and creates a bad block
v
When the bad block tables fill up
The following errors are identified:
Table 17. Bad block errors
Error code
Description
1840
The managed disk has bad blocks.
1226
The system has failed to create a bad block
because the MDisk already has the
maximum number of allowed bad blocks.
1225
The system has failed to create a bad block
because the system already has the
maximum number of allowed bad blocks.
© Copyright IBM Corp. 2010, 2011
27
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
||
||
||
||
|
|
||
|
|
|
Summary of Contents for Storwize V7000
Page 1: ...IBM Storwize V7000 Version 6 3 0 Troubleshooting Recovery and Maintenance Guide GC27 2291 02...
Page 6: ...vi Storwize V7000 Troubleshooting Recovery and Maintenance Guide...
Page 8: ...viii Storwize V7000 Troubleshooting Recovery and Maintenance Guide...
Page 10: ...x Storwize V7000 Troubleshooting Recovery and Maintenance Guide...
Page 34: ...18 Storwize V7000 Troubleshooting Recovery and Maintenance Guide...
Page 42: ...26 Storwize V7000 Troubleshooting Recovery and Maintenance Guide...
Page 80: ...64 Storwize V7000 Troubleshooting Recovery and Maintenance Guide...
Page 128: ...112 Storwize V7000 Troubleshooting Recovery and Maintenance Guide...
Page 156: ...140 Storwize V7000 Troubleshooting Recovery and Maintenance Guide...
Page 166: ...150 Storwize V7000 Troubleshooting Recovery and Maintenance Guide...
Page 171: ......
Page 172: ...Printed in USA GC27 2291 02...