Impending drive failure (waiting for hot spare)

A drive is reporting internal errors that could cause the drive to fail. SANtricity has attempted to automatically copy the drive's contents as a safety precaution, but was unable to as there are currently no eligible standby hot spare drives to spare for the affected drive in the storage array.

CAUTION:
Risk of Data Loss. If a "Degraded Volume" problem is also displayed in the Recovery Guru Summary area, always fix the "Degraded Volume" problem first. Fixing the "Impending Drive Failure" problem before fixing a "Degraded Volume" may result in data loss.
CAUTION:
Electrostatic discharge can damage sensitive components. Always use proper antistatic protection when handling components. Touching the components without using a proper ground may damage the equipment.
Important:
  • The affected volumes are RAID 1,3,5,6. If the drive fails, you may lose redundancy.

  • Because hot spares will automatically spare for any similar failed or missing drives in the storage array, it is recommended that all failed or missing drives be replaced prior to following this recovery procedure.

  • Because the drive has not failed, its fault indicator light will not be turned on.

  • You can replace the drive while the volumes associated to the affected drive are receiving I/O.

Recovery Steps

  1. If... Then...
    You want to assign a hot spare to copy the drive's content to. Go to step 2.
    You want to replace the drive, but are unable to assign a hot spare due to the storage array's configuration. Recovering a Volume Group Drive without a Hot Spare
    You want to take no further action and wait for the affected drive to fail. You can then complete the recovery steps to fix the problem when it is reported. You need to do nothing more in this procedure.
  2. Check the status of the volumes associated with the affected drive. To determine the associated volumes, on the Hardware tab in the Array Management Window (AMW), highlight the affected drive and view the "Associated volume group" or "Associated disk pool" in the Properties pane. Next, on the Storage and Copy Services tab, view the status of the identified disk pool or volume group and its volumes.

    • All volumes on the Storage and Copy Services tab in the AMW should be Optimal before continuing with this procedure.

    • If any volumes in the disk pool or volume group currently show Operation in Progress , wait for all volumes to change to Optimal before continuing.

    • If the volumes change from Operation in Progress to any status other than Optimal, click the Recheck button to rerun the Recovery Guru and fix the failures reported.

  3. Assign an unassigned drive that is similar in media and interface type, and that has a capacity equal or larger than the affected drive as a hot spare. Information on assigning a hot Spare can be found in the "Assigning and Unassigning Hot Spare Drives" help topic. If you are unable to assign a hot spare due to your storage array's configuration go to Recovering a Volume Group Drive without a Hot Spare.

  4. Click the Recheck button to rerun the Recovery Guru. The failure should no longer appear in the Summary area. If the failure appears again, contact your Technical Support Representative.

Recovering a Volume Group Drive without a Hot Spare

Use the following procedure if all of the following conditions apply:

  • The affected drive is a member of a RAID 1, 3, 5, or 6 volume group

  • There are no standby hot spares in the storage array

  • There are no unassigned drives that can be assigned as a hot spare for the affected drive

Note: Whenever possible it is recommended to copy the drive's contents to a hot spare before failing the drive, as it greatly reduces the possibility of data loss.
  1. Highlight the affected drive on the Hardware tab in the AMW and select the Hardware > Drive > Advanced > Fail menu option. This will open the Confirm Fail Drive dialog.

  2. Making sure that the Copy contents of drive before failing checkbox is unchecked, type 'yes' into the text field and press OK. The associated volumes will become Degraded.

  3. Remove the failed drive (its fault indicator light should be on).

  4. Wait 30 seconds, then insert the new drive. Its fault indicator light may be lit for a short time (one minute or less).

  5. Click the Recheck button to rerun the Recovery Guru. The failure should no longer appear in the Summary area. If the failure appears again, contact your Technical Support Representative.