IBM 86655RY Hardware Maintenance Manual - Page 214
Miscellaneous programs, Using ServeRAID Controllers to avoid data loss, Drive failures, How They Occur
View all IBM 86655RY manuals
Add to My Manuals
Save this manual to your list of manuals |
Page 214 highlights
Miscellaneous programs: The IPSSEND and IPSMON programs are advanced command-line programs that can be used to manage the ServeRAID controllers. You can use the IPSSEND program to view the configuration of a ServeRAID controller, rebuild a defunct drive, and perform other functions. You can use the ISPMON program to monitor a ServeRAID controller for defunct drives, predictive failure analysis (PFA) warnings, rebuild operators synchronizations, and logical drive migration. See the README files for installation instructions. Using ServeRAID Controllers to avoid data loss: RAID-5 and RAID-1 technology provides the ability to continue operation after the failure of a hard drive and the ability to rebuild the lost data onto a replacement drive. In conjunction with the bad sector remapping capabilities of the hard drives, RAID-5 and RAID-1 can also help recreate data lost due to sector media corruption. Defective sectors on hard drives are not uncommon. Data scrubbing helps you detect and correct these errors before they become a problem. If the ServeRAID Array is not properly set up and/or maintained, a significant risk of data loss grows with the passage of time. This manual examines how to avoid data loss wherever possible. Drive failures: Three types of drive failures can typically occur in a RAID-5 or RAID1 subsystem that may endanger the protection of stored data: • "Catastrophic drive failures" • "Grown sector media errors" • "Combination failures" on page 205 Catastrophic drive failures: How They Occur Catastrophic drive failures occur when all data on a drive, including the ECC data written on the drive to protect information, is completely inaccessible due to mechanical or electrical problems. Grown sector media errors: How They Occur Grown sector media errors occur due to the following: • Latent imperfections on the disk • Media damage due to mishandling of the disk • Harsh environments The drive itself can often repair these errors by recalculating lost data from Error Correction Code (ECC) information stored within each data sector on the drive. The drive then remaps this damaged sector to an unused area of the drive to prevent data loss. Note: Sector media errors, which affect only a small area of the surface of the drive, may not be detected in seldom used files or in non-data areas of the disk. These errors are only identified and corrected if a read or write request is made to data stored within that location. Data scrubbing forces all sectors in the logical drive to be accessed so that sector media errors are detected by the drive. Once detected, the drive's error recovery procedures are launched to repair these errors by recalculating the lost data from the ECC information described above. If the ECC information is not sufficient to recalculate the lost data, the information may still be recovered if the drive is part of a RAID-5 or RAID-1 array. RAID-5 and RAID-1 arrays can provide their own redundant information (similar to the ECC data written on the drive itself), which is stored on other drives in the array. The ServeRAID controller can recalculate the lost data and remap the bad sector. Note: 204 Hardware Maintenance Manual: Netfinity 7600 - Type 8665 Models 1RY, 2RY