Seagate ST300MM0026 Savvio 10K.1 SAS Product Manual - Page 45

Defect and error management

Page 45 highlights

7.0 Defect and error management The drive, as delivered, complies with this product manual. The read error rates and specified storage capacities are not dependent upon use of defect management routines by the host (initiator). Defect and error management in the SCSI protocol involves the drive internal defect/error management and SAS system error considerations (errors in communications between the initiator and the drive). Tools for use in designing a defect/error management plan are briefly outlined in this section. References to other sections are provided when necessary. 7.1 Drive internal defects/errors During the initial drive format operation at the factory, media defects are identified, tagged as being unusable, and their locations recorded on the drive primary defects list (referred to as the "P' list and also as the ETF defect list). At factory format time, these known defects are also reallocated, that is, reassigned to a new place on the medium and the location listed in the defects reallocation table. The "P" list is not altered after factory formatting. Locations of defects found and reallocated during error recovery procedures after drive shipment are listed in the "G" list (defects growth list). The "P" and "G" lists may be referenced by the initiator using the Read Defect Data command. Details of the SCSI commands supported by the drive are described in the SAS Interface Manual. Also, more information on the drive Error Recovery philosophy is presented in the SAS Interface Manual. 7.2 Drive error recovery procedures When an error occurs during drive operation, the drive, if programmed to do so, performs error recovery procedures to attempt to recover the data. The error recovery procedures used depend on the options previously set in the Error Recovery Parameters mode page. Error recovery and defect management may involve using several SCSI commands described in the SAS Interface Manual. The drive implements selectable error recovery time limits required in video applications. The error recovery scheme supported by the drive provides a way to control the total error recovery time for the entire command in addition to controlling the recovery level for a single LBA. The total amount of time spent in error recovery for a command can be limited using the Recovery Time Limit bytes in the Error Recovery mode page. The total amount of time spent in error recovery for a single LBA can be limited using the Read Retry Count or Write Retry Count bytes in the Error Recovery mode page. The drive firmware error recovery algorithms consists of 11 levels for read recoveries and five levels for write. Each level may consist of multiple steps, where a step is defined as a recovery function involving a single reread or re-write attempt. The maximum level used by the drive in LBA recovery is determined by the read and write retry counts. Table 14 equates the read and write retry count with the maximum possible recovery time for read and write recovery of individual LBAs. The times given do not include time taken to perform reallocations. Reallocations are performed when the ARRE bit (for reads) or AWRE bit (for writes) is one, the RC bit is zero, and the recovery time limit for the command has not yet been met. Time needed to perform reallocation is not counted against the recovery time limit. Savvio SAS Product Manual, Rev. D 39

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86

Savvio SAS Product Manual, Rev. D
39
7.0
Defect and error management
The drive, as delivered, complies with this product manual. The read error rates and specified storage capaci-
ties are not dependent upon use of defect management routines by the host (initiator).
Defect and error management in the SCSI protocol involves the drive internal defect/error management and
SAS system error considerations (errors in communications between the initiator and the drive). Tools for use
in designing a defect/error management plan are briefly outlined in this section. References to other sections
are provided when necessary.
7.1
Drive internal defects/errors
During the initial drive format operation at the factory, media defects are identified, tagged as being unusable,
and their locations recorded on the drive primary defects list (referred to as the “P’ list and also as the ETF
defect list). At factory format time, these known defects are also reallocated, that is, reassigned to a new place
on the medium and the location listed in the defects reallocation table. The “P” list is not altered after factory
formatting. Locations of defects found and reallocated during error recovery procedures after drive shipment
are listed in the “G” list (defects growth list). The “P” and “G” lists may be referenced by the initiator using the
Read Defect Data command.
Details of the SCSI commands supported by the drive are described in the
SAS Interface Manual
. Also, more
information on the drive Error Recovery philosophy is presented in the
SAS Interface Manual
.
7.2
Drive error recovery procedures
When an error occurs during drive operation, the drive, if programmed to do so, performs error recovery proce-
dures to attempt to recover the data. The error recovery procedures used depend on the options previously set
in the Error Recovery Parameters mode page. Error recovery and defect management may involve using sev-
eral SCSI commands described in the
SAS Interface Manual
. The drive implements selectable error recovery
time limits required in video applications.
The error recovery scheme supported by the drive provides a way to control the total error recovery time for the
entire command in addition to controlling the recovery level for a single LBA. The total amount of time spent in
error recovery for a command can be limited using the Recovery Time Limit bytes in the Error Recovery mode
page. The total amount of time spent in error recovery for a single LBA can be limited using the Read Retry
Count or Write Retry Count bytes in the Error Recovery mode page.
The drive firmware error recovery algorithms consists of 11 levels for read recoveries and five levels for write.
Each level may consist of multiple steps, where a step is defined as a recovery function involving a single re-
read or re-write attempt. The maximum level used by the drive in LBA recovery is determined by the read and
write retry counts.
Table 14 equates the read and write retry count with the maximum possible recovery time for read and write
recovery of individual LBAs. The times given do not include time taken to perform reallocations. Reallocations
are performed when the ARRE bit (for reads) or AWRE bit (for writes) is one, the RC bit is zero, and the recov-
ery time limit for the command has not yet been met. Time needed to perform reallocation is not counted
against the recovery time limit.