Seagate ST336607LC ST3146807LC Model Product Manual PDF - Page 26

Cheetah 10K.6 SCSI Product Manual, Rev. D, Performance impact, Reporting control, Determining rate, - firmware

Page 26 highlights

16 Cheetah 10K.6 SCSI Product Manual, Rev. D The process of measuring off-line attributes and saving data can be forced by the Rezero Unit command. Forcing S.M.A.R.T. will reset the timer so that the next scheduled interrupt will be two hours. The drive can be interrogated by the host to determine the time remaining before the next scheduled measurement and data logging process will occur. This is accomplished by a log sense command to log page 0x3E. The purpose is to allow the customer to control when S.M.A.R.T. interruptions occur. As described above, forcing S.M.A.R.T by the Rezero Unit command will reset the timer. Performance impact S.M.A.R.T. attribute data will be saved to the disc for the purpose of recreating the events that caused a predictive failure. The drive will measure and save parameters once every two hours subject to an idle period on the SCSI bus. The process of measuring off-line attribute data and saving data to the disc is uninterruptable and the maximum delay is summarized below: Maximum processing delay S.M.A.R.T. delay times On-line only delay DEXCPT = 0, PERF = 1 60 milliseconds Fully enabled delay DEXCPT = 0, PERF = 0 370 milliseconds Reporting control Reporting is controlled in the Informational Exceptions Control Page (1Ch). Subject to the reporting method, the firmware will issue a 01-5D00 sense code to the host. The error code is preserved through bus resets and power cycles. Determining rate S.M.A.R.T. monitors the rate at which errors occur and signals a predictive failure if the rate of degraded error rate increases to an unacceptable level. To determine rate, error events are logged and compared to the number of total operations for a given attribute. The interval defines the number of operations over which to measure the rate. The counter that keeps track of the current number of operations is referred to as the Interval Counter. S.M.A.R.T. measures error rate, hence for each attribute the occurrence of an error is recorded. A counter keeps track of the number of errors for the current interval. This counter is referred to as the Failure Counter. Error rate is simply the number of errors per operation. The algorithm that S.M.A.R.T. uses to record rates of error is to set thresholds for the number of errors and the interval. If the number of errors exceeds the threshold before the interval expires, then the error rate is considered to be unacceptable. If the number of errors does not exceed the threshold before the interval expires, then the error rate is considered to be acceptable. In either case, the interval and failure counters are reset and the process starts over. Predictive failures S.M.A.R.T. signals predictive failures when the drive is performing unacceptably for a period of time. The firmware keeps a running count of the number of times the error rate for each attribute is unacceptable. To accomplish this, a counter is incremented whenever the error rate is unacceptable and decremented (not to exceed zero) whenever the error rate is acceptable. Should the counter continually be incremented such that it reaches the predictive threshold, a predictive failure is signaled. This counter is referred to as the Failure History Counter. There is a separate Failure History Counter for each attribute. 5.2.8 Thermal monitor Cheetah 10K.6 SCSI drives implement a temperature warning system which: 1. Signals the host if the temperature exceeds a value which would threaten the drive. 2. Signals the host if the temperature exceeds a user-specified value. 3. Saves a S.M.A.R.T. data frame on the drive which exceed the threatening temperature value.

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74
  • 75
  • 76
  • 77
  • 78
  • 79
  • 80
  • 81
  • 82
  • 83
  • 84
  • 85
  • 86
  • 87
  • 88
  • 89
  • 90
  • 91
  • 92
  • 93
  • 94
  • 95
  • 96

16
Cheetah 10K.6 SCSI Product Manual, Rev. D
The process of measuring off-line attributes and saving data can be forced by the Rezero Unit command. Forc-
ing S.M.A.R.T. will reset the timer so that the next scheduled interrupt will be two hours.
The drive can be interrogated by the host to determine the time remaining before the next scheduled measure-
ment and data logging process will occur. This is accomplished by a log sense command to log page 0x3E.
The purpose is to allow the customer to control when S.M.A.R.T. interruptions occur. As described above, forc-
ing S.M.A.R.T by the Rezero Unit command will reset the timer.
Performance impact
S.M.A.R.T. attribute data will be saved to the disc for the purpose of recreating the events that caused a pre-
dictive failure. The drive will measure and save parameters once every two hours subject to an idle period on
the SCSI bus. The process of measuring off-line attribute data and saving data to the disc is uninterruptable
and the maximum delay is summarized below:
Maximum processing delay
On-line only delay
Fully enabled delay
DEXCPT = 0, PERF = 1
DEXCPT = 0, PERF = 0
S.M.A.R.T. delay times
60 milliseconds
370 milliseconds
Reporting control
Reporting is controlled in the Informational Exceptions Control Page (1Ch). Subject to the reporting method,
the firmware will issue a 01-5D00 sense code to the host. The error code is preserved through bus resets and
power cycles.
Determining rate
S.M.A.R.T. monitors the rate at which errors occur and signals a predictive failure if the rate of degraded error
rate increases to an unacceptable level. To determine rate, error events are logged and compared to the num-
ber of total operations for a given attribute. The interval defines the number of operations over which to mea-
sure the rate. The counter that keeps track of the current number of operations is referred to as the Interval
Counter.
S.M.A.R.T. measures error rate, hence for each attribute the occurrence of an error is recorded. A counter
keeps track of the number of errors for the current interval. This counter is referred to as the Failure Counter.
Error rate is simply the number of errors per operation. The algorithm that S.M.A.R.T. uses to record rates of
error is to set thresholds for the number of errors and the interval. If the number of errors exceeds the threshold
before the interval expires, then the error rate is considered to be unacceptable. If the number of errors does
not exceed the threshold before the interval expires, then the error rate is considered to be acceptable. In
either case, the interval and failure counters are reset and the process starts over.
Predictive failures
S.M.A.R.T. signals predictive failures when the drive is performing unacceptably for a period of time. The firm-
ware keeps a running count of the number of times the error rate for each attribute is unacceptable. To accom-
plish this, a counter is incremented whenever the error rate is unacceptable and decremented (not to exceed
zero) whenever the error rate is acceptable. Should the counter continually be incremented such that it
reaches the predictive threshold, a predictive failure is signaled. This counter is referred to as the Failure His-
tory Counter. There is a separate Failure History Counter for each attribute.
5.2.8
Thermal monitor
Cheetah 10K.6 SCSI drives implement a temperature warning system which:
1.
Signals the host if the temperature exceeds a value which would threaten the drive.
2.
Signals the host if the temperature exceeds a user-specified value.
3.
Saves a S.M.A.R.T. data frame on the drive which exceed the threatening temperature value.