Seagate ST400FX0002 Pulsar XT.2 SAS Product Manual - Page 26

Predictive failures, Thermal monitor, Table 7, Temperature Log Dh, Parameter Code, Description

Page 26 highlights

Error rate is the number of errors per operation. The algorithm that S.M.A.R.T. uses to record rates of error is to set thresholds for the number of errors and appropriate interval. If the number of errors exceeds the threshold before the interval expires, the error rate is considered to be unacceptable. If the number of errors does not exceed the threshold before the interval expires, the error rate is considered to be acceptable. In either case, the interval and failure counters are reset and the process starts over. Predictive failures S.M.A.R.T. signals predictive failures when the drive is performing unacceptably for a period of time. The firmware keeps a running count of the number of times the error rate for each attribute is unacceptable. To accomplish this, a counter is incremented each time the error rate is unacceptable and decremented (not to exceed zero) whenever the error rate is acceptable. If the counter continually increments such that it reaches the predictive threshold, a predictive failure is signaled. This counter is referred to as the Failure History Counter. There is a separate Failure History Counter for each attribute. 6.3.5 Thermal monitor Pulsar XT.2 drives implement a temperature warning system which: 1. Signals the host if the temperature exceeds a value which would threaten the drive. 2. Signals the host if the temperature exceeds a user-specified value. (i.e., the reference temperature value) 3. Saves a S.M.A.R.T. data frame on the drive which exceeds the threatening temperature value. A temperature sensor monitors the drive temperature and issues a warning over the interface when the temperature exceeds a set threshold. The temperature is measured at power-up and then at ten-minute intervals after power-up. The thermal monitor system generates a warning code of 01-0B01 when the temperature exceeds the specified limit in compliance with the SCSI standard. The drive temperature is reported in the FRU code field of MODE SENSE data. Administrators can use this information to determine if the warning is due to the temperature exceeding the drive threatening temperature or the user-specified temperature. This feature is controlled by the Enable Warning (EWasc) bit, and the reporting mechanism is controlled by the Method of Reporting Informational Exceptions field (MRIE) on the Informational Exceptions Control (IEC) mode page (1Ch). The current algorithm implements two temperature trip points. The first trip point is set at the maximum temperature limit according to the drive specification. The second trip point is user-selectable using the LOG SELECT command. The reference temperature parameter in the temperature log page (see Table 7) can be used to set this trip point. The default value for this drive is listed in the table, however, applications can set it to any value in the range defined. If a temperature is specified that is greater than the maximum allowed in this field, the temperature is rounded down to the maximum allowed. A sense code is sent to the host to indicate the rounding of the parameter field. Table 7: Temperature Log Page (0Dh) Parameter Code 0000h 0001h Description Primary Temperature Reference Temperature Default Setting Changeable Range 400GB 200/100GB Drive Temperature 70°C 65°C 0 to 70°C 0 to 65°C 18 Pulsar XT.2 SAS Product Manual, Rev. B

  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • 17
  • 18
  • 19
  • 20
  • 21
  • 22
  • 23
  • 24
  • 25
  • 26
  • 27
  • 28
  • 29
  • 30
  • 31
  • 32
  • 33
  • 34
  • 35
  • 36
  • 37
  • 38
  • 39
  • 40
  • 41
  • 42
  • 43
  • 44
  • 45
  • 46
  • 47
  • 48
  • 49
  • 50
  • 51
  • 52
  • 53
  • 54
  • 55
  • 56
  • 57
  • 58
  • 59
  • 60
  • 61
  • 62
  • 63
  • 64
  • 65
  • 66
  • 67
  • 68
  • 69
  • 70
  • 71
  • 72
  • 73
  • 74

18
Pulsar XT.2 SAS Product Manual, Rev. B
Error rate is the number of errors per operation. The algorithm that S.M.A.R.T. uses to record rates of error is to
set thresholds for the number of errors and appropriate interval. If the number of errors exceeds the threshold
before the interval expires, the error rate is considered to be unacceptable. If the number of errors does not
exceed the threshold before the interval expires, the error rate is considered to be acceptable. In either case,
the interval and failure counters are reset and the process starts over.
Predictive failures
S.M.A.R.T. signals predictive failures when the drive is performing unacceptably for a period of time. The firm-
ware keeps a running count of the number of times the error rate for each attribute is unacceptable. To accom-
plish this, a counter is incremented each time the error rate is unacceptable and decremented (not to exceed
zero) whenever the error rate is acceptable. If the counter continually increments such that it reaches the pre-
dictive threshold, a predictive failure is signaled. This counter is referred to as the Failure History Counter.
There is a separate Failure History Counter for each attribute.
6.3.5
Thermal monitor
Pulsar XT.2 drives implement a temperature warning system which:
1.
Signals the host if the temperature exceeds a value which would threaten the drive.
2.
Signals the host if the temperature exceeds a user-specified value. (i.e., the reference temperature value)
3.
Saves a S.M.A.R.T. data frame on the drive which exceeds the threatening temperature value.
A temperature sensor monitors the drive temperature and issues a warning over the interface when the tem-
perature exceeds a set threshold. The temperature is measured at power-up and then at ten-minute intervals
after power-up.
The thermal monitor system generates a warning code of 01-0B01 when the temperature exceeds the speci-
fied limit in compliance with the SCSI standard. The drive temperature is reported in the FRU code field of
MODE SENSE data. Administrators can use this information to determine if the warning is due to the tempera-
ture exceeding the drive threatening temperature or the user-specified temperature.
This feature is controlled by the Enable Warning (EWasc) bit, and the reporting mechanism is controlled by the
Method of Reporting Informational Exceptions field (MRIE) on the Informational Exceptions Control (IEC)
mode page (1Ch).
The current algorithm implements two temperature trip points. The first trip point is set at the maximum temper-
ature limit according to the drive specification. The second trip point is user-selectable using the LOG SELECT
command. The reference temperature parameter in the temperature log page (see Table 7) can be used to set
this trip point. The default value for this drive is listed in the table, however, applications can set it to any value
in the range defined. If a temperature is specified that is greater than the maximum allowed in this field, the
temperature is rounded down to the maximum allowed. A sense code is sent to the host to indicate the round-
ing of the parameter field.
Table 7:
Temperature Log Page (0Dh)
Parameter Code
Description
400GB
200/100GB
0000h
Primary Temperature
Drive Temperature
0001h
Reference Temperature
Default Setting
70
°
C
65
°
C
Changeable Range
0 to 70
°
C
0 to 65
°
C