Failed SMART usage Attribute: 10 Spin_Retry_Count

Status
Not open for further replies.

sdgenxr

Contributor
Joined
Sep 4, 2014
Messages
131
After I noticed a brown out in my area I immediately unplugged my NAS. Upon rebooting I get the SMART error 10 Spin Retry Count on two of the four drives on that controller. However, those two particular drives are the only ones running on the 3.0 Gb/s ports with the remaining two running on the 6.0 Gb/s ports of the same controller. I've replaced the cables with no effect.

Would it be possible to place those two drives on a different SATA port with a different controller, or should I RMA them? Considering that I'm getting these errors on two of my four drives, how will this affect my data running a 3+1 configuration?
 
Joined
Oct 2, 2014
Messages
925
You don't have a UPS. I would invest in one ASAP; even if you never suffer a brownout or blackout again, at least you'll have it, and it will condition the power for FreeNAS.

As for the SMART stuff, do you have any more SATA 6 ports? If not, and all you have are those iffy SATA 3 ports, I would swap the drives on the SATA 6 ports with the ones on the SATA 3 ports and see if they still error. If they still error, you know it's the drives. If they don't, and the drives that were on the SATA 6 ports start to error instead, then it's the 3 Gb/s controller.

Please note this may not be the best move; it's just what I would do. Another member will probably weigh in.
 

Bidule0hm

Server Electronics Sorcerer
Joined
Aug 5, 2013
Messages
3,710
It's not a big problem to have a few spin retries, but they are clearly due to the brownouts and the like, and you really should invest in a UPS as soon as possible. SATA cables have nothing to do with spin retries.
 

sdgenxr

Contributor
Joined
Sep 4, 2014
Messages
131
Yes, a UPS is high on my purchase list. Should I just ignore the spin retries, plug those drives into different ports or RMA them?
 

Bidule0hm

Server Electronics Sorcerer
Joined
Aug 5, 2013
Messages
3,710
How many retries do they have?
 

Robert Trevellyan

Pony Wrangler
Joined
May 16, 2014
Messages
3,778
Just noticed your question.
"Considering that I'm getting these errors on two of my four drives, how will this affect my data running a 3+1 configuration?"
If you mean you're running RAIDZ1, then losing two drives would wipe out all your data.
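To make the arithmetic explicit, here is a minimal Python sketch, assuming the "3+1" layout really is a single four-drive RAIDZ1 vdev (one drive's worth of parity). The function name `pool_survives` is made up for illustration and is not part of any ZFS tool.

```python
def pool_survives(parity_drives: int, failed_drives: int) -> bool:
    """A RAIDZ vdev stays readable while failures do not exceed its parity level."""
    return failed_drives <= parity_drives


# Assumption: 3+1 means RAIDZ1, i.e. parity level 1 across four disks.
RAIDZ1_PARITY = 1

for failed in range(3):
    status = "data intact" if pool_survives(RAIDZ1_PARITY, failed) else "pool lost"
    print(f"{failed} failed drive(s): {status}")
```

With one failed drive the pool keeps running in a degraded state; a second failure before the first drive is replaced and resilvered is what loses the data.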
 

sdgenxr

Contributor
Joined
Sep 4, 2014
Messages
131
Ok, now I'm only getting the error on one drive. How do I check how many retries there are or have been?
 

Bidule0hm

Server Electronics Sorcerer
Joined
Aug 5, 2013
Messages
3,710
Run smartctl -a /dev/adaX, where X is the drive number (for example, ada0 for the first drive). The device name may be daX instead of adaX.
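If you want to pull one attribute out of that output for several drives, a rough Python sketch could look like the following. It assumes the ten-column attribute table layout that smartctl prints; `parse_attributes` and `read_drive` are hypothetical helper names, and `read_drive` needs root plus smartmontools installed.

```python
import subprocess


def parse_attributes(smartctl_output: str) -> dict:
    """Map attribute ID -> selected fields from the SMART attribute table."""
    attrs = {}
    for line in smartctl_output.splitlines():
        parts = line.split(None, 9)
        # Attribute rows have ten columns: a numeric ID first, a 0x... flag third.
        if len(parts) == 10 and parts[0].isdigit() and parts[2].startswith("0x"):
            attrs[int(parts[0])] = {
                "name": parts[1],
                "value": int(parts[3]),
                "worst": int(parts[4]),
                "thresh": int(parts[5]),
                "when_failed": parts[8],
                "raw": parts[9].strip(),
            }
    return attrs


def read_drive(device: str) -> dict:
    """Run smartctl -a on one device and parse its attribute table."""
    result = subprocess.run(["smartctl", "-a", device], capture_output=True, text=True)
    return parse_attributes(result.stdout)
```

For example, `read_drive("/dev/ada1").get(10)` would return the Spin_Retry_Count row for ada1, or None if the drive does not report that attribute.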
 

sdgenxr

Contributor
Joined
Sep 4, 2014
Messages
131
I'm getting the error on ada1 right now but I first received the error on ada0 and ada3. So I'm including the reports for all drives to review.

ada0
Code:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate  0x000b  100  100  016  Pre-fail  Always  -  0
2 Throughput_Performance  0x0005  133  133  054  Pre-fail  Offline  -  104
3 Spin_Up_Time  0x0007  135  135  024  Pre-fail  Always  -  447 (Average 589)
4 Start_Stop_Count  0x0012  100  100  000  Old_age  Always  -  79
5 Reallocated_Sector_Ct  0x0033  100  100  005  Pre-fail  Always  -  0
7 Seek_Error_Rate  0x000b  100  100  067  Pre-fail  Always  -  0
8 Seek_Time_Performance  0x0005  115  115  020  Pre-fail  Offline  -  41
9 Power_On_Hours  0x0012  099  099  000  Old_age  Always  -  7341
10 Spin_Retry_Count  0x0013  070  070  060  Pre-fail  Always  -  393216
12 Power_Cycle_Count  0x0032  100  100  000  Old_age  Always  -  79
192 Power-Off_Retract_Count 0x0032  100  100  000  Old_age  Always  -  244
193 Load_Cycle_Count  0x0012  100  100  000  Old_age  Always  -  244
194 Temperature_Celsius  0x0002  157  157  000  Old_age  Always  -  38 (Min/Max 20/43)
196 Reallocated_Event_Count 0x0032  100  100  000  Old_age  Always  -  0
197 Current_Pending_Sector  0x0022  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0008  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x000a  200  200  000  Old_age  Always  -  0

ada1
Code:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000b  100  100  016  Pre-fail  Always  -  0
  2 Throughput_Performance  0x0005  135  135  054  Pre-fail  Offline  -  98
  3 Spin_Up_Time  0x0007  151  151  024  Pre-fail  Always  -  446 (Average 478)
  4 Start_Stop_Count  0x0012  100  100  000  Old_age  Always  -  83
  5 Reallocated_Sector_Ct  0x0033  100  100  005  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x000b  100  100  067  Pre-fail  Always  -  0
  8 Seek_Time_Performance  0x0005  115  115  020  Pre-fail  Offline  -  41
  9 Power_On_Hours  0x0012  099  099  000  Old_age  Always  -  7340
10 Spin_Retry_Count  0x0013  060  060  060  Pre-fail  Always  FAILING_NOW 524288
12 Power_Cycle_Count  0x0032  100  100  000  Old_age  Always  -  83
192 Power-Off_Retract_Count 0x0032  100  100  000  Old_age  Always  -  250
193 Load_Cycle_Count  0x0012  100  100  000  Old_age  Always  -  250
194 Temperature_Celsius  0x0002  157  157  000  Old_age  Always  -  38 (Min/Max 20/41)
196 Reallocated_Event_Count 0x0032  100  100  000  Old_age  Always  -  0
197 Current_Pending_Sector  0x0022  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0008  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x000a  200  200  000  Old_age  Always  -  0

ada2
Code:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000b  100  100  016  Pre-fail  Always  -  0
  2 Throughput_Performance  0x0005  134  134  054  Pre-fail  Offline  -  100
  3 Spin_Up_Time  0x0007  145  145  024  Pre-fail  Always  -  475 (Average 488)
  4 Start_Stop_Count  0x0012  100  100  000  Old_age  Always  -  81
  5 Reallocated_Sector_Ct  0x0033  100  100  005  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x000b  100  100  067  Pre-fail  Always  -  0
  8 Seek_Time_Performance  0x0005  117  117  020  Pre-fail  Offline  -  40
  9 Power_On_Hours  0x0012  099  099  000  Old_age  Always  -  7341
10 Spin_Retry_Count  0x0013  090  090  060  Pre-fail  Always  -  131072
12 Power_Cycle_Count  0x0032  100  100  000  Old_age  Always  -  81
192 Power-Off_Retract_Count 0x0032  100  100  000  Old_age  Always  -  309
193 Load_Cycle_Count  0x0012  100  100  000  Old_age  Always  -  309
194 Temperature_Celsius  0x0002  157  157  000  Old_age  Always  -  38 (Min/Max 20/42)
196 Reallocated_Event_Count 0x0032  100  100  000  Old_age  Always  -  0
197 Current_Pending_Sector  0x0022  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0008  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x000a  200  200  000  Old_age  Always  -  0

ada3
Code:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000b  100  100  016  Pre-fail  Always  -  0
  2 Throughput_Performance  0x0005  134  134  054  Pre-fail  Offline  -  100
  3 Spin_Up_Time  0x0007  167  167  024  Pre-fail  Always  -  450 (Average 384)
  4 Start_Stop_Count  0x0012  100  100  000  Old_age  Always  -  83
  5 Reallocated_Sector_Ct  0x0033  100  100  005  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x000b  100  100  067  Pre-fail  Always  -  0
  8 Seek_Time_Performance  0x0005  109  109  020  Pre-fail  Offline  -  44
  9 Power_On_Hours  0x0012  099  099  000  Old_age  Always  -  7341
10 Spin_Retry_Count  0x0013  075  075  060  Pre-fail  Always  -  327680
12 Power_Cycle_Count  0x0032  100  100  000  Old_age  Always  -  81
192 Power-Off_Retract_Count 0x0032  100  100  000  Old_age  Always  -  319
193 Load_Cycle_Count  0x0012  100  100  000  Old_age  Always  -  319
194 Temperature_Celsius  0x0002  157  157  000  Old_age  Always  -  38 (Min/Max 20/44)
196 Reallocated_Event_Count 0x0032  100  100  000  Old_age  Always  -  0
197 Current_Pending_Sector  0x0022  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0008  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x000a  200  200  000  Old_age  Always  -  0
 

Robert Trevellyan

Pony Wrangler
Joined
May 16, 2014
Messages
3,778
10 Spin_Retry_Count 0x0013 060 060 060 Pre-fail Always FAILING_NOW 524288
I suggest you replace ada1 now and keep a close eye on the rest.
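For context on why only ada1 is flagged: SMART marks an attribute as failed once its normalized VALUE drops to or below THRESH. A small Python sketch using the attribute 10 numbers posted above (the helper name `has_failed` is just for illustration):

```python
def has_failed(value: int, thresh: int) -> bool:
    """SMART convention: an attribute fails once VALUE is at or below THRESH."""
    return value <= thresh


# (VALUE, THRESH) for attribute 10 from the first set of reports in this thread.
spin_retry = {
    "ada0": (70, 60),
    "ada1": (60, 60),
    "ada2": (90, 60),
    "ada3": (75, 60),
}

for drive, (value, thresh) in spin_retry.items():
    if has_failed(value, thresh):
        print(f"{drive}: FAILING (value {value} <= thresh {thresh})")
    else:
        print(f"{drive}: ok, margin {value - thresh}")
```

ada0 and ada3 are not failing yet, but their margins (10 and 15) are small enough to justify watching them.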

By the way, your drive temps have been a bit high at some point (anything over 40 °C is worth looking into).
 

sdgenxr

Contributor
Joined
Sep 4, 2014
Messages
131
OK, after a few restarts, I'm no longer receiving any SMART errors on any drives. Is this common and should I still seek to RMA ada1?
 

Bidule0hm

Server Electronics Sorcerer
Joined
Aug 5, 2013
Messages
3,710
Wow, these numbers are very high. What PSU do you use?
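One possible reading of those raw numbers: every Spin_Retry_Count raw value posted above is an exact multiple of 65536, so the field may be packed, with a small counter in the upper 16-bit word, rather than a literal count of hundreds of thousands of retries. That layout is an assumption and is not confirmed anywhere in this thread. The Python sketch below only splits the posted numbers and compares them with the normalized VALUE column; `split_words` is a made-up helper name.

```python
def split_words(raw: int) -> tuple:
    """Split a raw SMART value into (high 16-bit word, low 16-bit word)."""
    return raw >> 16, raw & 0xFFFF


# (RAW_VALUE, normalized VALUE) for attribute 10 from the first set of reports.
posted = {
    "ada0": (393216, 70),
    "ada1": (524288, 60),
    "ada2": (131072, 90),
    "ada3": (327680, 75),
}

for drive, (raw, value) in posted.items():
    high, low = split_words(raw)
    # Observation only: in these four rows VALUE equals 100 - 5 * high.
    print(drive, high, low, 100 - 5 * high == value)
```

For these four rows the high words come out as 6, 8, 2 and 5, the low words are all 0, and the normalized value matches 100 - 5 * high each time. Whether the drive firmware actually counts retries that way is a guess.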

And yep, these drives run a bit too hot, you should solve this problem too.
 

sdgenxr

Contributor
Joined
Sep 4, 2014
Messages
131
I'm using an Antec EA-380D Green 380W ATX12V v2.3 power supply currently. New fans are on the way to help keep the temps cool as well. I had no clue that drive temps were getting a little high.

Here are the latest SMART readings that don't show any errors. Should I still go ahead and remove and replace ada1? If so, can I keep the system running without that drive while a new one is on order?
ada0
Code:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000b  100  100  016  Pre-fail  Always  -  0
  2 Throughput_Performance  0x0005  133  133  054  Pre-fail  Offline  -  104
  3 Spin_Up_Time  0x0007  152  152  024  Pre-fail  Always  -  469 (Average 447)
  4 Start_Stop_Count  0x0012  100  100  000  Old_age  Always  -  82
  5 Reallocated_Sector_Ct  0x0033  100  100  005  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x000b  100  100  067  Pre-fail  Always  -  0
  8 Seek_Time_Performance  0x0005  115  115  020  Pre-fail  Offline  -  41
  9 Power_On_Hours  0x0012  099  099  000  Old_age  Always  -  7363
 10 Spin_Retry_Count  0x0013  100  100  060  Pre-fail  Always  -  0
 12 Power_Cycle_Count  0x0032  100  100  000  Old_age  Always  -  82
192 Power-Off_Retract_Count 0x0032  100  100  000  Old_age  Always  -  248
193 Load_Cycle_Count  0x0012  100  100  000  Old_age  Always  -  248
194 Temperature_Celsius  0x0002  166  166  000  Old_age  Always  -  36 (Min/Max 20/43)
196 Reallocated_Event_Count 0x0032  100  100  000  Old_age  Always  -  0
197 Current_Pending_Sector  0x0022  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0008  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x000a  200  200  000  Old_age  Always  -  0

ada1
Code:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000b  100  100  016  Pre-fail  Always  -  0
  2 Throughput_Performance  0x0005  134  134  054  Pre-fail  Offline  -  100
  3 Spin_Up_Time  0x0007  161  161  024  Pre-fail  Always  -  419 (Average 446)
  4 Start_Stop_Count  0x0012  100  100  000  Old_age  Always  -  86
  5 Reallocated_Sector_Ct  0x0033  100  100  005  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x000b  100  100  067  Pre-fail  Always  -  0
  8 Seek_Time_Performance  0x0005  113  113  020  Pre-fail  Offline  -  42
  9 Power_On_Hours  0x0012  099  099  000  Old_age  Always  -  7362
 10 Spin_Retry_Count  0x0013  100  100  060  Pre-fail  Always  -  0
 12 Power_Cycle_Count  0x0032  100  100  000  Old_age  Always  -  86
192 Power-Off_Retract_Count 0x0032  100  100  000  Old_age  Always  -  254
193 Load_Cycle_Count  0x0012  100  100  000  Old_age  Always  -  254
194 Temperature_Celsius  0x0002  171  171  000  Old_age  Always  -  35 (Min/Max 20/41)
196 Reallocated_Event_Count 0x0032  100  100  000  Old_age  Always  -  0
197 Current_Pending_Sector  0x0022  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0008  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x000a  200  200  000  Old_age  Always  -  0

ada2
Code:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000b  100  100  016  Pre-fail  Always  -  0
  2 Throughput_Performance  0x0005  134  134  054  Pre-fail  Offline  -  100
  3 Spin_Up_Time  0x0007  145  145  024  Pre-fail  Always  -  475 (Average 488)
  4 Start_Stop_Count  0x0012  100  100  000  Old_age  Always  -  84
  5 Reallocated_Sector_Ct  0x0033  100  100  005  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x000b  100  100  067  Pre-fail  Always  -  0
  8 Seek_Time_Performance  0x0005  117  117  020  Pre-fail  Offline  -  40
  9 Power_On_Hours  0x0012  099  099  000  Old_age  Always  -  7363
 10 Spin_Retry_Count  0x0013  090  090  060  Pre-fail  Always  -  131072
 12 Power_Cycle_Count  0x0032  100  100  000  Old_age  Always  -  84
192 Power-Off_Retract_Count 0x0032  100  100  000  Old_age  Always  -  313
193 Load_Cycle_Count  0x0012  100  100  000  Old_age  Always  -  313
194 Temperature_Celsius  0x0002  171  171  000  Old_age  Always  -  35 (Min/Max 20/42)
196 Reallocated_Event_Count 0x0032  100  100  000  Old_age  Always  -  0
197 Current_Pending_Sector  0x0022  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0008  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x000a  200  200  000  Old_age  Always  -  0

ada3
Code:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000b  100  100  016  Pre-fail  Always  -  0
  2 Throughput_Performance  0x0005  134  134  054  Pre-fail  Offline  -  100
  3 Spin_Up_Time  0x0007  160  160  024  Pre-fail  Always  -  422 (Average 450)
  4 Start_Stop_Count  0x0012  100  100  000  Old_age  Always  -  86
  5 Reallocated_Sector_Ct  0x0033  100  100  005  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x000b  100  100  067  Pre-fail  Always  -  0
  8 Seek_Time_Performance  0x0005  111  111  020  Pre-fail  Offline  -  43
  9 Power_On_Hours  0x0012  099  099  000  Old_age  Always  -  7363
 10 Spin_Retry_Count  0x0013  100  100  060  Pre-fail  Always  -  0
 12 Power_Cycle_Count  0x0032  100  100  000  Old_age  Always  -  84
192 Power-Off_Retract_Count 0x0032  100  100  000  Old_age  Always  -  323
193 Load_Cycle_Count  0x0012  100  100  000  Old_age  Always  -  323
194 Temperature_Celsius  0x0002  166  166  000  Old_age  Always  -  36 (Min/Max 20/44)
196 Reallocated_Event_Count 0x0032  100  100  000  Old_age  Always  -  0
197 Current_Pending_Sector  0x0022  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0008  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x000a  200  200  000  Old_age  Always  -  0
 

Bidule0hm

Server Electronics Sorcerer
Joined
Aug 5, 2013
Messages
3,710
Ideally you want to keep them in the low 30's °C and not going above 40 °C at any moment ;)
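To check the posted readings against that 40 °C ceiling, here is a short Python sketch. It assumes the raw temperature string has the "current (Min/Max low/high)" shape shown in the reports in this thread; `temps` is a hypothetical helper name.

```python
import re

LIMIT_C = 40  # ceiling suggested in this thread


def temps(raw: str) -> tuple:
    """Return (current, minimum, maximum) from a raw value like '38 (Min/Max 20/43)'."""
    match = re.match(r"(\d+) \(Min/Max (\d+)/(\d+)\)", raw)
    if match is None:
        raise ValueError(f"unexpected temperature format: {raw!r}")
    return tuple(int(group) for group in match.groups())


# Temperature_Celsius raw values from the latest set of reports.
latest = {
    "ada0": "36 (Min/Max 20/43)",
    "ada1": "35 (Min/Max 20/41)",
    "ada2": "35 (Min/Max 20/42)",
    "ada3": "36 (Min/Max 20/44)",
}

for drive, raw in latest.items():
    current, low, high = temps(raw)
    note = "has exceeded limit" if high > LIMIT_C else "within limit"
    print(f"{drive}: now {current} C, lifetime max {high} C, {note}")
```

All four drives currently read 35 to 36 °C, but each has a recorded maximum above 40 °C, which matches the advice to improve cooling.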

Wait, why do all drives (except ada2) now show 0 for the spin retry count? And why has the worst value been reset too?
 

sdgenxr

Contributor
Joined
Sep 4, 2014
Messages
131
I'm still trying to figure out why the drives are not reporting any errors now. Should I be concerned and replace one or more of them still?
 

Bidule0hm

Server Electronics Sorcerer
Joined
Aug 5, 2013
Messages
3,710
I just don't know, it's very weird. Does anybody have any idea?
 

sdgenxr

Contributor
Joined
Sep 4, 2014
Messages
131
I still have no idea as to what was going on. However, I've run the HGST WinDFT software on all the drives and did not receive any errors. Oh well, let's hope that I don't have any further issues.

Thanks for all the help everyone!
 