Resource icon

Hard Drive Burn-In Testing - Discussion Thread

D

Deleted47050

Guest
I've not read the whole thread but following the OP I have done the smartctl short and conveyance tests
It says they will take x minutes but don;t report back anything to say they have completed or otherwise. Is that normal?

Yes that is normal. You run smartctl with the -t option to run the test and then, if you want to check the output of the test, you run it with the -a option.
 

VladTepes

Patron
Joined
May 18, 2016
Messages
287
I tried smartctl -a /dev/ada1 and it came up with an error re syntax, as it did with any of my other disks too.
It said I needed to use the -d option to specify a device?

Anyway I've started the long tests now so when I get up in the morning they will be done but it'd be good if I was able to check results before I move onto the tmux I/O testing. (Obviously I'd like to start that in the morning as well, due to the time it takes...)
 
Last edited by a moderator:
D

Deleted47050

Guest
That's weird, last time I used it I did not get any error that I can remember (version 9.3).
 

Mirfster

Doesn't know what he's talking about
Joined
Oct 2, 2015
Messages
3,215
I tried smartctl -a /dev/ada1 and it came up with an error re syntax, as it did with any of my other disks too.
It said I needed to use the -d option to specify a device?

Anyway I've started the long tests now so when I get up in the morning they will be done but it'd be good if I was able to check results before I move onto the tmux I/O testing. (Obviously I'd like to start that in the morning as well, due to the time it takes...)
Are your drives detected as "ada"? Mine show up as "da" (da0, da1, da2, etc). Check under [Storage] - [View Disks] to see what they are listed as.
 

VladTepes

Patron
Joined
May 18, 2016
Messages
287
Version 9.10

Definitely ada1 through to ada6


OK tried again via GUI shell and would appreciate greatly if someone could assist me in interpreting the results (below).

It's worth noting that ada1 ada2 and ada3 are newly purchased disks, while ada4 ada5 and ada6 came from the synology NAS I had. (which hadn't been intensively used, but which had failed in a permanent kind of a way)

My (amateur) assaumption is that ada1-3 are fine
ada4 is completely buggered and ada5 and ada6 are somewhat stuffed.

Is there any point in doing full I/O (tmux) testing on all these disks?



Thank you, in advance, for any assistance you can offer !


RESULTS from smarctrl -a dev/adax testing

Code:
ADA1

Shell
Conveyance self-test routine                                                                                                    
recommended polling time:        (   5) minutes.                                                                                
SCT capabilities:              (0x703d) SCT Status supported.                                                                    
                                        SCT Error Recovery Control supported.                                                    
                                        SCT Feature Control supported.                                                          
                                        SCT Data Table supported.                                                                
                                                                                                                                
SMART Attributes Data Structure revision number: 16                                                                              
Vendor Specific SMART Attributes with Thresholds:                                                                                
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE                                
  1 Raw_Read_Error_Rate     0x002f   100   253   051    Pre-fail  Always       -       0                                        
  3 Spin_Up_Time            0x0027   100   253   021    Pre-fail  Always       -       0                                        
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       2                                        
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0                                        
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0                                        
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       18                                        
10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0                                        
11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0                                        
12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       2                                        
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       0                                        
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       15                                        
194 Temperature_Celsius     0x0022   124   118   000    Old_age   Always       -       26                                        
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0                                        
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0                                        
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0                                        
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0                                        
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0                                        
                                                                                                                                
SMART Error Log Version: 1                                                                                                      
No Errors Logged                                                                                                                
                                                                                                                                
SMART Self-test log structure revision number 1                                                                                  
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error                                  
# 1  Extended offline    Completed without error       00%        12         -                                                  
# 2  Conveyance offline  Completed without error       00%         4         -                                                  
# 3  Short offline       Completed without error       00%         3         -                                                  
                                                                                                                                
SMART Selective self-test log data structure revision number 1                                                                  
SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS                                                                                    
    1        0        0  Not_testing                                                                                            
    2        0        0  Not_testing                                                                                            
    3        0        0  Not_testing                                                                                            
    4        0        0  Not_testing                                                                                            
    5        0        0  Not_testing                                                                                            
Selective self-test flags (0x0):                                                                                                
  After scanning selected spans, do NOT read-scan remainder of disk.                                                            
If Selective self-test is pending on power-up, resume after 0 minute delay.                                                      
         
                                                                                                                    
ADA2

recommended polling time:        ( 407) minutes.                                                                                
Conveyance self-test routine                                                                                                    
recommended polling time:        (   5) minutes.                                                                                
SCT capabilities:              (0x703d) SCT Status supported.                                                                    
                                        SCT Error Recovery Control supported.                                                    
                                        SCT Feature Control supported.                                                          
                                        SCT Data Table supported.                                                                
                                                                                                                                
SMART Attributes Data Structure revision number: 16                                                                              
Vendor Specific SMART Attributes with Thresholds:                                                                                
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE                                
  1 Raw_Read_Error_Rate     0x002f   100   253   051    Pre-fail  Always       -       0                                        
  3 Spin_Up_Time            0x0027   100   253   021    Pre-fail  Always       -       0                                        
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       5                                        
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0                                        
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0                                        
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       107                                      
10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0                                        
11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0                                        
12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       5                                        
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       2                                        
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       109                                      
194 Temperature_Celsius     0x0022   124   117   000    Old_age   Always       -       26                                        
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0                                        
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0                                        
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0                                        
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0                                        
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0                                        
                                                                                                                                
SMART Error Log Version: 1                                                                                                      
No Errors Logged                                                                                                                
                                                                                                                                
SMART Self-test log structure revision number 1                                                                                  
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error                                  
# 1  Extended offline    Completed without error       00%       101         -                                                  
# 2  Conveyance offline  Completed without error       00%        93         -                                                  
# 3  Short offline       Completed without error       00%        92         -                                                  
                                                                                                                                
SMART Selective self-test log data structure revision number 1                                                                  
SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS                                                                                    
    1        0        0  Not_testing                                                                                            
    2        0        0  Not_testing                                                                                            
    3        0        0  Not_testing                                                                                            
    4        0        0  Not_testing                                                                                            
    5        0        0  Not_testing                                                                                            
Selective self-test flags (0x0):                                                                                                
  After scanning selected spans, do NOT read-scan remainder of disk.                                                            
If Selective self-test is pending on power-up, resume after 0 minute delay.


ADA3

recommended polling time:        ( 393) minutes.                                                                                
Conveyance self-test routine                                                                                                    
recommended polling time:        (   5) minutes.                                                                                
SCT capabilities:              (0x703d) SCT Status supported.                                                                    
                                        SCT Error Recovery Control supported.                                                    
                                        SCT Feature Control supported.                                                          
                                        SCT Data Table supported.                                                                
                                                                                                                                
SMART Attributes Data Structure revision number: 16                                                                              
Vendor Specific SMART Attributes with Thresholds:                                                                                
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE                                
  1 Raw_Read_Error_Rate     0x002f   100   253   051    Pre-fail  Always       -       0                                        
  3 Spin_Up_Time            0x0027   194   194   021    Pre-fail  Always       -       5275                                      
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       6                                        
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0                                        
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0                                        
  9 Power_On_Hours          0x0032   099   099   000    Old_age   Always       -       967                                      
10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0                                        
11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0                                        
12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       6                                        
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       4                                        
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       967                                      
194 Temperature_Celsius     0x0022   123   108   000    Old_age   Always       -       27                                        
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0                                        
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0                                        
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0                                        
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0                                        
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0                                        
                                                                                                                                
SMART Error Log Version: 1                                                                                                      
No Errors Logged                                                                                                                
                                                                                                                                
SMART Self-test log structure revision number 1                                                                                  
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error                                  
# 1  Extended offline    Completed without error       00%       961         -                                                  
# 2  Conveyance offline  Completed without error       00%       953         -                                                  
# 3  Short offline       Completed without error       00%       953         -                                                  
                                                                                                                                
SMART Selective self-test log data structure revision number 1                                                                  
SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS                                                                                    
    1        0        0  Not_testing                                                                                            
    2        0        0  Not_testing                                                                                            
    3        0        0  Not_testing                                                                                            
    4        0        0  Not_testing                                                                                            
    5        0        0  Not_testing                                                                                            
Selective self-test flags (0x0):                                                                                                
  After scanning selected spans, do NOT read-scan remainder of disk.                                                            
If Selective self-test is pending on power-up, resume after 0 minute delay. 




continued next post....
 
Last edited by a moderator:

VladTepes

Patron
Joined
May 18, 2016
Messages
287
Code:
ADA4


recommended polling time:        ( 389) minutes.                                                                                  

Conveyance self-test routine                                                                                                      

recommended polling time:        (   5) minutes.                                                                                  

SCT capabilities:              (0x70bd) SCT Status supported.                                                                      

                                        SCT Error Recovery Control supported.                                                      

                                        SCT Feature Control supported.                                                            

                                        SCT Data Table supported.                                                                  

                                                                                                                                  

SMART Attributes Data Structure revision number: 16                                                                                

Vendor Specific SMART Attributes with Thresholds:                                                                                  

ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE                                  

  1 Raw_Read_Error_Rate     0x002f   182   182   051    Pre-fail  Always       -       9293                                        

  3 Spin_Up_Time            0x0027   184   182   021    Pre-fail  Always       -       5800                                        

  4 Start_Stop_Count        0x0032   098   098   000    Old_age   Always       -       2232                                        

  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0                                          

  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0                                          

  9 Power_On_Hours          0x0032   098   098   000    Old_age   Always       -       1830                                        

10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0                                          

11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0                                          

12 Power_Cycle_Count       0x0032   098   098   000    Old_age   Always       -       2053                                        

192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       14                                          

193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       2217                                        

194 Temperature_Celsius     0x0022   124   104   000    Old_age   Always       -       26                                          

196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0                                          

197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0                                          

198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0                                          

199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0                                          

200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0                                          

                                                                                                                                  

SMART Error Log Version: 1                                                                                                        

No Errors Logged                                                                                                                  

                                                                                                                                  

SMART Self-test log structure revision number 1                                                                                    

Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error                                    

# 1  Extended offline    Completed without error       00%      1824         -                                                    

# 2  Conveyance offline  Completed without error       00%      1816         -                                                    

# 3  Short offline       Completed without error       00%      1816         -                                                    

                                                                                                                                  

SMART Selective self-test log data structure revision number 1                                                                    

SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS                                                                                      

    1        0        0  Not_testing                                                                                              

    2        0        0  Not_testing                                                                                              

    3        0        0  Not_testing                                                                                              

    4        0        0  Not_testing                                                                                              

    5        0        0  Not_testing                                                                                              

Selective self-test flags (0x0):                                                                                                  

  After scanning selected spans, do NOT read-scan remainder of disk.                                                              

If Selective self-test is pending on power-up, resume after 0 minute delay.                                                        

                                                                            


ADA5


recommended polling time:        ( 392) minutes.                                                                                  

Conveyance self-test routine                                                                                                      

recommended polling time:        (   5) minutes.                                                                                  

SCT capabilities:              (0x70bd) SCT Status supported.                                                                      

                                        SCT Error Recovery Control supported.                                                      

                                        SCT Feature Control supported.                                                            

                                        SCT Data Table supported.                                                                  

                                                                                                                                  

SMART Attributes Data Structure revision number: 16                                                                                

Vendor Specific SMART Attributes with Thresholds:                                                                                  

ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE                                  

  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0                                          

  3 Spin_Up_Time            0x0027   180   179   021    Pre-fail  Always       -       5958                                        

  4 Start_Stop_Count        0x0032   098   098   000    Old_age   Always       -       2231                                        

  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0                                          

  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0                                          

  9 Power_On_Hours          0x0032   098   098   000    Old_age   Always       -       1826                                        

10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0                                          

11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0                                          

12 Power_Cycle_Count       0x0032   098   098   000    Old_age   Always       -       2052                                        

192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       13                                          

193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       2217                                        

194 Temperature_Celsius     0x0022   122   103   000    Old_age   Always       -       28                                          

196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0                                          

197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0                                          

198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0                                          

199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0                                          

200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0                                          

                                                                                                                                  

SMART Error Log Version: 1                                                                                                        

No Errors Logged                                                                                                                  

                                                                                                                                  

SMART Self-test log structure revision number 1                                                                                    

Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error                                    

# 1  Extended offline    Completed without error       00%      1819         -                                                    

# 2  Conveyance offline  Completed without error       00%      1811         -                                                    

# 3  Short offline       Completed without error       00%      1811         -                                                    

                                                                                                                                  

SMART Selective self-test log data structure revision number 1                                                                    

SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS                                                                                      

    1        0        0  Not_testing                                                                                              

    2        0        0  Not_testing                                                                                              

    3        0        0  Not_testing                                                                                              

    4        0        0  Not_testing                                                                                              

    5        0        0  Not_testing                                                                                              

Selective self-test flags (0x0):                                                                                                  

  After scanning selected spans, do NOT read-scan remainder of disk.                                                              

If Selective self-test is pending on power-up, resume after 0 minute delay.



ADA6


recommended polling time:        ( 405) minutes.                                                                                  

Conveyance self-test routine                                                                                                      

recommended polling time:        (   5) minutes.                                                                                  

SCT capabilities:              (0x70bd) SCT Status supported.                                                                      

                                        SCT Error Recovery Control supported.                                                      

                                        SCT Feature Control supported.                                                            

                                        SCT Data Table supported.                                                                  

                                                                                                                                  

SMART Attributes Data Structure revision number: 16                                                                                

Vendor Specific SMART Attributes with Thresholds:                                                                                  

ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE                                  

  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0                                          

  3 Spin_Up_Time            0x0027   179   179   021    Pre-fail  Always       -       6008                                        

  4 Start_Stop_Count        0x0032   098   098   000    Old_age   Always       -       2231                                        

  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0                                          

  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0                                          

  9 Power_On_Hours          0x0032   098   098   000    Old_age   Always       -       1824                                        

10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0                                          

11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0                                          

12 Power_Cycle_Count       0x0032   098   098   000    Old_age   Always       -       2052                                        

192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       13                                          

193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       2217                                        

194 Temperature_Celsius     0x0022   122   105   000    Old_age   Always       -       28                                          

196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0                                          

197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0                                          

198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0                                          

199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0                                          

200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0                                          

                                                                                                                                  

SMART Error Log Version: 1                                                                                                        

No Errors Logged                                                                                                                  

                                                                                                                                  

SMART Self-test log structure revision number 1                                                                                    

Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error                                    

# 1  Extended offline    Completed without error       00%      1818         -                                                    

# 2  Conveyance offline  Completed without error       00%      1809         -                                                    

# 3  Short offline       Completed without error       00%      1809         -                                                    

                                                                                                                                  

SMART Selective self-test log data structure revision number 1                                                                    

SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS                                                                                      

    1        0        0  Not_testing                                                                                              

    2        0        0  Not_testing                                                                                              

    3        0        0  Not_testing                                                                                              

    4        0        0  Not_testing                                                                                              

    5        0        0  Not_testing                                                                                              

Selective self-test flags (0x0):                                                                                                  

  After scanning selected spans, do NOT read-scan remainder of disk.                                                              

If Selective self-test is pending on power-up, resume after 0 minute delay.



END
 
Last edited by a moderator:

Ericloewe

Server Wrangler
Moderator
Joined
Feb 15, 2014
Messages
20,194
Please use CODE tags to preserve formatting and avoid walls of text.
 

VladTepes

Patron
Joined
May 18, 2016
Messages
287
Noted for future. Thanks for the tip.

Does the 30.000 character limit still apply for data inside code tags?
 
Last edited:

Ericloewe

Server Wrangler
Moderator
Joined
Feb 15, 2014
Messages
20,194
Probably, but something's wrong if you feel the need to paste nearly 30kB of text.

If it's truly indispensable, attaching it as a file is easier for everyone.
 

wblock

Documentation Engineer
Joined
Nov 14, 2014
Messages
1,506
Or pasting it on the web where people can read it without downloading.

Code tags added, form letter sent. :)
 

VladTepes

Patron
Joined
May 18, 2016
Messages
287
Please note I have done more investigation and rephrased this to make it an easy-to-answer (I hope) question :)

Edit: Having re-read "Fester's Guide" in the wiki (excellent by the way) and looking at stats for lines 1, 5, 7, 19, 11 and 196-199 (as specified in that document) it appears to me that:

ada1, 2, 3, 5, 6 - OK

ada4 - not OK. The stats are zero for the important ones other than the Raw-Read-Error-Rate which really is "up there" at a staggering 9293 !

Can someone confirm for me that this assessment is correct please? Would it be worth testing ada4 with any other tests or re-testing with smartctl or is it a case of ditching this drive?

Thanks
 
Last edited:

rogerh

Guru
Joined
Apr 18, 2014
Messages
1,111
Please note I have done more investigation and rephrased this to make it an easy-to-answer (I hope) question :)

Edit: Having re-read "Fester's Guide" in the wiki (excellent by the way) and looking at stats for lines 1, 5, 7, 19, 11 and 196-199 (as specified in that document) it appears to me that:

ada1, 2, 3, 5, 6 - OK

ada4 - not OK. The stats are zero for the important ones other than the Raw-Read-Error-Rate which really is "up there" at a staggering 9293 !

Can someone confirm for me that this assessment is correct please? Would it be worth testing ada4 with any other tests or re-testing with smartctl or is it a case of ditching this drive?

Thanks
It's probably worth repeating the long test. And if this is passed, check whether the Raw-Read-Error-Rate is growing.
 

wblock

Documentation Engineer
Joined
Nov 14, 2014
Messages
1,506
No reallocated or pending sectors on ada4, though. Might be worth replacing the cable to that drive.
 

VladTepes

Patron
Joined
May 18, 2016
Messages
287
I re-ran the test and the results are the same as before.
OK I will try a new cable. Ta.
 

rogerh

Guru
Joined
Apr 18, 2014
Messages
1,111
It
I re-ran the test and the results are the same as before.
OK I will try a new cable. Ta.
It's worth trying the cable in case the fault is intermittent, but if the long test is ok and the error rate is not increasing then I would have thought you could afford to keep the drive. Just do short SMART tests every few days and long ones every week or two and occasionally check the "raw=-error" rate that was high. Could the errors have occurred while the drive was in the old NAS? If so they might be due to a poor connection or an error when the drive was new that has not worsened.
 

VladTepes

Patron
Joined
May 18, 2016
Messages
287
It

It's worth trying the cable in case the fault is intermittent, but if the long test is ok and the error rate is not increasing then I would have thought you could afford to keep the drive. Just do short SMART tests every few days and long ones every week or two and occasionally check the "raw=-error" rate that was high. Could the errors have occurred while the drive was in the old NAS? If so they might be due to a poor connection or an error when the drive was new that has not worsened.

I think it's extremely likely that an error occurred with the drive when it was in the old NAS.

The reason behind my previous questions is that I'm new at this stuff and SMART testing, but you people are helping me learn which is great. :)

My confusion stems from the fact that I'm not sure what those potential error sources actually MEAN and in this case what effect they have in the real world. I'd simply read that those particular errors should have a raw rate of ZERO whereas 9,000+ appears to me to be significantly not good. :)

Also another quick burn in Q. The process recommends long SMART followed by I/O (tmux) followed by long SMART. Why the first run of SMART if it is to be run later anyway?
 

VladTepes

Patron
Joined
May 18, 2016
Messages
287
Latest long test - no change/ exactly the same result as before. Exactly.
 

rogerh

Guru
Joined
Apr 18, 2014
Messages
1,111
Latest long test - no change/ exactly the same result as before. Exactly.
AFAICS the only purpose of the long test before is to save the bother of doing the burn-in if the drive is already defective. On your present findings I think it would be reasonable to go ahead with the badblocks on ada4 in the hope it will now prove stable.
 

Mirfster

Doesn't know what he's talking about
Joined
Oct 2, 2015
Messages
3,215

tumblingthrough

Dabbler
Joined
Aug 29, 2014
Messages
30
Many many thanks for this How To from a FreeNAS noob. Excellent to feel like I'm doing the right thing before I start with setting up my new box.

My badblocks tests are running in tmux on my 4 x 3TB disks as I write. Just thought I'd mention that I'm getting this info for each of the sessions:

Code:
master-backend-new# badblocks -b 4096 -ws /dev/ada1                                                                                 
Testing with pattern 0xaa: set_o_direct: Inappropriate ioctl for device                                                             
 73.22% done, 4:53:09 elapsed. (0/0/0 errors)


"inappropriate ioctl"? Can anyone explain?
 
Top