SAS SMART self-test results confusing... Help needed

nasnice

Explorer
Joined
Jul 14, 2014
Messages
82
Hi all,
I am currently building a new NAS and was running the hard drive burn-in procedure from our forum guide.
I ran into some strange results, and since this is my first SAS build I am at a loss as to what to do...
The system is a non-ECC motherboard running the latest FreeNAS release, 11.2-U7. It uses a Dell HC310 SAS controller flashed to IT mode, and here is my first question.

During startup this controller is identified as follows:
Code:
Nov 28 06:16:22 freenas mps0: <Avago Technologies (LSI) SAS2008> port 0xe000-0xe0ff mem 0xdf140000-0xdf14ffff,0xdf100000-0xdf13ffff irq 16 at device 0.0 on pci1
Nov 28 06:16:22 freenas mps0: Firmware: 20.00.07.00, Driver: 21.02.00.00-fbsd
Nov 28 06:16:22 freenas mps0: IOCCapabilities: 1285c<ScsiTaskFull,DiagTrace,SnapBuf,EEDP,TransRetry,EventReplay,HostDisc>


So the firmware is at version 20 and the driver is at version 21. Is that a problem?

After the first SMART test I checked the results and got the following (the output of smartctl -a /dev/ad0 came in little chunks and took 3 minutes to complete...)
Code:
=== START OF INFORMATION SECTION ===                                                                                               
Vendor:               HGST                                                                                                         
Product:              HUH728080AL5200                                                                                               
Revision:             A907                                                                                                         
Compliance:           SPC-4                                                                                                         
User Capacity:        8,001,563,222,016 bytes [8.00 TB]                                                                             
Logical block size:   512 bytes                                                                                                     
Physical block size:  4096 bytes                                                                                                   
LU is fully provisioned                                                                                                             
Rotation Rate:        7200 rpm                                                                                                     
Form Factor:          3.5 inches                                                                                                   
Logical Unit id:      0x5000cca2541ac5a8                                                                                           
Serial number:        VKGGREHV                                                                                                     
Device type:          disk                                                                                                         
Transport protocol:   SAS (SPL-3)                                                                                                   
Local Time is:        Fri Nov 29 06:38:59 2019 WET                                                                                 
SMART support is:     Available - device has SMART capability.                                                                     
SMART support is:     Enabled                                                                                                       
Temperature Warning:  Enabled                                                                                                       
                                                                                                                                    
=== START OF READ SMART DATA SECTION ===                                                                                           
SMART Health Status: OK                                                                                                             
                                                                                                                                    
Current Drive Temperature:     36 C                                                                                                 
Drive Trip Temperature:        85 C                                                                                                 
                                                                                                                                    
Manufactured in week 21 of year 2018                                                                                               
Specified cycle count over device lifetime:  50000                                                                                 
Accumulated start-stop cycles:  2                                                                                                   
Specified load-unload count over device lifetime:  600000                                                                           
Accumulated load-unload cycles:  95805                                                                                             
Seagate Cache Log Sense Failed: Input/output error                                                                                 
Error counter log:                                                                                                                 
           Errors Corrected by           Total   Correction     Gigabytes    Total                                                 
               ECC          rereads/    errors   algorithm      processed    uncorrected                                           
           fast | delayed   rewrites  corrected  invocations   [10^9 bytes]  errors                                                 
read:          0        0         0         0      12950          1.047           0                                                 
write:         0        0         0         0          0          0.000           0                                                 
verify:        0        0         0         0      19348          0.000           0                                                 
                                                                                                                                    
Non-medium error count:        0                                                                                                   
                                                                                                                                    
SMART Self-test log                                                                                                                 
Num  Test              Status                 segment  LifeTime  LBA_first_err [SK ASC ASQ]                                         
     Description                              number   (hours)                                                                     
# 1  Background long   Failed in segment -->       7       3        7317853072 [0x3 0x5d 0x1]
# 2  Background short  Failed in segment -->       6       0        7373601840 [0x3 0x5d 0x1]                                       
                                                                                                                                    
Long (extended) Self Test duration: 65535 seconds [1092.2 minutes]   


This drive is definitely bad, or is it?

This is so different from SATA SMART results that I have trouble interpreting the information... what is important, and what is not? I looked around but did not find a definite answer, only many more topics from confused users...

Specifically, what does "Correction algorithm invocations" mean? Is that a normal value?
Thanks in advance for any insights...
 
Joined
Jul 3, 2015
Messages
926
HBA driver versions often don't match the firmware version, so I wouldn't worry. I don't have that card, but I think the general feeling is that you should run the latest firmware unless there is a known reason not to.

As for the drive: I have LOTS of SAS drives and can confirm that drive is bad.
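
For anyone wondering what the bracketed [SK ASC ASQ] values in the failed rows mean, here is a rough Python sketch that decodes them. It is only an illustration: the function name decode_sense is made up for this post, and the lookup tables contain just the standard SCSI sense keys and the 0x5d codes that appear in this thread, not the full T10 list.

```python
# Standard SCSI sense keys (subset).
SENSE_KEYS = {
    0x0: "No Sense",
    0x1: "Recovered Error",
    0x2: "Not Ready",
    0x3: "Medium Error",
    0x4: "Hardware Error",
    0x5: "Illegal Request",
    0x6: "Unit Attention",
}

# Only the ASC/ASCQ pairs relevant here; the full T10 table is far longer.
ASC_ASCQ = {
    (0x5D, 0x00): "Failure prediction threshold exceeded",
    (0x5D, 0x01): "Media failure prediction threshold exceeded",
}


def decode_sense(triplet):
    """Decode a string such as '0x3 0x5d 0x1' (hypothetical helper)."""
    sk, asc, ascq = (int(part, 16) for part in triplet.split())
    key = SENSE_KEYS.get(sk, "unknown sense key 0x%x" % sk)
    detail = ASC_ASCQ.get(
        (asc, ascq),
        "ASC/ASCQ 0x%02x/0x%02x (not in this table)" % (asc, ascq),
    )
    return "%s: %s" % (key, detail)


# The triplet reported by both failed self-tests in the first post.
print(decode_sense("0x3 0x5d 0x1"))
```

So both failed tests report a medium error together with a failure-prediction code, which fits the verdict above.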
 
Joined
Jul 3, 2015
Messages
926
'Failed in segment' is a common failure on HGST drives, and since I have more than a thousand of them I speak from experience.
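
If you have many drives to check, something like the following Python sketch could pick the failed rows out of the "SMART Self-test log" table. It assumes the row layout shown in this thread (smartctl output for SAS drives); the regular expression and the function name failed_self_tests are my own illustration, not part of smartmontools, and other smartctl versions may format rows differently.

```python
import re

# One row of the SCSI "SMART Self-test log" table as printed in this thread.
SELFTEST_ROW = re.compile(
    r"^#\s*(?P<num>\d+)\s+"
    r"(?P<test>Background (?:long|short))\s+"
    r"(?P<status>.+?)\s+"
    r"(?P<segment>-|\d+)\s+"
    r"(?P<hours>\d+)\s+"
    r"(?P<lba>-|\d+)\s+"
    r"\[(?P<sense>[^\]]*)\]"
)


def _number(field):
    """Convert a table field to int, mapping the '-' placeholder to None."""
    return None if field == "-" else int(field)


def failed_self_tests(smartctl_output):
    """Return one dict per self-test row whose status starts with 'Failed'."""
    failures = []
    for line in smartctl_output.splitlines():
        match = SELFTEST_ROW.match(line.strip())
        if match and match.group("status").startswith("Failed"):
            failures.append({
                "test": match.group("test"),
                "segment": _number(match.group("segment")),
                "hours": _number(match.group("hours")),
                "first_error_lba": _number(match.group("lba")),
                "sense": match.group("sense").split(),
            })
    return failures


# One failed row from the first post and one completed row, for illustration.
sample = """\
# 1  Background long   Failed in segment -->       7       3        7317853072 [0x3 0x5d 0x1]
# 2  Background short  Completed                   -   17864                 - [-   -    -]
"""

for failure in failed_self_tests(sample):
    print(failure)
```

Only rows whose status begins with "Failed" are reported, so completed tests are not flagged.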
 

nasnice

Explorer
Joined
Jul 14, 2014
Messages
82
Just found out that my new drives are not eligible for the HGST warranty... HGST suggests resolving this with the supplier. I bought them (6) through Amazon, so I will return the lot and order really NEW drives. I found a remark on a German forum that these drives may be reconditioned or OEM surplus... My bad...
I am still confused about that SMART test result. Could anyone post a good or perfect SMART report from a SAS drive here, just for reference?
 
Joined
Jul 3, 2015
Messages
926
Code:
Current Drive Temperature:     27 C                                                                                                 
Drive Trip Temperature:        85 C                                                                                                 
                                                                                                                                    
Manufactured in week 50 of year 2016                                                                                               
Specified cycle count over device lifetime:  50000                                                                                 
Accumulated start-stop cycles:  65                                                                                                 
Specified load-unload count over device lifetime:  600000                                                                           
Accumulated load-unload cycles:  1039                                                                                               
Elements in grown defect list: 0                                                                                                   
                                                                                                                                    
Vendor (Seagate) cache information                                                                                                 
  Blocks sent to initiator = 13147544540413952                                                                                     
                                                                                                                                    
Error counter log:                                                                                                                 
           Errors Corrected by           Total   Correction     Gigabytes    Total                                                 
               ECC          rereads/    errors   algorithm      processed    uncorrected                                           
           fast | delayed   rewrites  corrected  invocations   [10^9 bytes]  errors                                                 
read:          0        3         0         3   12182273     182929.955           0                                                 
write:         0        0         0         0     425962      45941.240           0                                                 
verify:        0        0         0         0      24195          0.000           0                                                 
                                                                                                                                    
Non-medium error count:        0                                                                                                   
                                                                                                                                    
SMART Self-test log                                                                                                                 
Num  Test              Status                 segment  LifeTime  LBA_first_err [SK ASC ASQ]                                         
     Description                              number   (hours)                                                                     
# 1  Background short  Completed                   -   17984                 - [-   -    -]                                         
# 2  Background short  Completed                   -   17864                 - [-   -    -]                                         
# 3  Background short  Completed                   -   17744                 - [-   -    -]                                         
# 4  Background short  Completed                   -   17624                 - [-   -    -]                                         
# 5  Background short  Completed                   -   17504                 - [-   -    -]                                         
# 6  Background long   Completed                   -   17405                 - [-   -    -]                                         
# 7  Background short  Completed                   -   17384                 - [-   -    -]                                         
# 8  Background short  Completed                   -   17312                 - [-   -    -]                                         
# 9  Background short  Completed                   -   17192                 - [-   -    -]                                         
#10  Background short  Completed                   -   17072                 - [-   -    -]                                         
#11  Background short  Completed                   -   16952                 - [-   -    -]                                         
#12  Background short  Completed                   -   16832                 - [-   -    -]                                         
#13  Background long   Completed                   -   16762                 - [-   -    -]                                         
#14  Background short  Completed                   -   16713                 - [-   -    -]                                         
#15  Background short  Completed                   -   16568                 - [-   -    -]                                         
#16  Background short  Completed                   -   16448                 - [-   -    -]                                         
#17  Background short  Completed                   -   16328                 - [-   -    -]                                         
#18  Background short  Completed                   -   16209                 - [-   -    -]                                         
#19  Background short  Completed                   -   16088                 - [-   -    -]                                         
#20  Background long   Completed                   -   15989                 - [-   -    -]                                         
                                                                                                                                    
Long (extended) Self Test duration: 65535 seconds [1092.2 minutes]                             
 
Joined
Jul 3, 2015
Messages
926
I find 'Elements in grown defect list' to be a good marker. A value above 0 is suspect, and if it increases much further over a relatively short space of time then I pull the drive.
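
As a rough sketch of that rule of thumb in Python: the function names and the growth_limit value below are hypothetical, since "increases much more" is a judgment call and not a fixed number. The code only reads the "Elements in grown defect list" line as it appears in the smartctl output above.

```python
import re

GROWN_DEFECTS = re.compile(
    r"^Elements in grown defect list:\s*(\d+)", re.MULTILINE
)


def grown_defects(smartctl_output):
    """Return the grown defect count, or None if the line is missing."""
    match = GROWN_DEFECTS.search(smartctl_output)
    return int(match.group(1)) if match else None


def assess_grown_defects(previous, current, growth_limit=10):
    """Classify a drive from two readings taken some time apart.

    growth_limit is an assumed threshold for illustration only.
    """
    if current == 0:
        return "ok"
    if previous is not None and current - previous > growth_limit:
        return "pull"
    return "suspect"


report = "Elements in grown defect list: 0\n"
print(assess_grown_defects(None, grown_defects(report)))
```

Some drives do not print this line at all (the failing drive's report in the first post has none), in which case grown_defects returns None and this check cannot be used.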
 