SOLVED How to configure smartd to ignore a SMART attribute?

Pitfrr

Wizard
Joined
Feb 10, 2014
Messages
1,523
Hello,

I am using an SSD as a boot device. It is not in the smartctl database and its attributes are quite different from the usual ones.
So I get the console flooded with messages like:
smartd 55321 - - Device: /dev/ada0, 3464 Offline uncorrectable sectors

Attribute 198, which usually holds the offline uncorrectable sector count, is mapped to something else on this drive. It seems to be "Total count of read sectors", which of course increments regularly, so smartd complains because it thinks those are uncorrectable sectors...

How can I tell smartd to ignore this?
My initial research would indicate that this could be possible through editing smartd.conf and adding something like -I 198 -i 198 to that drive line.
But...
  • I'm not sure! :smile:
  • smartd.conf is not editable (well it is but the changes are not persistent)

Therefore my question is, how can I tell smartd to ignore attribute 198 for this drive?

Thanks for your feedback.
 

Pitfrr

Wizard
Joined
Feb 10, 2014
Messages
1,523
I also found this bug, but it's not helping much with how to do it... :smile:
 

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,702
You have additional smart options in each disk in the GUI... can't you add it there?
 

Pitfrr

Wizard
Joined
Feb 10, 2014
Messages
1,523
Thanks for the tip!

It is always good to have another perspective... because I didn't think about that.
I had a closer look at it and, unless I missed it or misunderstood it (which can't be completely ruled out! :smile:), there seems to be no SMART option to ignore an attribute. At least I couldn't find anything there that would suggest it.

But then I saw another parameter: Enable SMART! :smile:
It is a bit drastic but that works for me. So I disabled SMART on this drive. Since the SMART parameters are completely different and the SMART tests also behave differently, I don't mind so much (and it is only the boot drive).
Of course it is a less "clean" solution, but it is an acceptable workaround!
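
For anyone who would rather keep SMART monitoring on: the console message in the first post looks like the output of smartd's offline-uncorrectable check, which the smartd.conf man page documents as the -U directive (defaulting to attribute 198), rather than something -I or -i would silence. According to the same man page, an ID of 0 switches that check off. A minimal sketch of what the resulting smartd.conf entry could look like, assuming the device is /dev/ada0 and the usual -a checks are otherwise wanted (smartd_line is a hypothetical helper name, used only to print the line):

```shell
#!/bin/sh
# Sketch only: print a smartd.conf entry that keeps the default checks (-a)
# but turns off the offline-uncorrectable report.
# smartd_line is a hypothetical helper; /dev/ada0 comes from the log message.
smartd_line() {
    device="$1"
    # -a   : default set of checks (includes -C 197 and -U 198)
    # -U 0 : ID 0 switches the offline-uncorrectable report off
    printf '%s -a -U 0\n' "$device"
}

smartd_line /dev/ada0
```

Since smartd.conf is regenerated on FreeNAS/TrueNAS, presumably only the trailing -U 0 would go into the per-disk extra SMART options field in the GUI. How the GUI merges that field with its own options is an assumption here, not something confirmed in this thread.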
 

HoneyBadger

actually does care
Administrator
Moderator
iXsystems
Joined
Feb 6, 2014
Messages
5,110
Curious - which drive make/model is this that reports its SMART data so differently? If it's potentially common, having it documented (even here on the forums) would let fellow users pull it up by keyword searches.
 

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,702
If the database has the drive, maybe there's a chance of correcting it...

/usr/local/share/smartmontools/drivedb.h

I guess it needs a recompile to take any changes to that header file though.
 

joeschmuck

Old Man
Moderator
Joined
May 28, 2011
Messages
10,973
If the database has the drive, maybe there's a chance of correcting it...

/usr/local/share/smartmontools/drivedb.h

I guess it needs a recompile to take any changes to that header file though.
The only issue is that it would need to end up in the FreeBSD 12 distro, because iXsystems is not really making custom builds of things like this; I think those days are dead. But a user inclined to go through the steps could recompile it manually. I used to do things like this all the time, but that was many years ago.
 

Pitfrr

Wizard
Joined
Feb 10, 2014
Messages
1,523
The drive is a Renice 128GB SSD drive (model: Renice X2).
I found some information from the manufacturer about the SMART attributes and they are completely different to the usual attributes.
Apart from attributes 9 (power-on hours) and 12 (power cycle count), the rest do not match.

Some examples:
ID#  Standard SMART description  Manufacturer description
196  Reallocated event count     Erase failure block count
197  Current pending sector      Read failure block count
198  Offline uncorrectable       Total count of read sectors
199  UDMA_CRC error count        Total count of write sector
 

joeschmuck

Old Man
Moderator
Joined
May 28, 2011
Messages
10,973
Unfortunately I can't reach a single site that supports this drive manufacturer. We have a tight firewall, maybe when I get home.

Well, a manufacturer does not need to stick with the group and can do what they want. I'd be curious to see what a full smartctl -x output looks like.
 

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,702
Interesting that there is a Renice Z2 in the header:
Code:
  { "Indilinx Barefoot based SSDs",
    "Corsair CSSD-V(32|60|64|128|256)GB2|" // Corsair Nova, tested with Corsair CSSD-V32GB2/2.2
    "Corsair CMFSSD-(32|64|128|256)D1|" // Corsair Extreme, tested with Corsair CMFSSD-128D1/1.0
    "CRUCIAL_CT(64|128|256)M225|" // tested with CRUCIAL_CT64M225/1571
    "G.SKILL FALCON (64|128|256)GB SSD|" // tested with G.SKILL FALCON 128GB SSD/2030
    "OCZ[ -](AGILITY|ONYX|VERTEX( 1199|-TURBO| v1\\.10)?)|" // tested with
      // OCZ-ONYX/1.6, OCZ-VERTEX 1199/00.P97, OCZ-VERTEX/1.30, OCZ VERTEX-TURBO/1.5, OCZ-VERTEX v1.10/1370
    "Patriot[ -]Torqx.*|"
    "RENICE Z2|" // tested with RENICE Z2/2030
    "STT_FT[MD](28|32|56|64)GX25H|" // Super Talent Ultradrive GX, tested with STT_FTM64GX25H/1916
    "TS(18|25)M(64|128)MLC(16|32|64|128|256|512)GSSD|" // ASAX Leopard Hunt II, tested with TS25M64MLC64GSSD/0.1
    "FM-25S2I-(64|128)GBFII|" // G.Skill FALCON II, tested with FM-25S2I-64GBFII
    "TS(60|120)GSSD25D-M", // Transcend Ultra SSD (SATA II), see also Ticket #80
    "", "",
    "-v 1,raw64 " // Raw_Read_Error_Rate
    "-v 9,raw64 " // Power_On_Hours
    "-v 12,raw64 " // Power_Cycle_Count
    "-v 184,raw64,Initial_Bad_Block_Count "
    "-v 195,raw64,Program_Failure_Blk_Ct "
    "-v 196,raw64,Erase_Failure_Blk_Ct "
    "-v 197,raw64,Read_Failure_Blk_Ct "
    "-v 198,raw64,Read_Sectors_Tot_Ct "
    "-v 199,raw64,Write_Sectors_Tot_Ct "
    "-v 200,raw64,Read_Commands_Tot_Ct "
    "-v 201,raw64,Write_Commands_Tot_Ct "
    "-v 202,raw64,Error_Bits_Flash_Tot_Ct "
    "-v 203,raw64,Corr_Read_Errors_Tot_Ct "
    "-v 204,raw64,Bad_Block_Full_Flag "
    "-v 205,raw64,Max_PE_Count_Spec "
    "-v 206,raw64,Min_Erase_Count "
    "-v 207,raw64,Max_Erase_Count "
    "-v 208,raw64,Average_Erase_Count "
    "-v 209,raw64,Remaining_Lifetime_Perc "
    "-v 210,raw64,Indilinx_Internal "
    "-v 211,raw64,SATA_Error_Ct_CRC "
    "-v 212,raw64,SATA_Error_Ct_Handshake "
    "-v 213,raw64,Indilinx_Internal"
  },


Maybe there's hope yet... you could possibly just add the X2 to the models list in that item and get away with it...

It's also interesting that plenty of manufacturers use that chip, so you're not alone with that non-standard allocation.
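
To illustrate why the X2 is reported as "Not in smartctl database" even though the Z2 is listed: the model strings in drivedb.h are regular expressions, and (as far as I understand it) they have to match the whole model string. A rough sketch using grep -E as a stand-in for that matching; the widened pattern RENICE [XZ]2 is a hypothetical edit, not something shipped in the database:

```shell
#!/bin/sh
# Sketch: test a model string against a drivedb-style pattern with grep -E.
# -x requires the whole line to match, -q suppresses output.
matches() {
    printf '%s\n' "$1" | grep -Eqx "$2"
}

current='RENICE Z2'      # pattern as it appears in the excerpt above
widened='RENICE [XZ]2'   # hypothetical edit covering both models

for model in 'RENICE Z2' 'RENICE X2'; do
    for pattern in "$current" "$widened"; do
        if matches "$model" "$pattern"; then
            echo "$model matches $pattern"
        else
            echo "$model does not match $pattern"
        fi
    done
done
```

This only checks the pattern logic. Whether the X2 really uses the same attribute layout as the other Indilinx Barefoot drives still has to be judged from the manufacturer's documentation.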
 

Pitfrr

Wizard
Joined
Feb 10, 2014
Messages
1,523
I couldn't find any trace of Z2 on the manufacturer's site...
On some other sites there is a mention of Z2, but related to another model: Renice K3VLAR 1.8" PATA ZIF SSD 128GB (Renice Z2).
But here too, no trace of the K3VLAR model. Apparently this model was discontinued in 2011.
 

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,702
Anyway, it seems like that model made it into the smartd database and has the same attribute assignment as your drive.
 

Pitfrr

Wizard
Joined
Feb 10, 2014
Messages
1,523
Here is the output of smartctl -x:
Code:
smartctl 7.1 2019-12-30 r5022 [FreeBSD 12.2-RC3 amd64] (local build)
Copyright (C) 2002-19, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Device Model:     RENICE X2
Serial Number:    RCM128-SX32521827172
Firmware Version: 2030
User Capacity:    128,035,676,160 bytes [128 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   ATA8-ACS (minor revision not indicated)
Local Time is:    Fri Dec  4 17:59:44 2020 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:   Unavailable
APM feature is:   Unavailable
Rd look-ahead is: Enabled
Write cache is:   Enabled
DSN feature is:   Unavailable
ATA Security is:  Disabled, frozen [SEC2]
Wt Cache Reorder: Unavailable

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      ( 249) Self-test routine in progress...
                                        90% of test remaining.
Total time to complete Offline
data collection:                (    0) seconds.
Offline data collection
capabilities:                    (0x1d) SMART execute Offline immediate.
                                        No Auto Offline data collection support.
                                        Abort Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        No Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x00) Error logging NOT supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   0) minutes.
Extended self-test routine
recommended polling time:        (   0) minutes.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate     ------   006   000   000    -    0
  9 Power_On_Hours          ------   052   012   000    -    0
 12 Power_Cycle_Count       ------   091   002   000    -    0
184 End-to-End_Error        ------   042   000   000    -    0
195 Hardware_ECC_Recovered  ------   000   000   000    -    0
196 Reallocated_Event_Count ------   000   000   000    -    0
197 Current_Pending_Sector  ------   000   000   000    -    0
198 Offline_Uncorrectable   ------   020   122   000    -    3533
199 UDMA_CRC_Error_Count    ------   030   031   000    -    9020
200 Unknown_SSD_Attribute   ------   069   035   000    -    23
201 Unknown_SSD_Attribute   ------   179   058   000    -    82
202 Unknown_SSD_Attribute   ------   008   166   000    -    2
203 Run_Out_Cancel          ------   077   142   000    -    2
204 Soft_ECC_Correction     ------   000   000   000    -    0
205 Thermal_Asperity_Rate   ------   136   019   000    -    0
206 Unknown_SSD_Attribute   ------   001   000   000    -    0
207 Unknown_SSD_Attribute   ------   085   001   000    -    0
208 Unknown_SSD_Attribute   ------   089   000   000    -    0
209 Unknown_SSD_Attribute   ------   099   000   000    -    0
211 Unknown_Attribute       ------   000   000   000    -    0
212 Unknown_Attribute       ------   000   000   000    -    0
213 Unknown_Attribute       ------   000   000   000    -    0
                            ||||||_ K auto-keep
                            |||||__ C event count
                            ||||___ R error rate
                            |||____ S speed/performance
                            ||_____ O updated online
                            |______ P prefailure warning

General Purpose Log Directory Version 1
SMART           Log Directory Version 1 [multi-sector log support]
Address    Access  R/W   Size  Description
0x00       GPL,SL  R/O      1  Log Directory
0x01       GPL,SL  R/O      1  Summary SMART error log
0x02       GPL,SL  R/O     51  Comprehensive SMART error log
0x03       GPL,SL  R/O     77  Ext. Comprehensive SMART error log
0x06       GPL,SL  R/O      1  SMART self-test log
0x07       GPL,SL  R/O      1  Extended self-test log
0x09       GPL,SL  R/W      1  Selective self-test log
0x10       GPL,SL  R/O      1  NCQ Command Error log
0x11       GPL,SL  R/O      1  SATA Phy Event Counters log
0xa0       GPL     VS    4096  Device vendor specific log

Warning! SMART Extended Comprehensive Error Log Structure error: invalid SMART checksum.
SMART Extended Comprehensive Error Log Version: 49 (77 sectors)
No Errors Logged

Warning! SMART Extended Self-test Log Structure error: invalid SMART checksum.
SMART Extended Self-test Log Version: 1 (1 sectors)
No self-tests have been logged.  [To run self-tests, use: smartctl -t]

Selective Self-tests/Logging not supported

SCT Commands not supported

Device Statistics (GP/SMART Log 0x04) not supported

Pending Defects log (GP Log 0x0c) not supported

SATA Phy Event Counters (GP Log 0x11)
[Reserved: 0x00 0x00 0x41 0x00]
ID      Size     Value  Description
0x0634  8       671944  Unknown

What does the invalid SMART checksum mean?
I also tried to run long and short test but they never finished... (Self-test routine in progress...90% of test remaining.)
 

Pitfrr

Wizard
Joined
Feb 10, 2014
Messages
1,523
Anyway, it seems like that model made it into the smartd database and has the same attribute assignment as your drive.
Is there an easy way to use the Z2 entry for the X2, or do I need to modify /usr/local/share/smartmontools/drivedb.h and recompile? (I might have done that a very long time ago on a Linux distribution... :tongue: But I wouldn't even know where to start with FreeNAS! :-D It could be a fun exercise, though...)
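
A recompile may not be necessary. The smartctl man page describes a -B [+]FILE option that reads drive database entries from a file at run time, with the + form adding them in front of the built-in entries. A minimal sketch under that assumption, which writes a one-entry file for the X2 that reuses a few of the -v presets from the Indilinx Barefoot entry quoted earlier. The file name and the family description are hypothetical:

```shell
#!/bin/sh
# Sketch: write a one-entry drive database file for the X2.
# Entry layout: { family, model regexp, firmware regexp, warning, presets }.
# The file name below is a hypothetical choice.
db="${TMPDIR:-/tmp}/renice_x2_drivedb.h"

cat > "$db" <<'EOF'
  { "Renice X2 (local entry, Indilinx Barefoot attribute layout assumed)",
    "RENICE X2",
    "", "",
    "-v 196,raw64,Erase_Failure_Blk_Ct "
    "-v 197,raw64,Read_Failure_Blk_Ct "
    "-v 198,raw64,Read_Sectors_Tot_Ct "
    "-v 199,raw64,Write_Sectors_Tot_Ct"
  },
EOF

echo "wrote $db"
# To try it (not run here; needs smartmontools and the drive):
#   smartctl -B "+$db" -P show /dev/ada0
#   smartctl -B "+$db" -A /dev/ada0
```

If I read the smartd.conf man page correctly, renaming attribute 198 through a -v preset like this also changes smartd's default from -U 198 to -U 0, which would stop the console messages. smartd is documented to accept the same -B option on its command line. Whether such a file survives a TrueNAS upgrade or reboot is a separate question.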
 

joeschmuck

Old Man
Moderator
Joined
May 28, 2011
Messages
10,973
What does the invalid SMART checksum mean?
I also tried to run long and short test but they never finished... (Self-test routine in progress...90% of test remaining.)
I would not put any faith in the SMART output until it has been properly recognized by the application. It does not make any sense that a SMART test doesn't complete.

As for compiling the code, I have not tried it on FreeNAS in a long time. I'd have to give it a try, but I don't think it's difficult. On a full FreeBSD install it's super easy to do. Maybe @sretalla has the answer, been doing great so far.
 

dak180

Patron
Joined
Nov 22, 2017
Messages
308

Pitfrr

Wizard
Joined
Feb 10, 2014
Messages
1,523
update-smart-drivedb doesn't seem to be in TrueNAS anymore, at least not in /usr/local/sbin and I tried to look around but couldn't find it.
It is present in 9.10.x and 11.3 though.
 

dak180

Patron
Joined
Nov 22, 2017
Messages
308
update-smart-drivedb doesn't seem to be in TrueNAS anymore
True, which is why there is that second link that explains how to fake it; keep in mind any changes will be wiped out by an upgrade and possibly even a reboot.
 

joeschmuck

Old Man
Moderator
Joined
May 28, 2011
Messages
10,973
You can also contact Bruce Allen (the author) and ask him to update the program. I was in touch with him about 3 years ago and he was supporting it back then. I was asking him some details about the program.
 

Pitfrr

Wizard
Joined
Feb 10, 2014
Messages
1,523
Ooh sorry, I didn't do my homework properly: I checked out the links but I was a bit quick on the second one!! :-(

Thanks for all the feedback! It gives me more possibilities to work out this issue.
I'll check out update-smart-drivedb to see if I can get it to work, probably a bit later to be able to dive into it more thoroughly.
Contacting Bruce Allen is a good idea, but I guess it would only make sense if several people have encountered this type of drive.
 