SSD boot drive not added to smartd.conf

Status
Not open for further replies.

artix lumren

Cadet
Joined
May 25, 2016
Messages
1
Hello everyone,

I'm testing my build to check that everything is working as expected.

Here is the build :
Supermicro X11SSL-F
Intel Core i3-6100
Kingston Memory DDR4 16GB 2133MHz ECC
Seasonic FOCUS Plus 550 Gold
Samsung ssd 830 pro 128GB (Detected has 840)
2x WD Red - 3.5" SATA 6Gb/s - 6TB (will be 8 with a m1015)

Freenas version:
Code:
[root@NAS-02 /]# uname -a
FreeBSD local.info 11.0-STABLE FreeBSD 11.0-STABLE #0 r321665+25fe8ba8d06(freenas/11.0-stable): Mon Sep 25 06:24:11 UTC 2017	 root@gauntlet:/freenas-11-releng/freenas/_BE/objs/freenas-11-releng/freenas/_BE/os/sys/FreeNAS.amd64  amd64


First i had only the ssd connected with freenas installed on it. (connected on MB AHCI)
In this configuration, smartd was complaining that /usr/local/etc/smartd.conf was empty. I started to investigate and come to the conclusion that following that script :
/etc/ix.rc.d/ix-smartd
this command was returning empty :
sqlite3 /data/freenas-v1.db "select * from storage_disk;"

So... i connected 2 WD drives to the MB to test further.

Code:
[root@NAS-02 /]#  cat /usr/local/etc/smartd.conf
################################################
# smartd.conf generated by /etc/ix.rc.d/ix-smartd
################################################
/dev/ada1 -a -n never -W 3,25,45 -m email -M exec /usr/local/www/freenasUI/tools/smart_alert.py
/dev/ada2 -a -n never -W 3,25,45 -m email -M exec /usr/local/www/freenasUI/tools/smart_alert.py



Code:
[root@NAS-02 /]# sqlite3 /data/freenas-v1.db "select * from storage_disk;"
|Auto|Always On|Disabled|Disabled|1||{serial_lunid}WD-WX71DA4A0EPN_50014ee260e8b9bd|ada1|1|WD-WX71DA4A0EPN|||ada|1|6001175126016
|Auto|Always On|Disabled|Disabled|1||{serial_lunid}WD-WX11DC44910J_50014ee20b931ab5|ada2|1|WD-WX11DC44910J|||ada|2|6001175126016


Code:
[root@NAS-02 /]# camcontrol devlist -v
scbus0 on ahcich0 bus 0:
<>								 at scbus0 target -1 lun ffffffff ()
scbus1 on ahcich1 bus 0:
<>								 at scbus1 target -1 lun ffffffff ()
scbus2 on ahcich2 bus 0:
<Samsung SSD 840 PRO Series DXM06B0Q>  at scbus2 target 0 lun 0 (pass0,ada0)
<>								 at scbus2 target -1 lun ffffffff ()
scbus3 on ahcich3 bus 0:
<WDC WD60EFRX-68MYMN1 82.00A82>	at scbus3 target 0 lun 0 (pass1,ada1)
<>								 at scbus3 target -1 lun ffffffff ()
scbus4 on ahcich4 bus 0:
<WDC WD60EFRX-68MYMN1 82.00A82>	at scbus4 target 0 lun 0 (pass2,ada2)
<>								 at scbus4 target -1 lun ffffffff ()
scbus5 on ahcich5 bus 0:
<>								 at scbus5 target -1 lun ffffffff ()
scbus6 on camsim0 bus 0:
<>								 at scbus6 target -1 lun ffffffff ()
scbus-1 on xpt0 bus 0:
<>								 at scbus-1 target -1 lun ffffffff (xpt0)


I checked that my ssd is working correctly with smartd :

Code:
[root@NAS-02 /]# smartctl -a /dev/ada0
smartctl 6.5 2016-05-07 r4318 [FreeBSD 11.0-STABLE amd64] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:	 Samsung based SSDs
Device Model:	 Samsung SSD 840 PRO Series
Serial Number:	S12PNEAD417379T
LU WWN Device Id: 5 002538 5502e087e
Firmware Version: DXM06B0Q
User Capacity:	128,035,676,160 bytes [128 GB]
Sector Size:	  512 bytes logical/physical
Rotation Rate:	Solid State Device
Device is:		In smartctl database [for details use: -P show]
ATA Version is:   ACS-2, ATA8-ACS T13/1699-D revision 4c
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:	Mon Nov 27 22:56:09 2017 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
										was never started.
										Auto Offline Data Collection: Disabled.
Self-test execution status:	  (   0) The previous self-test routine completed
										without error or no self-test has ever
										been run.
Total time to complete Offline
data collection:				(65476) seconds.
Offline data collection
capabilities:					(0x53) SMART execute Offline immediate.
										Auto Offline data collection on/off support.
										Suspend Offline collection upon new
										command.
										No Offline surface scan supported.
										Self-test supported.
										No Conveyance Self-test supported.
										Selective Self-test supported.
SMART capabilities:			(0x0003) Saves SMART data before entering
										power-saving mode.
										Supports SMART auto save timer.
Error logging capability:		(0x01) Error logging supported.
										General Purpose Logging supported.
Short self-test routine
recommended polling time:		(   2) minutes.
Extended self-test routine
recommended polling time:		(  15) minutes.
SCT capabilities:			  (0x003d) SCT Status supported.
										SCT Error Recovery Control supported.
										SCT Feature Control supported.
										SCT Data Table supported.

SMART Attributes Data Structure revision number: 1
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME		  FLAG	 VALUE WORST THRESH TYPE	  UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0033   100   100   010	Pre-fail  Always	   -	   0
  9 Power_On_Hours		  0x0032   096   096   000	Old_age   Always	   -	   19243
 12 Power_Cycle_Count	   0x0032   099   099   000	Old_age   Always	   -	   35
177 Wear_Leveling_Count	 0x0013   092   092   000	Pre-fail  Always	   -	   273
179 Used_Rsvd_Blk_Cnt_Tot   0x0013   100   100   010	Pre-fail  Always	   -	   0
181 Program_Fail_Cnt_Total  0x0032   100   100   010	Old_age   Always	   -	   0
182 Erase_Fail_Count_Total  0x0032   100   100   010	Old_age   Always	   -	   0
183 Runtime_Bad_Block	   0x0013   100   100   010	Pre-fail  Always	   -	   0
187 Uncorrectable_Error_Cnt 0x0032   100   100   000	Old_age   Always	   -	   0
190 Airflow_Temperature_Cel 0x0032   074   060   000	Old_age   Always	   -	   26
195 ECC_Error_Rate		  0x001a   200   200   000	Old_age   Always	   -	   0
199 CRC_Error_Count		 0x003e   100   100   000	Old_age   Always	   -	   0
235 POR_Recovery_Count	  0x0012   099   099   000	Old_age   Always	   -	   27
241 Total_LBAs_Written	  0x0032   099   099   000	Old_age   Always	   -	   4586478946

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description	Status				  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline	   Completed without error	   00%	 19241		 -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
	1		0		0  Not_testing
	2		0		0  Not_testing
	3		0		0  Not_testing
	4		0		0  Not_testing
	5		0		0  Not_testing
  255		0	65535  Read_scanning was never started
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.


Can I come to the conclusion that boot hdd are not smart health checked ??

I tried to get my health check done but, after reading on the forums, i come to the conclusion that it was not that simple to add something in smartd.conf manually after reboot.

So i was looking at why the storage_disk database is missing the entry for the ssd but i didn't found where the sql "INSERT" are done.

I'm not interested in smart short/long test for sdd... but i really want to have health checked.

Thank you for your time.

Artix
 

Chris Moore

Hall of Famer
Joined
May 2, 2015
Messages
10,080
Running a smart test is how you check but the GUI doesn't include the settings for this. I run a command line through cron.
Have you looked at the scripts in the resources?

Sent from my SAMSUNG-SGH-I537 using Tapatalk
 

joeschmuck

Old Man
Moderator
Joined
May 28, 2011
Messages
10,994
@artix lumren

I'm a bit confused so I appoligize but I'm going to ask you to make this posting a bit clearer, please answer my questions below. And treat me like an idiot who has no clue what is going on, it helps to avoid assumptions. Assumptions can really cause issues trying to help someone and I like to avoid them if I can.

1) With just the SSD installed, are you saying that FreeNAS is producing an error message about smart.d or any other error message which conserns you?
or
2) Are you asking a question about why the boot SSD is not in smart.d but there are no error messages?
or
3) Are you just trying to get a health status of your SSD emailed to you periodically?

I'm trying to understand what you mean so if I'm way off base, please correct me.

If you do have an error message, please post it completely.
 

Artix

Cadet
Joined
May 25, 2016
Messages
8
Thank you both for your answers

Running a smart test is how you check but the GUI doesn't include the settings for this. I run a command line through cron.
Have you looked at the scripts in the resources?
Just looked at some of the scripts... thanks for the suggestion i will probably use one of them

@artix lumren

1) With just the SSD installed, are you saying that FreeNAS is producing an error message about smart.d or any other error message which conserns you?
or
2) Are you asking a question about why the boot SSD is not in smart.d but there are no error messages?
or
3) Are you just trying to get a health status of your SSD emailed to you periodically?

I'm trying to understand what you mean so if I'm way off base, please correct me.

If you do have an error message, please post it completely.
No problem, i will try to explain better. It's a bit of your 3 questions so i will answer all of them :)

1) Yes, with just the SSD installed, FreeNAS is producing an error in the notification about smart.d not starting.
Something like :
Code:
Configuration file /usr/local/etc/smartd.conf parsed but has no entries

The error message is just the sign that it's not working has i want to, i'm not concerned about the error itself.

2) Yes, I'm asking a question about why the boot SSD is not in smart.d (there is an error)

3) Trying to get smart health check for my SSD is what concerns me.
I want my boot SSD to be health checked periodically like my storage HDD. Emailed when something goes wrong.
I will probably need to go the script way if freenas is not doing it.
But i think it should be able to do it... I looked at github to check code but seems not easy to add that feature.

Please feel free to ask more questions if i'm not clear
 

joeschmuck

Old Man
Moderator
Joined
May 28, 2011
Messages
10,994
Thanks, you were clear.

The scripts are an easy way to get the report you desire.
 

Chris Moore

Hall of Famer
Joined
May 2, 2015
Messages
10,080
Trying to get smart health check for my SSD is what concerns me.
As an example, the following is the output of the email generated by one of the scripts. The last two drives on the list are my boot drives.
Code:
########## SMART status report summary for all drives on server EMILY-NAS ##########

+------+------------------+----+-----+-----+-----+-------+-------+--------+------+----------+------+-------+----+
|Device|Serial			|Temp|Power|Start|Spin |ReAlloc|Current|Offline |Seek  |Total	 |High  |Command|Last|
|	  |Number			|	|On   |Stop |Retry|Sectors|Pending|Uncorrec|Errors|Seeks	 |Fly   |Timeout|Test|
|	  |				  |	|Hours|Count|Count|	   |Sectors|Sectors |	  |		  |Writes|Count  |Age |
+------+------------------+----+-----+-----+-----+-------+-------+--------+------+----------+------+-------+----+
|da0   |Z4Z2			  | 30 | 5370|   95|	0|	  0|	  0|	   0|	 0|  50498915|	 3|	  0|   0|
|da3   |W4Z2			  | 29 |11002|   69|	0|	  0|	  0|	   0|	 0|  56127175|	 1|	  0|   0|
|da5   |Z307			  | 30 | 2368|   37|	0|	  0|	  0|	   0|	 0|  10095892|	 0|	  0|   0|
|da1   |Z307			  | 30 | 4587|   39|	0|	  0|	  0|	   0|	 0|  10903623|	 0|	  0|   0|
|da4   |Z307			  | 30 | 4947|   80|	0|	  0|	  0|	   0|	 0|  11745926|	 0|	  0|   0|
|da2   |Z307			  | 30 | 5868|   72|	0|	  0|	  0|	   0|	 0|  13728111|	 0|	  0|   0|
|da9   |Z4Z2			  | 30 | 8440| 1301|	0|	  0|	  0|	   0|	 0|  81935258|	 2|	  0|   0|
|da10  |Z1E0			  | 31 |  490|   10|	0|	  0|	  0|	   0|	 0|   4569656|	 0|	  0|   0|
|da11  |Z1E0			  | 31 |  490|   10|	0|	  0|	  0|	   0|	 0|   4634631|	 0|	  0|   0|
|da7   |Z4Z3			  | 29 | 9026|   21|	0|	  0|	  0|	   0|	 1|  26987776|	 0|	  0|   0|
|da8   |Z305			  | 29 | 2241|	6|	0|	  0|	  0|	   0|	 0|  13865003|	 0|	  0|   0|
|da6   |Z305			  | 30 | 2256|	8|	0|	  0|	  0|	   0|	 0|  14441917|	 0|	  0|   0|
|ada2  |W4J1			  | 27 | 5792|   43|	0|	  0|	  0|	   0|	 0| 751781779|	 0|	  0|   0|
|ada5  |W4J1			  | 27 | 5792|   47|	0|	  0|	  0|	   0|	 0| 741032185|	 0|	  0|   0|
|ada3  |W4J1			  | 27 | 5792|   43|	0|	  0|	  0|	   0|	 0| 739679314|	 0|	  0|   0|
|ada4  |W4J1			  | 28 | 5792|   42|	0|	  0|	  0|	   0|	 0| 734416361|	 0|	  0|   0|
|ada0  |NZ05T772		  | 35 |12963|  118|	0|	  0|	  0|	   0|	 0|	   428|   N/A|	N/A|   0|
|ada1  |NZ05T772		  | 35 |12964|  113|	0|	  0|	  0|	   0|	 0|	  1063|   N/A|	N/A|   0|
+------+------------------+----+-----+-----+-----+-------+-------+--------+------+----------+------+-------+----+
 

Artix

Cadet
Joined
May 25, 2016
Messages
8
Updated to freenas 11.1

Something has changed, now my /dev/ada0 ssd is showing everywhere !

In file /usr/local/etc/smartd.conf

In command
Code:
 sqlite3 /data/freenas-v1.db "select * from storage_disk;"


And even is the interface for smart tests !!

https://redmine.ixsystems.com/issues/24237

Thanks :)
 
Status
Not open for further replies.
Top