SOLVED Replaced 2 drives, SMART tests scheduled, but not running...?

Joined
Jun 26, 2012
Messages
260
I really thought I posted this already, but can't find it...Anyway

I replaced 2 drives following the usual instructions.
I added ada3 and ada5 back into the SMART Short/Long test schedules.
The SMART tests are running for the other "original" drives, but NOT for the 2 new drives.

Did I miss something simple to reacquire the new drives?

SMART.png
 

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,700
I think I ran into this before too... you need to create a new scheduled task and kill the old one if you change anything other than the schedule (or use the all disks option).

I have recently replaced a disk (da17 for me) and added it back to the task... I'll know in a few days if it's running the long tests as scheduled for it or if I will need to recreate the task.
 
Joined
Jun 26, 2012
Messages
260
I've created the new tasks and deleted the old.
I'll update if that works or not...

Thanks!
 
Joined
Jun 26, 2012
Messages
260
I think I ran into this before too... you need to create a new scheduled task and kill the old one if you change anything other than the schedule (or use the all disks option).

I have recently replaced a disk (da17 for me) and added it back to the task... I'll know in a few days if it's running the long tests as scheduled for it or if I will need to recreate the task.
Did this work for you? Did not for me. The 2 new drivers are not registering as having a smart test completed.
 

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,700
So the new/replaced da17 had not run a long test in the past 2 weeks (even though it should happen) with the original task settings.

I have now created a new task and will monitor. I have seen a new task work successfully before, so I expect it to work now.
 

Jailer

Not strong, but bad
Joined
Sep 12, 2014
Messages
4,977
In the past you had to go in and select the new dives in the scheduled smart test. I've never had to re create the scheduled task, just re select all the drives and save the changes.
 

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,700
If you use the "all drives" option (like I do for the short tests) it's fine, all drives are always covered.

If you pick only some, any replaced drive is never re-scanned by that job even if re-added.

I have just re-confirmed it here and had seen it in the past and not thought much more about it.
 
Joined
Jun 26, 2012
Messages
260
If you use the "all drives" option (like I do for the short tests) it's fine, all drives are always covered.

If you pick only some, any replaced drive is never re-scanned by that job even if re-added.

I have just re-confirmed it here and had seen it in the past and not thought much more about it.
Where is the "all drives" option? In the SMART Test scheduler, I only see checkboxes for each individual drive.
freenas11.png
 

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,700
It's in TrueNAS... I guess not in the version you're on (still FreeNAS according to that screenshot... and your signature).
 

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,700
I was also going to say that for reasons of limiting heat, I split my long tests between evens and odds, so need those to be selective, hence noticing the problem with the newest disk, which despite being scheduled for a long test every 2 weeks, shows this:

Code:
SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%       822         -
# 2  Short offline       Completed without error       00%       798         -
# 3  Short offline       Completed without error       00%       774         -
# 4  Short offline       Completed without error       00%       750         -
# 5  Short offline       Completed without error       00%       726         -
# 6  Short offline       Completed without error       00%       702         -
# 7  Short offline       Completed without error       00%       678         -
# 8  Short offline       Completed without error       00%       654         -
# 9  Short offline       Completed without error       00%       630         -
#10  Short offline       Completed without error       00%       606         -
#11  Short offline       Completed without error       00%       582         -
#12  Short offline       Completed without error       00%       558         -
#13  Short offline       Completed without error       00%       534         -
#14  Short offline       Completed without error       00%       510         -
#15  Short offline       Completed without error       00%       486         -
#16  Short offline       Completed without error       00%       462         -
#17  Short offline       Completed without error       00%       438         -
#18  Short offline       Completed without error       00%       414         -
#19  Short offline       Completed without error       00%       390         -
#20  Short offline       Completed without error       00%       366         -
#21  Short offline       Completed without error       00%       342         -


So only the "all disks" option is working for it.
 

Ericloewe

Server Wrangler
Moderator
Joined
Feb 15, 2014
Messages
20,194
It's in TrueNAS... I guess not in the version you're on (still FreeNAS according to that screenshot... and your signature).
It's in 11.3, too, if not earlier.
 
Joined
Jun 26, 2012
Messages
260
ok ok...time to upgrade the OS
 
Joined
Jun 26, 2012
Messages
260
I really thought I posted this already, but can't find it...Anyway

I replaced 2 drives following the usual instructions.
I added ada3 and ada5 back into the SMART Short/Long test schedules.
The SMART tests are running for the other "original" drives, but NOT for the 2 new drives.

Did I miss something simple to reacquire the new drives?

View attachment 49309
Updated the OS, now running 11.3 U5
and the SMART Status Report Summary is unreadable in the email (table is all messed up).
I'll need to see if I can find the thread that showed how to set that up...and any updates to the script that obviously need to be done.

If anyone wants to link or provide the instruction, that would be great.
(see below for what it looks like now...it continues, I didn't copy the whole thing).

Code:
<br><br>
<table style="border: 1px solid black; border-collapse: collapse;">
<tr><th colspan="9" style="text-align:center; font-size:20px; height:40px; font-family:courier;">ZPool Status Report Summary</th></tr>
<tr>
<th style="text-align:center; width:130px; height:60px; border:1px solid black; border-collapse:collapse; font-family:courier;">Pool<br>Name</th>
<th style="text-align:center; width:130px; height:60px; border:1px solid black; border-collapse:collapse; font-family:courier;">Status</th>
<th style="text-align:center; width:130px; height:60px; border:1px solid black; border-collapse:collapse; font-family:courier;">Read<br>Errors</th>
<th style="text-align:center; width:130px; height:60px; border:1px solid black; border-collapse:collapse; font-family:courier;">Write<br>Errors</th>
<th style="text-align:center; width:130px; height:60px; border:1px solid black; border-collapse:collapse; font-family:courier;">Cksum<br>Errors</th>
<th style="text-align:center; width:130px; height:60px; border:1px solid black; border-collapse:collapse; font-family:courier;">Used %</th>
<th style="text-align:center; width:130px; height:60px; border:1px solid black; border-collapse:collapse; font-family:courier;">Scrub<br>Repaired<br>Bytes</th>
<th style="text-align:center; width:130px; height:60px; border:1px solid black; border-collapse:collapse; font-family:courier;">Scrub<br>Errors</th>
<th style="text-align:center; width:130px; height:60px; border:1px solid black; border-collapse:collapse; font-family:courier;">Last<br>Scrub<br>Age</th>
</tr>
 
Joined
Jun 26, 2012
Messages
260
OK, found the updated script (sad face, it's not as pretty as it used to be in email...)
and my original issue is solved. The "All Drives" option, now available in 11.3-U5, gets the job done.

Thanks everyone!
 
Joined
Jun 26, 2012
Messages
260
Upon further review...not working out so well. After choosing the "All Drives" option, SMART tests were run.
But now, I'm hitting the same issue. ada3 and ada5 are 29 days past, while all the other drives just had their tests run yesterday.




Code:
Device|Serial                  |Temp| Power|Start|Spin |ReAlloc|Current|Offline |Seek  |Total     |High  |    Command|Last|
|      |Number                  |    | On   |Stop |Retry|Sectors|Pending|Uncorrec|Errors|Seeks     |Fly   |    Timeout|Test|
|      |                        |    | Hours|Count|Count|       |Sectors|Sectors |      |          |Writes|    Count  |Age |
+------+------------------------+----+------+-----+-----+-------+-------+--------+------+----------+------+-----------+----+
|da0 ? |WD-WCC4N1ZK0HKS         |27  | 37408|   51|    0|      0|      0|       0|   N/A|       N/A|   N/A|        N/A|  8*|
|ada0  |170794457314            |37  | 36060|   99|     |       |       |        |   N/A|       N/A|   N/A|          0|   1|
|ada1  |WD-WCC4N1ZK03JV         |29  | 37408|   50|    0|      0|      0|       0|   N/A|       N/A|   N/A|        N/A|   1|
|ada2  |WD-WCC4N2VRX064         |30  | 37408|   50|    0|      0|      0|       0|   N/A|       N/A|   N/A|        N/A|   1|
|ada3 ?|WD-WX51D89KS166         |26  |  3005|    7|    0|      0|      0|       0|   N/A|       N/A|   N/A|        N/A| 29*|
|ada4  |WD-WCC4N2YJV3S8         |26  | 17649|   20|    0|      0|      0|       0|   N/A|       N/A|   N/A|        N/A|   1|
|ada5 ?|WD-WX61D89A8U87         |25  |  3005|    7|    0|      0|      0|       0|   N/A|       N/A|   N/A|        N/A| 29*|
+------+------------------------+----+------+-----+-----+-------+-------+--------+------+----------+------+-----------+----+

########## SATA drive /dev/da0 Serial: WD-WCC4N1ZK0HKS
########## Western Digital Red (WDC WD30EFRX-68EUZN0)

SMART overall-health self-assessment test result: PASSED

ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   183   181   021    Pre-fail  Always       -       5841
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       51
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   049   049   000    Old_age   Always       -       37408
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       51
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       49
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       532
194 Temperature_Celsius     0x0022   123   109   000    Old_age   Always       -       27
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

No Errors Logged

Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
Short offline       Completed without error       00%     37212         -

########## SATA drive /dev/ada0 Serial: 170794457314
########## SandForce Driven SSDs (SanDisk SDSSDA120G)

SMART overall-health self-assessment test result: PASSED

ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  5 Retired_Block_Count     0x0032   100   100   000    Old_age   Always       -       0
  9 Power_On_Hours_and_Msec 0x0032   000   100   000    Old_age   Always       -       36060h+00m+00.000s
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       99
165 Unknown_Attribute       0x0032   100   100   000    Old_age   Always       -       4301717521
166 Unknown_Attribute       0x0032   100   100   000    Old_age   Always       -       0
167 Unknown_Attribute       0x0032   100   100   000    Old_age   Always       -       0
168 Unknown_Attribute       0x0032   100   100   000    Old_age   Always       -       1
169 Unknown_Attribute       0x0032   100   100   000    Old_age   Always       -       0
170 Reserve_Block_Count     0x0032   100   100   000    Old_age   Always       -       0
171 Program_Fail_Count      0x0032   100   100   000    Old_age   Always       -       0
172 Erase_Fail_Count        0x0032   100   100   000    Old_age   Always       -       0
173 Unknown_SandForce_Attr  0x0032   100   100   000    Old_age   Always       -       0
174 Unexpect_Power_Loss_Ct  0x0032   100   100   000    Old_age   Always       -       57
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Command_Timeout         0x0032   100   100   000    Old_age   Always       -       0
194 Temperature_Celsius     0x0022   065   054   000    Old_age   Always       -       35 (Min/Max 0/46)
199 SATA_CRC_Error_Count    0x0032   100   100   000    Old_age   Always       -       0
230 Life_Curve_Status       0x0032   100   100   000    Old_age   Always       -       21474836485
232 Available_Reservd_Space 0x0033   100   100   004    Pre-fail  Always       -       100
233 SandForce_Internal      0x0032   100   100   000    Old_age   Always       -       1
234 SandForce_Internal      0x0032   100   100   000    Old_age   Always       -       78
241 Lifetime_Writes_GiB     0x0030   253   253   000    Old_age   Offline      -       29
242 Lifetime_Reads_GiB      0x0030   253   253   000    Old_age   Offline      -       357
244 Unknown_Attribute       0x0032   000   100   000    Old_age   Always       -       0

No Errors Logged

Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
Short offline       Completed without error       00%     36033         -

########## SATA drive /dev/ada1 Serial: WD-WCC4N1ZK03JV
########## Western Digital Red (WDC WD30EFRX-68EUZN0)

SMART overall-health self-assessment test result: PASSED

ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   174   174   021    Pre-fail  Always       -       6258
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       50
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   049   049   000    Old_age   Always       -       37408
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       50
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       17
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       560
194 Temperature_Celsius     0x0022   121   109   000    Old_age   Always       -       29
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

No Errors Logged

Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
Short offline       Completed without error       00%     37380         -

########## SATA drive /dev/ada2 Serial: WD-WCC4N2VRX064
########## Western Digital Red (WDC WD30EFRX-68EUZN0)

SMART overall-health self-assessment test result: PASSED

ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   177   176   021    Pre-fail  Always       -       6108
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       50
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   049   049   000    Old_age   Always       -       37408
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       50
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       17
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       568
194 Temperature_Celsius     0x0022   120   109   000    Old_age   Always       -       30
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

No Errors Logged

Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
Short offline       Completed without error       00%     37380         -

########## SATA drive /dev/ada3 Serial: WD-WX51D89KS166
########## Western Digital Red (SMR) (WDC WD30EFAX-68JH4N0)

SMART overall-health self-assessment test result: PASSED

ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   199   199   021    Pre-fail  Always       -       3050
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       7
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   096   096   000    Old_age   Always       -       3005
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       7
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       5
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       1
194 Temperature_Celsius     0x0022   121   117   000    Old_age   Always       -       26
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

No Errors Logged

Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
Extended offline    Interrupted (host reset)      10%      2314         -

########## SATA drive /dev/ada4 Serial: WD-WCC4N2YJV3S8
########## Western Digital Red (WDC WD30EFRX-68EUZN0)

SMART overall-health self-assessment test result: PASSED

ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   179   179   021    Pre-fail  Always       -       6033
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       20
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   076   076   000    Old_age   Always       -       17649
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       20
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       10
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       475
194 Temperature_Celsius     0x0022   124   110   000    Old_age   Always       -       26
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

No Errors Logged

Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
Short offline       Completed without error       00%     17621         -

########## SATA drive /dev/ada5 Serial: WD-WX61D89A8U87
########## Western Digital Red (SMR) (WDC WD30EFAX-68JH4N0)

SMART overall-health self-assessment test result: PASSED

ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   200   200   021    Pre-fail  Always       -       2983
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       7
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   096   096   000    Old_age   Always       -       3005
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       7
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       2
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       4
194 Temperature_Celsius     0x0022   122   115   000    Old_age   Always       -       25
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

No Errors Logged

Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
Extended offline    Interrupted (host reset)      10%      2314         -
 
Top