SOLVED Alert: unrecoverable error

Jerren · Jun 30, 2020

So i've been trying to figure this out and sometimes it stays away for a week or so, sometimes it's instantly back. Basically it's an alert i get.

Boot pool status is ONLINE: One or more devices has experienced an unrecoverable error. An attempt was made to correct the error. Applications are unaffected..

So i went looking in other topics where i found someone recommending to zpool status -v, which gives me the following result.

Code:

 state: ONLINE
  scan: scrub repaired 0 in 0 days 08:00:16 with 0 errors on Sun Jun 28 08:00:16 2020
config:

        NAME                                            STATE     READ WRITE CKSUM
        PoolA                                          ONLINE       0     0 0
          raidz2-0                                      ONLINE       0     0 0
            gptid/8917829c-7dc0-11ea-a619-7824af43cfa9  ONLINE       0     0 0
            gptid/716454bd-9787-11ea-a381-7824af43cfa9  ONLINE       0     0 0
            gptid/898dbed0-7dc0-11ea-a619-7824af43cfa9  ONLINE       0     0 0
            gptid/897ceb83-7dc0-11ea-a619-7824af43cfa9  ONLINE       0     0 0
            gptid/899c4e6c-7dc0-11ea-a619-7824af43cfa9  ONLINE       0     0 0
            gptid/8fa6b8ea-9787-11ea-a381-7824af43cfa9  ONLINE       0     0 0

errors: No known data errors

  pool: freenas-boot
 state: ONLINE
status: One or more devices has experienced an unrecoverable error.  An
        attempt was made to correct the error.  Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
        using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://illumos.org/msg/ZFS-8000-9P
  scan: scrub repaired 4K in 0 days 00:05:15 with 0 errors on Fri Jun 26 03:50:15 2020
config:

        NAME        STATE     READ WRITE CKSUM
        freenas-boot  ONLINE       0     0     0
          mirror-0  ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0  block size: 512B configured, 4096B native
            ada1p2  ONLINE       0     0     4

errors: No known data errors

  pool: ssd
 state: ONLINE
  scan: scrub repaired 144K in 0 days 00:32:09 with 0 errors on Sun Jun 21 00:32:09 2020
config:

        NAME                                            STATE     READ WRITE CKSUM
        ssd                                             ONLINE       0     0 0
          mirror-0                                      ONLINE       0     0 0

Now i can see that one of my SSD's in the freenas boot mirror has a 4 under checksum and according to other topics this can be caused by SMART data of the drives. 1 of the SSD's is brandnew and the other one is the first SSD i ever owned but since it was only 120gb i didn't use it anymore (still worked fine for me).

Next someone said you need the device and run smartctl -a /dev/ada1 . Which in my case is ada1 and gave me the following:

Code:


/dev/ada1p2: Unable to detect device type
Please specify device type with the -d option.

Use smartctl -h to get a usage summary

root@Brisingr[~]# smartctl -a /dev/ada1
smartctl 7.0 2018-12-30 r4883 [FreeBSD 11.3-RELEASE-p9 amd64] (local build)
Copyright (C) 2002-18, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     WD Blue / Red / Green SSDs
Device Model:     WDC WDS120G2G0A-00JH30
Serial Number:    2003BQ461911
LU WWN Device Id: 5 001b44 4a830af06
Firmware Version: UE510000
User Capacity:    120,040,980,480 bytes [120 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
Form Factor:      2.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2 T13/2015-D revision 3
SATA Version is:  SATA 3.2, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Tue Jun 30 21:56:14 2020 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (  120) seconds.
Offline data collection
capabilities:                    (0x15) SMART execute Offline immediate.
                                        No Auto Offline data collection support.
                                        Abort Offline collection upon new
                                        command.
                                        No Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        No Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        (  21) minutes.

SMART Attributes Data Structure revision number: 1
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0032   100   100   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       1929
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       15
165 Block_Erase_Count       0x0032   100   100   000    Old_age   Always       -       379
166 Minimum_PE_Cycles_TLC   0x0032   100   100   ---    Old_age   Always       -       2
167 Max_Bad_Blocks_per_Die  0x0032   100   100   ---    Old_age   Always       -       0
168 Maximum_PE_Cycles_TLC   0x0032   100   100   ---    Old_age   Always       -       5
169 Total_Bad_Blocks        0x0032   100   100   ---    Old_age   Always       -       225
170 Grown_Bad_Blocks        0x0032   100   100   ---    Old_age   Always       -       0
171 Program_Fail_Count      0x0032   100   100   000    Old_age   Always       -       0
172 Erase_Fail_Count        0x0032   100   100   000    Old_age   Always       -       0
173 Average_PE_Cycles_TLC   0x0032   100   100   000    Old_age   Always       -       2
174 Unexpected_Power_Loss   0x0032   100   100   000    Old_age   Always       -       10
184 End-to-End_Error        0x0032   100   100   ---    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Command_Timeout         0x0032   100   100   ---    Old_age   Always       -       0
194 Temperature_Celsius     0x0022   065   051   000    Old_age   Always       -       35 (Min/Max 18/51)
199 UDMA_CRC_Error_Count    0x0032   100   100   ---    Old_age   Always       -       0
230 Media_Wearout_Indicator 0x0032   100   100   000    Old_age   Always       -       0x010800280108
232 Available_Reservd_Space 0x0033   100   100   005    Pre-fail  Always       -       100
233 NAND_GB_Written_TLC     0x0032   100   100   ---    Old_age   Always       -       262
234 NAND_GB_Written_SLC     0x0032   100   100   000    Old_age   Always       -       2115
241 Host_Writes_GiB         0x0030   100   100   000    Old_age   Offline      -       777
242 Host_Reads_GiB          0x0030   100   100   000    Old_age   Offline      -       37
244 Temp_Throttle_Status    0x0032   000   100   ---    Old_age   Always       -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%      1913         -
# 2  Short offline       Completed without error       00%      1889         -
# 3  Extended offline    Completed without error       00%      1860         -
# 4  Short offline       Completed without error       00%      1841         -
# 5  Short offline       Completed without error       00%      1817         -
# 6  Short offline       Completed without error       00%      1794         -
# 7  Short offline       Completed without error       00%      1770         -
# 8  Short offline       Completed without error       00%      1745         -
# 9  Short offline       Completed without error       00%      1721         -
#10  Short offline       Completed without error       00%      1673         -
#11  Short offline       Completed without error       00%      1649         -
#12  Short offline       Completed without error       00%      1625         -
#13  Short offline       Completed without error       00%      1601      -
#14  Short offline       Completed without error       00%      1577         -
#15  Extended offline    Completed without error       00%      1524         -
#16  Extended offline    Self-test routine in progress 40%      1524         -

Selective Self-tests/Logging not supported

And that points me to my brandnew wd green 120gb ssd. I hope i provided enough information so that you can help me figure out why i keep getting that notification about the unrecoverable error.

Jerren · Jul 3, 2020

So does anyone have any idea why i would get that notification with a brand new ssd that seems to work fine otherwise?

Jerren · Jul 5, 2020

Since it might help figure this out id did the command on my other ssd in the bootpool as well. (smartctl -a /dev/ada0)

Code:

root@Brisingr[~]# smartctl -a /dev/ada0
smartctl 7.0 2018-12-30 r4883 [FreeBSD 11.3-RELEASE-p9 amd64] (local build)
Copyright (C) 2002-18, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     SandForce Driven SSDs
Device Model:     Corsair Force 3 SSD
Serial Number:    11476502000008955500
LU WWN Device Id: 0 000000 000000000
Firmware Version: 1.3.3
User Capacity:    120,034,123,776 bytes [120 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ATA8-ACS, ACS-2 T13/2015-D revision 3
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Sun Jul  5 14:48:50 2020 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x02) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   1) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                ( 1465) seconds.
Offline data collection
capabilities:                    (0x7f) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Abort Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        (  48) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.
SCT capabilities:              (0x0021) SCT Status supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   089   089   050    Pre-fail  Always       -       0/162780416
  5 Retired_Block_Count     0x0033   100   100   003    Pre-fail  Always       -       0
  9 Power_On_Hours_and_Msec 0x0032   059   059   000    Old_age   Always       -       36270h+46m+42.370s
12 Power_Cycle_Count       0x0032   097   097   000    Old_age   Always       -       3317
171 Program_Fail_Count      0x0032   000   000   000    Old_age   Always       -       0
172 Erase_Fail_Count        0x0032   000   000   000    Old_age   Always       -       0
174 Unexpect_Power_Loss_Ct  0x0030   000   000   000    Old_age   Offline      -       93
177 Wear_Range_Delta        0x0000   000   000   000    Old_age   Offline      -       7
181 Program_Fail_Count      0x0032   000   000   000    Old_age   Always       -       0
182 Erase_Fail_Count        0x0032   000   000   000    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
194 Temperature_Celsius     0x0022   030   030   000    Old_age   Always       -       30 (Min/Max 30/30)
195 ECC_Uncorr_Error_Count  0x001c   120   120   000    Old_age   Offline      -       0/162780416
196 Reallocated_Event_Count 0x0033   100   100   003    Pre-fail  Always       -       0
201 Unc_Soft_Read_Err_Rate  0x001c   120   120   000    Old_age   Offline      -       0/162780416
204 Soft_ECC_Correct_Rate   0x001c   120   120   000    Old_age   Offline      -       0/162780416
230 Life_Curve_Status       0x0013   100   100   000    Pre-fail  Always       -       100
231 SSD_Life_Left           0x0013   099   099   010    Pre-fail  Always       -       0
233 SandForce_Internal      0x0000   000   000   000    Old_age   Offline      -       19441
234 SandForce_Internal      0x0032   000   000   000    Old_age   Always       -       16628
241 Lifetime_Writes_GiB     0x0032   000   000   000    Old_age   Always       -       16628
242 Lifetime_Reads_GiB      0x0032   000   000   000    Old_age   Always       -       28043

SMART Error Log not supported

SMART Self-test Log not supported

SMART Selective self-test log data structure revision number 1
SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

Samuel Tai · Jul 5, 2020

Jerren said:
So does anyone have any idea why i would get that notification with a brand new ssd that seems to work fine otherwise?

This is because the SMART drive database needs to be updated. See https://www.ixsystems.com/community...ber-to-try-updating-the-drive-database.84635/

Jerren · Jul 5, 2020

Thanks for the reply, entered it now and it said "updated from trunk (NOT VERIFIED). Hopefully this works!

Samuel Tai · Jul 5, 2020

You can check immediately by running smartctl -i /dev/ada1.

Jerren · Jul 5, 2020

I now get a much short reply on that command:

Code:

=== START OF INFORMATION SECTION ===
Model Family:     WD Blue / Red / Green SSDs
Device Model:     WDC WDS120G2G0A-00JH30
Serial Number:    2003BQ461911
LU WWN Device Id: 5 001b44 4a830af06
Firmware Version: UE510000
User Capacity:    120,040,980,480 bytes [120 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
Form Factor:      2.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2 T13/2015-D revision 3
SATA Version is:  SATA 3.2, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Tue Jun 30 21:56:14 2020 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

I did the -P show command after and i don't know what it said before but now it does say the text below and since i see drive found in smartmontools Database, i think it did indeed work.

Code:

root@Brisingr[~]# smartctl -i /dev/ada1 -P show
smartctl 7.0 2018-12-30 r4883 [FreeBSD 11.3-RELEASE-p9 amd64] (local build)
Copyright (C) 2002-18, Bruce Allen, Christian Franke, www.smartmontools.org

Drive found in smartmontools Database.  Drive identity strings:
MODEL:              WDC WDS120G2G0A-00JH30
FIRMWARE:           UE510000
match smartmontools Drive Database entry:
MODEL REGEXP:       WDC WDBNCE(250|500|00[124])0PNC(-.*)?|WDC  ?WDS((120|240|250|480|500)G|[12]00T)(1B|2B|1G|2G|1R)0[AB](-.*)?
FIRMWARE REGEXP:    .*
MODEL FAMILY:       WD Blue / Red / Green SSDs
ATTRIBUTE OPTIONS:  165 Block_Erase_Count
                    166 Minimum_PE_Cycles_TLC
                    167 Max_Bad_Blocks_per_Die
                    168 Maximum_PE_Cycles_TLC
                    169 Total_Bad_Blocks
                    171 Program_Fail_Count
                    172 Erase_Fail_Count
                    173 Average_PE_Cycles_TLC
                    174 Unexpected_Power_Loss
                    230 Media_Wearout_Indicator
                    233 NAND_GB_Written_TLC
                    234 NAND_GB_Written_SLC
                    241 Host_Writes_GiB
                    244 Temp_Throttle_Status

Thanks alot for the help! If the error comes back so will i to continue the search but untill then, THANKS again!

Jerren · Jul 10, 2020

So i got a new notification joining the "presumed" solved one. So atm i got these 2 now. 1 of which points to my other SSD now the old one, so i did the previous commands (where i thought they mattered, still noob so tell me if i'm wrong).
CRITICAL
Boot pool status is ONLINE: One or more devices has experienced an unrecoverable error. An attempt was made to correct the error. Applications are unaffected..
&
CRITICAL
Device: /dev/ada0, not capable of SMART self-check.

Code:

root@Brisingr[~]# smartctl -a /dev/ada0
smartctl 7.0 2018-12-30 r4883 [FreeBSD 11.3-RELEASE-p9 amd64] (local build)
Copyright (C) 2002-18, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     SandForce Driven SSDs
Device Model:     Corsair Force 3 SSD
Serial Number:    11476502000008955500
LU WWN Device Id: 0 000000 000000000
Firmware Version: 1.3.3
User Capacity:    120,034,123,776 bytes [120 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ATA8-ACS, ACS-2 T13/2015-D revision 3
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Fri Jul 10 23:40:31 2020 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x02) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   1) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                ( 1465) seconds.
Offline data collection
capabilities:                    (0x7f) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Abort Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        (  48) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.
SCT capabilities:              (0x0021) SCT Status supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   094   094   050    Pre-fail  Always       -       0/178081428
  5 Retired_Block_Count     0x0033   100   100   003    Pre-fail  Always       -       0
  9 Power_On_Hours_and_Msec 0x0032   059   059   000    Old_age   Always       -       36399h+38m+10.300s
 12 Power_Cycle_Count       0x0032   097   097   000    Old_age   Always       -       3317
171 Program_Fail_Count      0x0032   000   000   000    Old_age   Always       -       0
172 Erase_Fail_Count        0x0032   000   000   000    Old_age   Always       -       0
174 Unexpect_Power_Loss_Ct  0x0030   000   000   000    Old_age   Offline      -       93
177 Wear_Range_Delta        0x0000   000   000   000    Old_age   Offline      -       7
181 Program_Fail_Count      0x0032   000   000   000    Old_age   Always       -       0
182 Erase_Fail_Count        0x0032   000   000   000    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
194 Temperature_Celsius     0x0022   030   030   000    Old_age   Always       -       30 (Min/Max 30/30)
195 ECC_Uncorr_Error_Count  0x001c   120   120   000    Old_age   Offline      -       0/178081428
196 Reallocated_Event_Count 0x0033   100   100   003    Pre-fail  Always       -       0
201 Unc_Soft_Read_Err_Rate  0x001c   120   120   000    Old_age   Offline      -       0/178081428
204 Soft_ECC_Correct_Rate   0x001c   120   120   000    Old_age   Offline      -       0/178081428
230 Life_Curve_Status       0x0013   100   100   000    Pre-fail  Always       -       100
231 SSD_Life_Left           0x0013   099   099   010    Pre-fail  Always       -       0
233 SandForce_Internal      0x0000   000   000   000    Old_age   Offline      -       19454
234 SandForce_Internal      0x0032   000   000   000    Old_age   Always       -       16641
241 Lifetime_Writes_GiB     0x0032   000   000   000    Old_age   Always       -       16641
242 Lifetime_Reads_GiB      0x0032   000   000   000    Old_age   Always       -       28101
SMART Error Log not supported

SMART Self-test Log not supported

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

What confuses me in that one is that it says my device has "smart support" & that "smart support is enabled" but "smart self-test log is not supported". So is freenas tripping up because the log is not enabled or am i missing something?

Samuel Tai · Jul 10, 2020

In this case, SandForce opted not to support SMART self-test logs. This doesn't have anything to do with the pool being degraded. Please run zpool status -v freenas-boot again to reflect the status after the replacement.

Jerren · Jul 10, 2020

Oh so i have to replace the ssd?

Samuel Tai · Jul 11, 2020

Not necessarily. SMART support isn't binary. Vendors can implement some SMART features or not.

Jerren · Jul 11, 2020

Ah ok i'll try to find my old SSD on corsair's website then and see what they have on there. It's because of "Please run zpool status -v freenas-boot again to reflect the status after the replacement. " the word replacement in there that i thought that.

Jerren · Jul 12, 2020

Ok found something. This post seems to indicate that updating helps the ssd. They even gave a log for before and after and i THINK it shows it turns on logging but i wanted to ask if someone here can take a look at em too, before firmware update - after firmware update .

So if a firmware update helps like it seems to be the case in those, do i need to do the replace command? Or can i just unplug it, plug it into a different machine, update the firmware and plug it back in or do i need to follow certain steps?

Jerren · Jul 13, 2020

I updated the old corsair ssd firmware from 1.3.3 to 5.07 but if i'm reading this correctly it still doesn't support the smart logging.

Code:

root@Brisingr[~]# smartctl -a /dev/ada0
smartctl 7.0 2018-12-30 r4883 [FreeBSD 11.3-RELEASE-p9 amd64] (local build)
Copyright (C) 2002-18, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     SandForce Driven SSDs
Device Model:     Corsair Force 3 SSD
Serial Number:    11476502000008955500
LU WWN Device Id: 0 000000 000000000
Firmware Version: 5.07
User Capacity:    120,034,123,776 bytes [120 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ATA8-ACS, ACS-2 T13/2015-D revision 3
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Mon Jul 13 17:11:10 2020 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (    0) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        (  48) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.
SCT capabilities:              (0x0021) SCT Status supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   100   100   050    Pre-fail  Always       -       0/4679534
  5 Retired_Block_Count     0x0033   100   100   003    Pre-fail  Always       -       0
  9 Power_On_Hours_and_Msec 0x0032   059   059   000    Old_age   Always       -       36464h+54m+35.420s
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -      3
171 Program_Fail_Count      0x0032   000   000   000    Old_age   Always       -       0
172 Erase_Fail_Count        0x0032   000   000   000    Old_age   Always       -       0
174 Unexpect_Power_Loss_Ct  0x0030   000   000   000    Old_age   Offline      -       95
177 Wear_Range_Delta        0x0000   000   000   000    Old_age   Offline      -       7
181 Program_Fail_Count      0x0032   000   000   000    Old_age   Always       -       0
182 Erase_Fail_Count        0x0032   000   000   000    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
194 Temperature_Celsius     0x0022   030   030   000    Old_age   Always       -       30 (Min/Max 30/30)
195 ECC_Uncorr_Error_Count  0x001c   120   120   000    Old_age   Offline      -       0/4679534
196 Reallocated_Event_Count 0x0033   100   100   003    Pre-fail  Always       -       0
201 Unc_Soft_Read_Err_Rate  0x001c   120   120   000    Old_age   Offline      -       0/4679534
204 Soft_ECC_Correct_Rate   0x001c   120   120   000    Old_age   Offline      -       0/4679534
230 Life_Curve_Status       0x0013   100   100   000    Pre-fail  Always       -       100
231 SSD_Life_Left           0x0013   099   099   010    Pre-fail  Always       -       0
233 SandForce_Internal      0x0000   000   000   000    Old_age   Offline      -       19517
234 SandForce_Internal      0x0032   000   000   000    Old_age   Always       -       16713
241 Lifetime_Writes_GiB     0x0032   000   000   000    Old_age   Always       -       16713
242 Lifetime_Reads_GiB      0x0032   000   000   000    Old_age   Always       -       28234

SMART Error Log not supported

SMART Self-test Log not supported

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing

Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

Samuel Tai · Jul 13, 2020

Jerren said:

I updated the old corsair ssd firmware from 1.3.3 to 5.07 but if i'm reading this correctly it still doesn't support the smart logging.

Code:

root@Brisingr[~]# smartctl -a /dev/ada0
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.

SMART Error Log not supported
SMART Self-test Log not supported

SMART Selective self-test log data structure revision number 1
SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing

Doesn't look like it.

Jerren · Aug 12, 2020

So i tried replacing the corsair drive following this youtube guide and everything seemed to work fine. It did it's resilvering but when i checked boot pool status it said degraded. So i went to look at Storage -> Disks and i saw that the drive was in a different pool. One i don't have? It says Boot pool under Pool with the new drive but when i go look at Storage -> Pools i only see my Panzer & ssd pools.

I honestly have no idea how it landed in that pool, i assumed when replacing a disk it would resilver with the other one and then they would both be in the same original pool.

Jerren · Aug 13, 2020

Ok it's a bit cooler today so i tried again and so far it seems to have done everything as it should have. I pulled the new drive out and formatted it, did the whole offline, replace thing and now it shows in the same pool as it should. No idea what went wrong there though.

Important Announcement for the TrueNAS Community.

SOLVED Alert: unrecoverable error

Jerren

Explorer

Jerren

Explorer

Jerren

Explorer

Samuel Tai

Never underestimate your own stupidity

Jerren

Explorer

Samuel Tai

Never underestimate your own stupidity

Jerren

Explorer

Jerren

Explorer

Samuel Tai

Never underestimate your own stupidity

Jerren

Explorer

Samuel Tai

Never underestimate your own stupidity

Jerren

Explorer

Jerren

Explorer

Jerren

Explorer

Samuel Tai

Never underestimate your own stupidity

Jerren

Explorer

Jerren

Explorer

Similar threads

Important Announcement for the TrueNAS Community.

SOLVED Alert: unrecoverable error

Explorer

Explorer

Explorer

Never underestimate your own stupidity

Explorer

Never underestimate your own stupidity

Explorer

Explorer

Never underestimate your own stupidity

Explorer

Never underestimate your own stupidity

Explorer

Explorer

Explorer

Never underestimate your own stupidity

Explorer

Explorer

Important Announcement for the TrueNAS Community.

Related topics on forums.truenas.com for thread: "Alert: unrecoverable error"

Similar threads