Help with identifying which drives are bad.

Grinas

Contributor
Joined
May 4, 2017
Messages
174
Sorry for the stupid question, but I'm confused as to which drives are actually degraded.

Long story short, I realized today that I have not gotten an email from TrueNAS in about a year, so I decided to have a look at the storage to make sure everything is OK, and I see 2 drives degraded.

I am confused as to which drives. It is showing 9 drives listed in total, which means at least one must no longer be shown/detected by TrueNAS. Is the only way for me to find out which one to get the serial numbers of all the drives, open the server, and go through them all?
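One way to avoid pulling drives blind is to write down the model and serial of every drive that still responds, so the one missing from the list can be found by elimination. A minimal sketch, assuming the text of `smartctl -i` (or `smartctl -a`) has been saved for each device; the helper name `device_identity` is hypothetical and not part of smartmontools:

```python
def device_identity(smart_text):
    """Return (model, serial) from the information section of smartctl output.

    Assumes the "Key:   value" layout that smartctl prints for ATA drives.
    """
    fields = {}
    for line in smart_text.splitlines():
        # Split on the first colon only, so values containing ':' stay intact.
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return fields.get("Device Model"), fields.get("Serial Number")


if __name__ == "__main__":
    sample = (
        "Device Model:     WDC WD30EFRX-68EUZN0\n"
        "Serial Number:    WD-WCC4N0XP648R\n"
    )
    model, serial = device_identity(sample)
    print(f"da1  {model}  {serial}")
```

Any drive whose printed label does not match one of the collected serials is the one the system no longer sees.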

The other degraded drive seems to be da2, which is a very old Toshiba drive, but when I look at the drive states I get "DISK TYPE" "UNKNOWN" for da6, so is da6 degraded too?

Mod note: external images removed

Attached is the smartctl output for all drives.

da1
Code:
smartctl -a /dev/da1
smartctl 7.2 2020-12-30 r5155 [FreeBSD 12.2-RELEASE-p6 amd64] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Red
Device Model:     WDC WD30EFRX-68EUZN0
Serial Number:    WD-WCC4N0XP648R
LU WWN Device Id: 5 0014ee 2640bcb11
Firmware Version: 82.00A82
User Capacity:    3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Wed Aug  2 14:16:31 2023 IST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (40080) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 402) minutes.
Conveyance self-test routine
recommended polling time:        (   5) minutes.
SCT capabilities:              (0x703d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       3
  3 Spin_Up_Time            0x0027   188   180   021    Pre-fail  Always       -       5575
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       227
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   040   040   000    Old_age   Always       -       44452
 10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       227
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       226
193 Load_Cycle_Count        0x0032   199   199   000    Old_age   Always       -       5068
194 Temperature_Celsius     0x0022   110   097   000    Old_age   Always       -       40
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       1

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%     35590         -
# 2  Extended offline    Completed without error       00%     32877         -
# 3  Extended offline    Completed without error       00%     32853         -
# 4  Extended offline    Completed without error       00%     32783         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.


da2
Code:
smartctl -a /dev/da2
smartctl 7.2 2020-12-30 r5155 [FreeBSD 12.2-RELEASE-p6 amd64] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Toshiba 3.5" DT01ACA... Desktop HDD
Device Model:     TOSHIBA DT01ACA300
Serial Number:    X3V9SV5GS
LU WWN Device Id: 5 000039 ff4d2885a
Firmware Version: MX6OABB0
User Capacity:    3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    7200 rpm
Form Factor:      3.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ATA8-ACS T13/1699-D revision 4
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Wed Aug  2 14:17:00 2023 IST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x84) Offline data collection activity
                                        was suspended by an interrupting command from host.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (22652) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 378) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000b   100   100   016    Pre-fail  Always       -       0
  2 Throughput_Performance  0x0005   139   139   054    Pre-fail  Offline      -       70
  3 Spin_Up_Time            0x0007   150   150   024    Pre-fail  Always       -       365 (Average 398)
  4 Start_Stop_Count        0x0012   100   100   000    Old_age   Always       -       586
  5 Reallocated_Sector_Ct   0x0033   094   094   005    Pre-fail  Always       -       276
  7 Seek_Error_Rate         0x000b   100   100   067    Pre-fail  Always       -       0
  8 Seek_Time_Performance   0x0005   124   124   020    Pre-fail  Offline      -       33
  9 Power_On_Hours          0x0012   092   092   000    Old_age   Always       -       60867
 10 Spin_Retry_Count        0x0013   100   100   060    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       398
192 Power-Off_Retract_Count 0x0032   099   099   000    Old_age   Always       -       1332
193 Load_Cycle_Count        0x0012   099   099   000    Old_age   Always       -       1332
194 Temperature_Celsius     0x0002   133   133   000    Old_age   Always       -       45 (Min/Max 13/57)
196 Reallocated_Event_Count 0x0032   093   093   000    Old_age   Always       -       307
197 Current_Pending_Sector  0x0022   100   100   000    Old_age   Always       -       120
198 Offline_Uncorrectable   0x0008   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x000a   153   153   000    Old_age   Always       -       1101

Read SMART Error Log failed: Input/output error

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%     60410         -
# 2  Short offline       Completed without error       00%     60398         -
# 3  Short offline       Completed without error       00%     60242         -
# 4  Short offline       Completed without error       00%     60230         -
# 5  Short offline       Completed without error       00%     60026         -
# 6  Short offline       Completed without error       00%     60014         -
# 7  Short offline       Completed without error       00%     59858         -
# 8  Short offline       Completed without error       00%     59846         -
# 9  Short offline       Completed without error       00%     59690         -
#10  Short offline       Completed without error       00%     59678         -
#11  Short offline       Completed without error       00%     59525         -
#12  Short offline       Completed without error       00%     59513         -
#13  Short offline       Completed without error       00%     59285         -
#14  Short offline       Completed without error       00%     59273         -
#15  Short offline       Completed without error       00%     59117         -
#16  Short offline       Completed without error       00%     59105         -
#17  Short offline       Completed without error       00%     58949         -
#18  Short offline       Completed without error       00%     58937         -
#19  Short offline       Completed without error       00%     58781         -
#20  Short offline       Completed without error       00%     58769         -
#21  Short offline       Completed without error       00%     58565         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.


da3
Code:
smartctl -a /dev/da3
smartctl 7.2 2020-12-30 r5155 [FreeBSD 12.2-RELEASE-p6 amd64] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Red
Device Model:     WDC WD30EFRX-68EUZN0
Serial Number:    WD-WCC4N6SS9F16
LU WWN Device Id: 5 0014ee 20e541427
Firmware Version: 82.00A82
User Capacity:    3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Wed Aug  2 14:18:32 2023 IST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (40020) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 401) minutes.
Conveyance self-test routine
recommended polling time:        (   5) minutes.
SCT capabilities:              (0x703d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       1
  3 Spin_Up_Time            0x0027   186   178   021    Pre-fail  Always       -       5666
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       257
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   028   028   000    Old_age   Always       -       52888
 10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       257
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       242
193 Load_Cycle_Count        0x0032   193   193   000    Old_age   Always       -       22445
194 Temperature_Celsius     0x0022   109   100   000    Old_age   Always       -       41
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%     52767         -
# 2  Short offline       Completed without error       00%     52755         -
# 3  Short offline       Completed without error       00%     52599         -
# 4  Short offline       Completed without error       00%     52587         -
# 5  Short offline       Completed without error       00%     52431         -
# 6  Short offline       Completed without error       00%     52419         -
# 7  Short offline       Completed without error       00%     52264         -
# 8  Short offline       Completed without error       00%     52252         -
# 9  Short offline       Completed without error       00%     52048         -
#10  Short offline       Completed without error       00%     52036         -
#11  Short offline       Completed without error       00%     51880         -
#12  Short offline       Completed without error       00%     51868         -
#13  Short offline       Completed without error       00%     51712         -
#14  Short offline       Completed without error       00%     51700         -
#15  Short offline       Completed without error       00%     51548         -
#16  Short offline       Completed without error       00%     51536         -
#17  Short offline       Completed without error       00%     51308         -
#18  Short offline       Completed without error       00%     51296         -
#19  Short offline       Completed without error       00%     51140         -
#20  Short offline       Completed without error       00%     51128         -
#21  Short offline       Completed without error       00%     50972         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.


da4
Code:
smartctl -a /dev/da4
smartctl 7.2 2020-12-30 r5155 [FreeBSD 12.2-RELEASE-p6 amd64] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Purple
Device Model:     WDC WD30PURZ-85GU6Y0
Serial Number:    WD-WCC4N7AY14Z9
LU WWN Device Id: 5 0014ee 265d1adc5
Firmware Version: 80.00A80
User Capacity:    3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Wed Aug  2 14:18:58 2023 IST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (39600) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 398) minutes.
Conveyance self-test routine
recommended polling time:        (   5) minutes.
SCT capabilities:              (0x703d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   190   182   021    Pre-fail  Always       -       5491
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       120
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   061   061   000    Old_age   Always       -       29072
 10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       120
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       118
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       647
194 Temperature_Celsius     0x0022   106   092   000    Old_age   Always       -       44
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%     20214         -
# 2  Extended offline    Completed without error       00%     17496         -
# 3  Extended offline    Completed without error       00%     17472         -
# 4  Extended offline    Completed without error       00%     17402         -
# 5  Extended offline    Aborted by host               90%     17364         -
# 6  Short offline       Completed without error       00%     17179         -
# 7  Extended offline    Completed without error       00%     17137         -
# 8  Conveyance offline  Completed without error       00%     17129         -
# 9  Extended offline    Aborted by host               90%     17129         -
#10  Short offline       Completed without error       00%     17129         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.


da6
Code:
smartctl -a /dev/da6
smartctl 7.2 2020-12-30 r5155 [FreeBSD 12.2-RELEASE-p6 amd64] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Purple
Device Model:     WDC WD30PURX-64P6ZY0
Serial Number:    WD-WCC4N6VVXV6T
LU WWN Device Id: 0 000000 000000000
Firmware Version: 01.12B.3
User Capacity:    3,001,489,956,864 bytes [3.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ATA8-ACS (minor revision not indicated)
Local Time is:    Wed Aug  2 14:19:24 2023 IST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      ( 115) The previous self-test completed having
                                        the read element of the test failed.
Total time to complete Offline
data collection:                (42300) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 424) minutes.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0xfc77   200   200   051    Pre-fail  Always       -       60
  3 Spin_Up_Time            0xe455   223   218   021    Pre-fail  Offline      -       3808
  4 Start_Stop_Count        0xe054   100   100   000    Old_age   Offline      -       105
  5 Reallocated_Sector_Ct   0xf455   200   200   140    Pre-fail  Offline      -       0
  7 Seek_Error_Rate         0xe455   200   200   051    Pre-fail  Offline      -       0
  9 Power_On_Hours          0xec74   064   064   000    Old_age   Offline      -       26987
 10 Spin_Retry_Count        0xd675   100   100   051    Pre-fail  Offline      -       0
 11 Calibration_Retry_Count 0xac14   100   100   051    Old_age   Offline      -       0
 12 Power_Cycle_Count       0xfc34   100   100   000    Old_age   Offline      -       105
184 End-to-End_Error        0xf477   100   100   097    Pre-fail  Always       -       0
187 Reported_Uncorrect      0xb414   100   100   000    Old_age   Offline      -       0
188 Command_Timeout         0xb436   100   099   000    Old_age   Always       -       1
190 Airflow_Temperature_Cel 0xac76   056   042   000    Old_age   Always       -       44
192 Power-Off_Retract_Count 0xa474   200   200   000    Old_age   Offline      -       104
193 Load_Cycle_Count        0xa476   200   200   000    Old_age   Always       -       0
194 Temperature_Celsius     0xc45c   106   092   000    Old_age   Offline      -       44
195 Hardware_ECC_Recovered  0xdc7e   200   200   000    Old_age   Always       -       0
196 Reallocated_Event_Count 0xbc74   200   200   000    Old_age   Offline      -       0
197 Current_Pending_Sector  0xe674   200   200   000    Old_age   Offline      -       0
198 Offline_Uncorrectable   0xfc7e   100   253   000    Old_age   Always       -       0
199 UDMA_CRC_Error_Count    0xc456   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate   0xbc57   200   200   051    Pre-fail  Always       -       28

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed: read failure       30%     26986         4191536
# 2  Extended offline    Completed: read failure       10%     18117         4198216
# 3  Extended offline    Completed: read failure       90%     15402         4198216
# 4  Extended offline    Completed: read failure       90%     15380         4198216
# 5  Extended offline    Completed: read failure       90%     15380         4198216
# 6  Extended offline    Completed: read failure       90%     15378         4198216
# 7  Extended offline    Completed: read failure       90%     15308         4198216

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.


I assume it's da2 and another drive that's not showing that are degraded, but why is da6 showing as HDD type "unknown"?
 

Davvo

MVP
Joined
Jul 12, 2022
Messages
3,222
What is the output of camcontrol devlist?

da6 can't even complete a SMART test... you could pull it out and subject it to a badblock pummeling, but I'd consider it dead.
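
For anyone skimming long smartctl dumps, here is a minimal Python sketch that picks the failed runs out of a self-test log. It assumes the column layout shown in the da6 output above (two or more spaces between the test description and the status); the function name failed_self_tests is made up for this example and is not part of smartmontools.

```python
import re

# Sample rows copied from the da6 self-test log earlier in this thread.
SELFTEST_LOG = """\
# 1  Short offline       Completed: read failure       30%     26986         4191536
# 2  Extended offline    Completed: read failure       10%     18117         4198216
# 3  Extended offline    Completed: read failure       90%     15402         4198216
"""

# One log row: "# N  <description>  <status>  <remaining>%  <hours>  <LBA or ->"
ROW = re.compile(
    r"^#\s*(?P<num>\d+)\s+(?P<desc>.+?)\s{2,}(?P<status>.+?)\s+"
    r"(?P<remaining>\d+)%\s+(?P<hours>\d+)\s+(?P<lba>\S+)\s*$"
)


def failed_self_tests(log_text):
    """Return (number, description, status, power-on hours, first-error LBA)
    for every row whose status is anything other than 'Completed without error'."""
    failures = []
    for line in log_text.splitlines():
        m = ROW.match(line.strip())
        if m and "without error" not in m["status"]:
            failures.append(
                (int(m["num"]), m["desc"], m["status"], int(m["hours"]), m["lba"])
            )
    return failures


for failure in failed_self_tests(SELFTEST_LOG):
    print(failure)
```

Note that this also reports interrupted or still-running tests, since it treats every status other than a clean completion as worth a look.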
 

Grinas

Contributor
Joined
May 4, 2017
Messages
174
What is the output of camcontrol devlist and zpool status?

da6 can't even complete a SMART test... you could pull it out and subject it to a badblock pummeling, but I'd consider it dead.

I decided to rip the server open and found the drive that was not being recognized.

I will spin the machine up again after removing the disk that's not being recognized and see what's going on. I'll give you the info you requested then.
 

Davvo

MVP
Joined
Jul 12, 2022
Messages
3,222
Also, are you using your HBA in IT mode? Because in your signature you categorize it as a RAID card.

 

Grinas

Contributor
Joined
May 4, 2017
Messages
174
Also, are you using your HBA in IT mode? Because in your signature you categorize it as a RAID card.


camcontrol devlist output
Code:
camcontrol devlist
<VMware Virtual disk 2.0>          at scbus32 target 0 lun 0 (pass0,da0)
<ATA WDC WD30EFRX-68E 0A82>        at scbus33 target 1 lun 0 (pass1,da1)
<ATA TOSHIBA DT01ACA3 ABB0>        at scbus33 target 3 lun 0 (pass2,da2)
<ATA WDC WD30EFRX-68E 0A82>        at scbus33 target 5 lun 0 (pass3,da3)
<ATA WDC WD30PURZ-85G 0A80>        at scbus33 target 7 lun 0 (pass4,da4)
<ATA NT-1TB 117D>                  at scbus33 target 8 lun 0 (pass5,da5)
<ATA WDC WD30PURX-64P 2B.3>        at scbus33 target 9 lun 0 (pass6,da6)
<ATA HGST HTS541010A9 A560>        at scbus33 target 11 lun 0 (pass7,da7)
<ATA WDC WD40EFZX-68A 0B81>        at scbus33 target 13 lun 0 (pass8,da8)


zpool status Output
Code:
Last login: Wed Aug  2 13:31:40 2023 from 192.168.0.15
FreeBSD 12.2-RELEASE-p6 df578562304(HEAD) TRUENAS

    TrueNAS (c) 2009-2021, iXsystems, Inc.
    All rights reserved.
    TrueNAS code is released under the modified BSD license with some
    files copyrighted by (c) iXsystems, Inc.

    For more information, documentation, help or support, go here:
    http://truenas.com
Welcome to FreeNAS

Warning: settings changed through the CLI are not written to
the configuration database and will be reset on reboot.

root@nas:~ # sas3flash -list
Avago Technologies SAS3 Flash Utility
Version 16.00.00.00 (2017.05.02)
Copyright 2008-2017 Avago Technologies. All rights reserved.

    No Avago SAS adapters found! Limited Command Set Available!
    ERROR: Command Not allowed without an adapter!
    ERROR: Couldn't Create Command -list
    Exiting Program.
root@nas:~ # zpool status
  pool: ThreeTBM
 state: DEGRADED
status: One or more devices is currently being resilvered.  The pool will
    continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
  scan: resilver in progress since Wed Aug  2 15:07:54 2023
    84.8G scanned at 639M/s, 37.9G issued at 285M/s, 13.9T total
    7.87G resilvered, 0.27% done, 14:07:06 to go
config:

    NAME                                            STATE     READ WRITE CKSUM
    ThreeTBM                                        DEGRADED     0     0     0
      raidz1-0                                      ONLINE       0     0     0
        gptid/0486c9b0-7721-11e8-aaf6-1866da124f27  ONLINE       0     0     0
        gptid/bb75f4de-56a9-11e8-8d38-1866da124f27  ONLINE       0     0     0
        gptid/bdeb9521-56a9-11e8-8d38-1866da124f27  ONLINE       0     0     0  (resilvering)
      raidz1-1                                      DEGRADED     0     0     0
        8578082641376959206                         UNAVAIL      0     0     0  was /dev/gptid/e12d2f5c-0cc1-11ed-b021-000c2980b62f
        gptid/cccb7648-baa2-11ea-853a-000c296d4317  ONLINE       0     0     0
        gptid/cce4a09d-baa2-11ea-853a-000c296d4317  ONLINE       0     0     0

errors: No known data errors

  pool: boot-pool
 state: ONLINE
  scan: scrub repaired 0B in 00:00:13 with 0 errors on Wed Aug  2 03:45:13 2023
config:

    NAME        STATE     READ WRITE CKSUM
    boot-pool   ONLINE       0     0     0
      da0p2     ONLINE       0     0     0

errors: No known data errors

  pool: onetb
 state: ONLINE
  scan: scrub repaired 0B in 00:09:53 with 0 errors on Sun Jul 30 00:09:53 2023
config:

    NAME                                          STATE     READ WRITE CKSUM
    onetb                                         ONLINE       0     0     0
      gptid/1de2fa9b-a91a-11ec-8be9-000c290bcc0f  ONLINE       0     0     0

errors: No known data errors

  pool: ssd1tb
 state: ONLINE
status: Some supported features are not enabled on the pool. The pool can
    still be used, but some features are unavailable.
action: Enable all features using 'zpool upgrade'. Once this is done,
    the pool may no longer be accessible by software that does not support
    the features. See zpool-features(5) for details.
  scan: scrub repaired 0B in 00:13:12 with 0 errors on Sun Jul  2 00:13:12 2023
config:

    NAME                                          STATE     READ WRITE CKSUM
    ssd1tb                                        ONLINE       0     0     0
      gptid/674157ba-a638-11ea-9c42-000c296d4317  ONLINE       0     0     0

errors: No known data errors


For the HBA, as far as I know it is in IT mode, as I have about three of these and flashed them all at the same time. Here is the output of sas2flash -list, which from my limited knowledge shows IT mode, but correct me if I am wrong.

Code:
sas2flash -list
LSI Corporation SAS2 Flash Utility
Version 16.00.00.00 (2013.03.01)
Copyright (c) 2008-2013 LSI Corporation. All rights reserved

    Adapter Selected is a LSI SAS: SAS2308_2(D1)

    Controller Number              : 0
    Controller                     : SAS2308_2(D1)
    PCI Address                    : 00:0b:00:00
    SAS Address                    : 500605b-0-06d2-f690
    NVDATA Version (Default)       : 14.01.00.06
    NVDATA Version (Persistent)    : 14.01.00.06
    Firmware Product ID            : 0x2214 (IT)
    Firmware Version               : 20.00.07.00
    NVDATA Vendor                  : LSI
    NVDATA Product ID              : SAS9207-8i
    BIOS Version                   : 07.39.02.00
    UEFI BSD Version               : N/A
    FCODE Version                  : N/A
    Board Name                     : SAS9217-8i
    Board Assembly                 : H3-25569-00B
    Board Tracer Number            : SV33017995

    Finished Processing Commands Successfully.
    Exiting SAS2Flash.



On a positive note, I replaced the drive that was not showing and the UI now detects the new drive. Now to find out why the other drive was not showing. The weird thing is that the drive that was not detected was less than a year old and the newest of the group. The Toshiba is 10+ years old and I can't believe it lasted this long.
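
In case it helps anyone reading later: the zpool status output above names the missing member only by a numeric GUID plus its old gptid. Below is a minimal Python sketch that lists every pool, vdev or disk that is not ONLINE from pasted zpool status text. The function name unhealthy_members and the set of state names are assumptions made for this example; only a few rows of my output are used as sample input.

```python
# A few config rows copied from the ThreeTBM pool output above.
ZPOOL_STATUS = """\
    NAME                                            STATE     READ WRITE CKSUM
    ThreeTBM                                        DEGRADED     0     0     0
      raidz1-0                                      ONLINE       0     0     0
        gptid/bdeb9521-56a9-11e8-8d38-1866da124f27  ONLINE       0     0     0  (resilvering)
      raidz1-1                                      DEGRADED     0     0     0
        8578082641376959206                         UNAVAIL      0     0     0  was /dev/gptid/e12d2f5c-0cc1-11ed-b021-000c2980b62f
        gptid/cccb7648-baa2-11ea-853a-000c296d4317  ONLINE       0     0     0
"""

# Assumed list of states that zpool status prints in the STATE column.
STATES = {"ONLINE", "DEGRADED", "FAULTED", "OFFLINE", "UNAVAIL", "REMOVED"}


def unhealthy_members(status_text):
    """Return (name, state, trailing note) for every config row that is not ONLINE."""
    rows = []
    for line in status_text.splitlines():
        fields = line.split()
        # A config row has NAME, STATE and the three error counters.
        if len(fields) < 5 or fields[1] not in STATES:
            continue
        if fields[1] != "ONLINE":
            rows.append((fields[0], fields[1], " ".join(fields[5:])))
    return rows


for name, state, note in unhealthy_members(ZPOOL_STATUS):
    print(name, state, note)
```

On TrueNAS CORE (FreeBSD), glabel status should then show which daX partition each remaining gptid belongs to, so the missing disk can be narrowed down by elimination.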

 
Joined
Jun 15, 2022
Messages
674
I am confused as to which drive so its showing 9 drives in total listed so that means at least one must be no longer show/detect by truenas. Is the only way for me to find out which one is to get serials of all and open server go through them all?

The other drive that is degraded seems to be da2 which is an very old Toshiba drive but when i go look at the drive states i get "DISK TYPE" "UNKNOWN" for da6 so is da6 degraded too?

I assume its da2 and another drive thats not showing that are degraded but why is da6 showing as HDD type "unknown"
Unless the Serial Number and/or ID along with which bay each drive went into was recorded when the drives were installed, then generally the server has to be shut down and the drives inspected manually. There are some servers that report which drive is in which bay, however this OEM functionality may be unavailable when using TrueNAS and IT Mode instead of the OEM RAID configuration and software. There are different "blink LED light" support threads depending on which brand/server you have that might help identify the failed drive.

I use a label maker to create a large-print label for sticking to the appropriate side of each drive which adds a lot of overhead initially but saves far more on the back-end. When using the OEM drive bays this wasn't necessary, however I replaced those with high-density bays with inlet fan cages in order to lower drive temperatures and extend drive life.
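
To make the labeling less tedious, here is a rough Python sketch that turns camcontrol devlist output into one label line per drive. It assumes the line format shown in the devlist output earlier in the thread, and that the serial numbers have been collected separately (for example from smartctl -a) into a dictionary. The names drive_labels and SERIALS are hypothetical; the two serials are the ones posted for da1 and da2.

```python
import re

# Two rows copied from the camcontrol devlist output earlier in the thread.
DEVLIST = """\
<ATA WDC WD30EFRX-68E 0A82>        at scbus33 target 1 lun 0 (pass1,da1)
<ATA TOSHIBA DT01ACA3 ABB0>        at scbus33 target 3 lun 0 (pass2,da2)
"""

# Serial numbers gathered by hand from the smartctl output (assumed input).
SERIALS = {"da1": "WD-WCC4N0XP648R", "da2": "X3V9SV5GS"}

ENTRY = re.compile(
    r"^<(?P<model>[^>]+)>\s+at scbus(?P<bus>\d+) target (?P<target>\d+) "
    r"lun \d+ \((?P<devs>[^)]+)\)"
)


def drive_labels(devlist_text, serials):
    """Build one printable label line per disk listed by camcontrol devlist."""
    labels = []
    for line in devlist_text.splitlines():
        m = ENTRY.match(line)
        if not m:
            continue
        # The device pair may be "pass1,da1" or "da1,pass1"; keep the non-pass name.
        names = [d for d in m["devs"].split(",") if not d.startswith("pass")]
        if not names:
            continue
        dev = names[0]
        serial = serials.get(dev, "serial not recorded")
        labels.append(f"{dev} | target {m['target']} | {m['model']} | S/N {serial}")
    return labels


for label in drive_labels(DEVLIST, SERIALS):
    print(label)
```

Keep in mind that daX numbers can change between boots or after a drive is pulled, so the serial number is the part of the label worth trusting.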

OPINION: Your drive temps (45°C/113°F or so) look a bit high (to me). I try to keep mine under 32°C/90°F with a goal of 30°C/86°F. I pack 'em pretty dense so 30 isn't always possible.
 

Grinas

Contributor
Joined
May 4, 2017
Messages
174
Unless the Serial Number and/or ID along with which bay each drive went into was recorded when the drives were installed, then generally the server has to be shut down and the drives inspected manually. There are some servers that report which drive is in which bay, however this OEM functionality may be unavailable when using TrueNAS and IT Mode instead of the OEM RAID configuration and software. There are different "blink LED light" support threads depending on which brand/server you have that might help identify the failed drive.

I use a label maker to create a large-print label for sticking to the appropriate side of each drive which adds a lot of overhead initially but saves far more on the back-end. When using the OEM drive bays this wasn't necessary, however I replaced those with high-density bays with inlet fan cages in order to lower drive temperatures and extend drive life.

OPINION: Your drive temps (45°C/113°F or so) look a bit high (to me). I try to keep mine under 32°C/90°F with a goal of 30°C/86°F. I pack 'em pretty dense so 30 isn't always possible.

Thanks for the response. My main concern here is whether it is da2 or da6 that is degraded. The UI is giving mixed info IMHO: pool status says one disk, while disk status shows another disk as the issue. As I was saying in a previous comment, the Toshiba drive is 10+ years old and I can't believe it's still alive.

"Great minds" with the label maker, as I just pulled mine out after opening the server earlier, so when I replace the other degraded drive later I can label everything and make life easier.

The HDDs are packed in pretty tight in my machine, like yours. The T20 is only supposed to have 6 drives but I have 10 in it. I put in extra fans to reduce the heat, but the thing was covered in dust, which probably was not helping. I gave it a quick clean and will do a more complete clean when I open it next, hopefully later today. HDD temps now range from 29°C to 40°C.
 

Davvo

MVP
Joined
Jul 12, 2022
Messages
3,222
If da6 is cooked, da2 is almost done as well: it has both 196 and 197 errors, and plenty of them.
Replace both drives.
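
As a rough illustration of that check, the Python sketch below scans a pasted smartctl attribute table and reports the sector-health attributes with a non-zero raw value. The watched IDs (5, 196, 197, 198) are my own assumption about which counters matter most, not an official list, and nonzero_watched is a made-up name. The sample rows use the da2 values from this thread.

```python
# Attribute rows with the da2 values reported in this thread.
DA2_ATTRIBUTES = """\
  5 Reallocated_Sector_Ct   0x0033   094   094   005    Pre-fail  Always       -       276
196 Reallocated_Event_Count 0x0032   093   093   000    Old_age   Always       -       307
197 Current_Pending_Sector  0x0022   100   100   000    Old_age   Always       -       120
198 Offline_Uncorrectable   0x0008   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x000a   153   153   000    Old_age   Always       -       1101
"""

# Assumption: reallocated, pending and uncorrectable sector counters.
WATCHED = {5, 196, 197, 198}


def nonzero_watched(table_text):
    """Map attribute name -> raw value for watched attributes whose raw value is above zero."""
    flagged = {}
    for line in table_text.splitlines():
        # ID, NAME, FLAG, VALUE, WORST, THRESH, TYPE, UPDATED, WHEN_FAILED, RAW_VALUE
        fields = line.split(None, 9)
        if len(fields) < 10 or not fields[0].isdigit():
            continue
        attr_id = int(fields[0])
        # RAW_VALUE may carry extra text, e.g. "45 (Min/Max 13/57)"; use the first token.
        raw_token = fields[9].split()[0]
        if attr_id in WATCHED and raw_token.isdigit() and int(raw_token) > 0:
            flagged[fields[1]] = int(raw_token)
    return flagged


print(nonzero_watched(DA2_ATTRIBUTES))
```

Attribute 199 is left out on purpose, since CRC errors usually point at cabling or the backplane more than at the platters.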
 
Joined
Jun 15, 2022
Messages
674
What I've seen is heat kills drives; I've done a few posts on that which also state my sample size is very small, however it "seems" to be applicable to your drives:

(I used CODE tags on the following sections...but they ate the highlighting of pertinent information, so I had to redo it without the code tags.)

Running warm, otherwise keep an eye on it:
smartctl -a /dev/da1
Serial Number: WD-WCC4N0XP648R
SMART overall-health self-assessment test result: PASSED

ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 3
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0
9 Power_On_Hours 0x0032 040 040 000 Old_age Always - 44452
10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0
194 Temperature_Celsius 0x0022 110 097 000 Old_age Always - 40
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 1
No Errors Logged

SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed without error 00% 35590 -
# 2 Extended offline Completed without error 00% 32877 -
# 3 Extended offline Completed without error 00% 32853 -
# 4 Extended offline Completed without error 00% 32783 -



This drive is failing and needs replacement now; a long test would show that.
smartctl -a /dev/da2
Serial Number: X3V9SV5GS
SMART overall-health self-assessment test result: PASSED

ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 0
5 Reallocated_Sector_Ct 0x0033 094 094 005 Pre-fail Always - 276
7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0
9 Power_On_Hours 0x0012 092 092 000 Old_age Always - 60867
10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0
194 Temperature_Celsius 0x0002 133 133 000 Old_age Always - 45 (Min/Max 13/57)
196 Reallocated_Event_Count 0x0032 093 093 000 Old_age Always - 307
197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 120
198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x000a 153 153 000 Old_age Always - 1101

SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Short offline Completed without error 00% 60410 -
# 2 Short offline Completed without error 00% 60398 -
# 3 Short offline Completed without error 00% 60242 -
...
#21 Short offline Completed without error 00% 58565 -



Other than temp, looks good but keep an eye on it:
smartctl -a /dev/da3
Serial Number: WD-WCC4N6SS9F16
SMART overall-health self-assessment test result: PASSED

ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 1
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0
9 Power_On_Hours 0x0032 028 028 000 Old_age Always - 52888
10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0
194 Temperature_Celsius 0x0022 109 100 000 Old_age Always - 41
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0
No Errors Logged

SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Short offline Completed without error 00% 52767 -
# 2 Short offline Completed without error 00% 52755 -
# 3 Short offline Completed without error 00% 52599 -
...
#21 Short offline Completed without error 00% 50972 -



Other than temp, looks good:
smartctl -a /dev/da4
Serial Number: WD-WCC4N7AY14Z9
SMART overall-health self-assessment test result: PASSED

ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0
9 Power_On_Hours 0x0032 061 061 000 Old_age Always - 29072
10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0
194 Temperature_Celsius 0x0022 106 092 000 Old_age Always - 44
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 0
No Errors Logged

SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed without error 00% 20214 -
# 2 Extended offline Completed without error 00% 17496 -
# 3 Extended offline Completed without error 00% 17472 -
...
#10 Short offline Completed without error 00% 17129 -


da5 is MIA ???


This drive is failing and needs replacement now (it's not passing a short test):

smartctl -a /dev/da6
Serial Number: WD-WCC4N6VVXV6T
SMART overall-health self-assessment test result: PASSED

ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0xfc77 200 200 051 Pre-fail Always - 60
5 Reallocated_Sector_Ct 0xf455 200 200 140 Pre-fail Offline - 0
7 Seek_Error_Rate 0xe455 200 200 051 Pre-fail Offline - 0
9 Power_On_Hours 0xec74 064 064 000 Old_age Offline - 26987
10 Spin_Retry_Count 0xd675 100 100 051 Pre-fail Offline - 0
11 Calibration_Retry_Count 0xac14 100 100 051 Old_age Offline - 0
194 Temperature_Celsius 0xc45c 106 092 000 Old_age Offline - 44
195 Hardware_ECC_Recovered 0xdc7e 200 200 000 Old_age Always - 0
196 Reallocated_Event_Count 0xbc74 200 200 000 Old_age Offline - 0
197 Current_Pending_Sector 0xe674 200 200 000 Old_age Offline - 0
198 Offline_Uncorrectable 0xfc7e 100 253 000 Old_age Always - 0
199 UDMA_CRC_Error_Count 0xc456 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0xbc57 200 200 051 Pre-fail Always - 28
No Errors Logged

Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Short offline Completed: read failure 30% 26986 4191536
# 2 Extended offline Completed: read failure 10% 18117 4198216
# 3 Extended offline Completed: read failure 90% 15402 4198216
# 4 Extended offline Completed: read failure 90% 15380 4198216
...
# 7 Extended offline Completed: read failure 90% 15308 4198216
 

Davvo

MVP
Joined
Jul 12, 2022
Messages
3,222
Joined
Jun 15, 2022
Messages
674
da5 is an SSD, I suppose that's why he didn't list it.
I bet he's cooking it like a hot-dog. :grin:

(I edited the post to remove the CODE tags as they don't allow highlighting/bold for clarity.)
 

Grinas

Contributor
Joined
May 4, 2017
Messages
174
I bet he's cooking it like a hot-dog. :grin:

(I edited the post to remove the CODE tags as they don't allow highlighting/bold for clarity.)
SSD temp shows 30 so at least one is not cooking.

I guess it's time for more fans and for me to see if I can move some of these HDDs around for better airflow.


Thanks for the help, all!!!

Looks like I have a lot of work to do.
 

Grinas

Contributor
Joined
May 4, 2017
Messages
174
Just to update everyone: I found that one of the fans had failed, so that's probably the reason for the high temps. I ordered a replacement fan plus some extras to add and to keep as spares. I also made sure that they all have LEDs on them so I can see pretty easily if they are dead.

On another note, I got more bad news. It looks like 3 drives are failing, 1 Toshiba and 2 WD Purples, and I only have 2 replacements. The third drive is the one not being recognized. The two Purple drives were supposed to still be in warranty as they were purchased in Sept 2020, but when I went to fill out the RMA I saw that the warranty on those drives expired in Mar 2020. The drives are purple in color so not refurbs, but it looks like they might be test drives or something like what is discussed in this thread.


I have contacted WD to get more info and am waiting on a response. The drives were purchased on eBay, labeled new and sealed, from a seller with thousands of positive feedback ratings, but the seller's account no longer exists. They were actually new and sealed. A few months before purchasing these drives I had ordered one listed as new and sealed, and the seal was broken. When a drive later failed, I went to replace it with that drive ordered months prior, having never opened the parcel it came in, and it ended up being defective. The seller would not take it back as it was about 3 months since I had purchased it, but it was covered under WD warranty, so I made sure these were sealed when purchased.

I never knew there was a possibility of a reduced warranty period with drives and assumed they all had at least 3 years from the manufacture date, which is why I never registered these on the WD site until now. The drives' manufacture date shows July 2019, so I can't understand why the warranty period is only 9 months from the date of manufacture. 9 months seems an odd period.
 

Davvo

MVP
Joined
Jul 12, 2022
Messages
3,222
The warranty starts when the drive is bought from the manufacturer, not when you buy it from Amazon/eBay (unless you buy from the manufacturer).

I wouldn't consider da1 as failing yet.
 

Grinas

Contributor
Joined
May 4, 2017
Messages
174
The warranty starts when the drive is bought from the manufacturer, not when you buy it from Amazon/eBay (unless you buy from the manufacturer).

I wouldn't consider da1 as failing yet.

That's not the case in my past experience with WD and other devices. Warranty starts from the day of purchase, not from when the reseller purchased or from the manufacture date. It may vary depending on country/region. https://support-en.wd.com/app/Warranty_Policy#group2

When I enter proof of purchase, the warranty expiry shows as 3 years in the future (or 5 years for Reds), not a date based on the manufacture date on the drive. Excluding the drives mentioned above, I can see only one of my drives on the WD portal that does not have an expiry 3 or 5 years after purchase, and I think I got that one second hand.

screenshots of purple drives warranty period in WD portal.
Kdc3Gtk.png

aG10AnB.png
 
Joined
Jun 15, 2022
Messages
674
@Grinas : Happy Friday!

Generally, warranties are only from an OEM authorized seller listed on the Authorized Sellers list on the OEM website. (Guess who else learned that the hard way...)

I reformatted the post again, this time with monospace font so the text lined up.

Where'd you get the fans? (link) They sound interesting!
 

Grinas

Contributor
Joined
May 4, 2017
Messages
174
@Grinas : Happy Friday!

Generally, warranties are only from an OEM authorized seller listed on the Authorized Sellers list on the OEM website. (Guess who else learned that the hard way...)

I reformatted the post again, this time with monospace font so the text lined up.

Where'd you get the fans? (link) They sound interesting!

Not so happy Friday after all.

I waited over 12 hours for resilvering to complete after replacing one of the drives (da6). TrueNAS was still showing 2 drives failing today, da4 and da2. I stupidly only replaced one at a time. It then froze (command line and web UI), and when I accessed the console via ESXi it said the pool was suspended due to I/O errors.

I tried a reboot and got the same error, but left it to see what would happen. 2 hours later the TrueNAS VM still has not started and I am getting the following errors in the ESXi console.

HqeBfuL.png



More drives must have failed, probably the replacement one I had. If so, the same thing happened when I replaced a degraded drive with a new one last year. I should probably test new drives when I get them instead of leaving them in sealed film in the parcel they came in.

I'm assuming that if I put back the degrading da6 that I removed earlier, there might be some hope of restoring the pool if I'm lucky.


For the fans, I got them from AliExpress. I live in a country that most places won't post to (e.g. Amazon, eBay), or if they do it's €20+ in customs charges, so I don't have many options for stuff unless I'm lucky with an eBay seller. Also, the fan was not broken after all; the connector had just come out. It must have happened the last time I opened the server.
 
Joined
Jun 15, 2022
Messages
674
@Grinas : Sad to see they all started having serious health issues at the same time. You may want to do write/read testing (in the user guides) on new drives and schedule SMART long tests in the TrueNAS interface. I'm guessing you used Z2 instead of Z3. Your best bet now is possibly to put the failing drives back into the system because they still have copies of your data, each one failing at a different area so you should be able to make a backup of your data onto something else. I use 4 TB USB-3 drives ($88, though on Amazon which you said doesn't deliver to you).

Thank you for the fan link, I ordered an 80mm and 120mm fan for testing since they're quite affordable. I'll put them on the scope and see how much line noise they generate and if it's minimal will order a few sets for an upcoming JBOD case project.
 

Davvo

MVP
Joined
Jul 12, 2022
Messages
3,222
and when i accessed the console via ESXI it said pool suspended due to I/O errors.
Wait, you were virtualizing?
 
Joined
Jun 15, 2022
Messages
674
Wait, you were virtualizing?
Oh no, this generally doesn't end well. This is another reason I tell people not to run TrueNAS in a VM. Way smarter people here have shown how dangerous this is and the many reasons why. For really important data, do not virtualize (because eventually zebras will eat it).

---
See the video Zebras All the Way Down (why simplicity is important) that I keep talking about.
 