Device FAULTED in scrub but OK in SMARTCTL report

Status
Not open for further replies.

IanWorthington

Contributor
Joined
Sep 13, 2013
Messages
144
My last SCRUB status has come back degraded with a device faulted due too many errors (output (1) below).

A SMARTCTL on the device though is showing status PASSED.

Should I remove and resilver the device at this stage or wait for it to actually show SMART problems?

Volume is configured as zfs3 (8+3) if that matters.

Ian


(1)
Code:
Checking status of zfs pools:
NAME           SIZE  ALLOC   FREE  EXPANDSZ   FRAG    CAP  DEDUP  HEALTH  ALTROOT
VOLUME1         40T  37.6T  2.37T         -    21%    94%  1.00x  DEGRADED  /mnt
freenas-boot  29.8G  3.52G  26.2G         -      -    11%  1.00x  ONLINE  -

  pool: VOLUME1
state: DEGRADED
status: One or more devices are faulted in response to persistent errors.
    Sufficient replicas exist for the pool to continue functioning in a
    degraded state.
action: Replace the faulted device, or use 'zpool clear' to mark the device
    repaired.
  scan: scrub repaired 0 in 15h22m with 0 errors on Mon May  9 17:22:16 2016
config:

    NAME                                            STATE     READ WRITE CKSUM
    VOLUME1                                         DEGRADED     0     0     0
      raidz3-0                                      DEGRADED     0     0     0
        gptid/444302b9-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
        gptid/44acaf47-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
        gptid/458b61fe-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
        gptid/45f04d30-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
        gptid/46dd2963-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
        gptid/47cdf0aa-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
        gptid/48565317-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
        gptid/48cd4928-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
        gptid/4a58c8ac-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
        gptid/4c508137-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
        gptid/4d50f8ba-da2a-11e3-90c3-002590878c66  FAULTED     70   187     0  too many errors

errors: No known data errors

-- End of daily output --


(2)
Code:
 % sudo smartctl -a /dev/da6

smartctl 6.3 2014-07-26 r3976 [FreeBSD 9.3-RELEASE-p13 amd64] (local build)
Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Seagate NAS HDD
Device Model:     ST4000VN000-1H4168
Serial Number:    Z300XYLK
LU WWN Device Id: 5 000c50 0650ebb6d
Firmware Version: SC43
User Capacity:    4,000,787,030,016 bytes [4.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5900 rpm
Form Factor:      3.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2, ACS-3 T13/2161-D revision 3b
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Tue May 10 21:13:17 2016 COT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (   97) seconds.
Offline data collection
capabilities:                    (0x73) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        No Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 514) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.
SCT capabilities:              (0x10bd) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   119   099   006    Pre-fail  Always       -       230762400
  3 Spin_Up_Time            0x0003   094   091   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       68
  5 Reallocated_Sector_Ct   0x0033   100   100   010    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   084   060   030    Pre-fail  Always       -       263206059
  9 Power_On_Hours          0x0032   087   087   000    Old_age   Always       -       12053
10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       66
184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Command_Timeout         0x0032   100   100   000    Old_age   Always       -       0
189 High_Fly_Writes         0x003a   001   001   000    Old_age   Always       -       1050
190 Airflow_Temperature_Cel 0x0022   068   059   045    Old_age   Always       -       32 (Min/Max 31/33)
191 G-Sense_Error_Rate      0x0032   100   100   000    Old_age   Always       -       0
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       63
193 Load_Cycle_Count        0x0032   100   100   000    Old_age   Always       -       68
194 Temperature_Celsius     0x0022   032   041   000    Old_age   Always       -       32 (0 15 0 0 0)
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       33

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%      7620         -
# 2  Short offline       Completed without error       00%      7596         -
# 3  Short offline       Completed without error       00%      7572         -
# 4  Short offline       Completed without error       00%      7563         -
# 5  Short offline       Completed without error       00%      7540         -
# 6  Extended offline    Completed without error       00%      7525         -
# 7  Short offline       Completed without error       00%      7515         -
# 8  Short offline       Completed without error       00%      7491         -
# 9  Short offline       Completed without error       00%      7467         -
#10  Short offline       Completed without error       00%      7443         -
#11  Short offline       Completed without error       00%      7413         -
#12  Short offline       Completed without error       00%      7388         -
#13  Extended offline    Completed without error       00%      7375         -
#14  Short offline       Completed without error       00%      7364         -
#15  Short offline       Completed without error       00%      7340         -
#16  Short offline       Completed without error       00%      7316         -
#17  Short offline       Completed without error       00%      7292         -
#18  Short offline       Completed without error       00%      7268         -
#19  Short offline       Completed without error       00%      7245         -
#20  Short offline       Completed without error       00%      7221         -
#21  Extended offline    Completed without error       00%      7207         -

SMART Selective self-test log data structure revision number 1
SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
 

danb35

Hall of Famer
Joined
Aug 16, 2011
Messages
15,504
That drive hasn't seen a SMART test in nearly 5000 hours. Recommend you run a long test (and check your schedules) and see what results.
 

IanWorthington

Contributor
Joined
Sep 13, 2013
Messages
144
Yes I noticed that. For some reason it, and another, had dropped out of the daily short/weekly long schedules. It's already been reinstated and I'll have the results tomorrow.

But that shouldn't, iiuc, affect lines 62-82 should it? Aren't those updated on the fly?
 

ethereal

Guru
Joined
Sep 10, 2012
Messages
762
But that shouldn't, iiuc, affect lines 62-82 should it? Aren't those updated on the fly?

if you have smart tests and scrubs set up correctly and email the results to you - any problems maybe headed off before there is a big problem.
those errors have been building but because the smart tests were not running you didn't notice until the pool was degraded.
if you had the smart test configured you could have noticed the problem and ordered and tested a replacemnet.

my server sends me an email weekly and i check 10 hdds and ssd for any smart or scrub problems
 
Last edited:

danb35

Hall of Famer
Joined
Aug 16, 2011
Messages
15,504
But that shouldn't, iiuc, affect lines 62-82 should it? Aren't those updated on the fly?
No, it really shouldn't affect those; as you say, they're supposed to be updated on the fly. It may be that the SMART test will reveal nothing useful. But it will be good to see in any case.
 

Bidule0hm

Server Electronics Sorcerer
Joined
Aug 5, 2013
Messages
3,710
The High_Fly_Writes attribute is very high, the drive was probably exposed to some shocks/vibrations, you may want to check that.
 

IanWorthington

Contributor
Joined
Sep 13, 2013
Messages
144
Well da6 is no longer online this morning so I'm not sure how to see the results of the overnight SMART tests.

Could the SMART test, or something else, have taken it offline?


if you have smart tests and scrubs set up correctly and email the results to you - any problems maybe headed off before there is a big problem.
those errors have been building but because the smart tests were not running you didn't notice until the pool was degraded.
if you had the smart test configured you could have noticed the problem and ordered and tested a replacemnet.

my server sends me an email weekly and i check 10 hdds and ssd for any smart or scrub problems

Agreed. It was not part of the plan to exclude that disk from the SMART tests!
Luckily I do have a cold standby in place.

The High_Fly_Writes attribute is very high, the drive was probably exposed to some shocks/vibrations, you may want to check that.

Interesting... The drive(s) were certainly /not/ exposed to shocks when running (unless swmbo has something to tell me...), but it /was/ part of the initial order of which FOUR drives were DOA.
 

danb35

Hall of Famer
Joined
Aug 16, 2011
Messages
15,504
Could the SMART test, or something else, have taken it offline?
The SMART test would not have, and nothing else should have. You may be able to find in the system log (/var/log/messages) where it dropped offline, and why (a grep for da6 would be the place to start, I think). A power cycle may bring it back online so you can see the SMART status, but the drive dropping offline isn't a good sign.
 

IanWorthington

Contributor
Joined
Sep 13, 2013
Messages
144
The SMART test would not have, and nothing else should have. You may be able to find in the system log (/var/log/messages) where it dropped offline, and why (a grep for da6 would be the place to start, I think). A power cycle may bring it back online so you can see the SMART status, but the drive dropping offline isn't a good sign.

Thanks for that. This is what I find:

Code:
May 11 02:28:46 freenas mps0: IOCStatus = 0x4b while resetting device 0xe
May 11 02:28:46 freenas da6 at mps0 bus 0 scbus0 target 6 lun 0
May 11 02:28:46 freenas da6: <ATA ST4000VN000-1H41 SC43> s/n             Z300XYLK detached
May 11 02:28:46 freenas GEOM_ELI: Device da6p1.eli destroyed.
May 11 02:28:46 freenas GEOM_ELI: Detached da6p1.eli on last close.
May 11 02:28:46 freenas (da6:mps0:0:6:0): Periph destroyed


I must admit I have no idea how to interpret that.

I see that https://forums.freebsd.org/threads/39609/ also reports a drive going offline during a smart long test.
 

Ericloewe

Server Wrangler
Moderator
Joined
Feb 15, 2014
Messages
20,194

IanWorthington

Contributor
Joined
Sep 13, 2013
Messages
144
There's your problem, probably.

Replace the cable and try again.

Just tried this. The device reattached and automatically started resilvering. Finished after 10 minutes and the status now shows:

Code:
 gptid/4d50f8ba-da2a-11e3-90c3-002590878c66  DEGRADED     0     0   472  too many errors


(was previously
Code:
gptid/4d50f8ba-da2a-11e3-90c3-002590878c66  FAULTED     70   187     0  too many errors
)

I'm assuming this is not good?
 

IanWorthington

Contributor
Joined
Sep 13, 2013
Messages
144
Well this is odd. Had to reboot again to fix another SATA cable which had come loose whilst I was changing the previous one and:


Code:
 % zpool status VOLUME1
  pool: VOLUME1
state: ONLINE
status: Some supported features are not enabled on the pool. The pool can
        still be used, but some features are unavailable.
action: Enable all features using 'zpool upgrade'. Once this is done,
        the pool may no longer be accessible by software that does not support
        the features. See zpool-features(7) for details.
  scan: resilvered 1.64M in 0h0m with 0 errors on Wed May 11 23:14:31 2016
config:

        NAME                                            STATE     READ WRITE CKSUM
        VOLUME1                                         ONLINE       0     0     0
          raidz3-0                                      ONLINE       0     0     0
            gptid/444302b9-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
            gptid/44acaf47-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
            gptid/458b61fe-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
            gptid/45f04d30-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
            gptid/46dd2963-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
            gptid/47cdf0aa-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
            gptid/48565317-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
            gptid/48cd4928-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
            gptid/4a58c8ac-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
            gptid/4c508137-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
            gptid/4d50f8ba-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0


It this a reasonable thing to happen?

I'm running a SCRUB and a long SMART, we'll see what state it's in...
 

Ericloewe

Server Wrangler
Moderator
Joined
Feb 15, 2014
Messages
20,194
Those counters are reset on reboot, so there's no useful data yet.
 

IanWorthington

Contributor
Joined
Sep 13, 2013
Messages
144
Those counters are reset on reboot, so there's no useful data yet.

Thanks Eric.



So I ran the SCRUB and a long SMART and this is what I got:

12K CKSUM errors on the suspect device but NO reallocated or current pending sectors on the SMART.

Does this make any sense?

Code:
 % zpool status VOLUME1
  pool: VOLUME1
 state: DEGRADED
status: One or more devices has experienced an unrecoverable error.  An
        attempt was made to correct the error.  Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
        using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://illumos.org/msg/ZFS-8000-9P
  scan: scrub repaired 76.8M in 33h42m with 0 errors on Fri May 13 09:04:58 2016
config:

        NAME                                            STATE     READ WRITE CKSUM
        VOLUME1                                         DEGRADED     0     0     0
          raidz3-0                                      DEGRADED     0     0     0
            gptid/444302b9-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
            gptid/44acaf47-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
            gptid/458b61fe-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
            gptid/45f04d30-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
            gptid/46dd2963-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
            gptid/47cdf0aa-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
            gptid/48565317-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
            gptid/48cd4928-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
            gptid/4a58c8ac-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
            gptid/4c508137-da2a-11e3-90c3-002590878c66  ONLINE       0     0     0
            gptid/4d50f8ba-da2a-11e3-90c3-002590878c66  DEGRADED     0     0 12.0K  too many errors

errors: No known data errors


Code:
 % sudo smartctl -a /dev/da6
Password:
smartctl 6.3 2014-07-26 r3976 [FreeBSD 9.3-RELEASE-p13 amd64] (local build)
Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Seagate NAS HDD
Device Model:     ST4000VN000-1H4168
Serial Number:    Z300XYLK
LU WWN Device Id: 5 000c50 0650ebb6d
Firmware Version: SC43
User Capacity:    4,000,787,030,016 bytes [4.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5900 rpm
Form Factor:      3.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2, ACS-3 T13/2161-D revision 3b
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Fri May 13 17:26:59 2016 COT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (   97) seconds.
Offline data collection
capabilities:                    (0x73) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        No Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 514) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.
SCT capabilities:              (0x10bd) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   110   099   006    Pre-fail  Always       -       28092328
  3 Spin_Up_Time            0x0003   094   091   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       71
  5 Reallocated_Sector_Ct   0x0033   100   100   010    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   084   060   030    Pre-fail  Always       -       267465746
  9 Power_On_Hours          0x0032   087   087   000    Old_age   Always       -       12115
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       68
184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Command_Timeout         0x0032   100   100   000    Old_age   Always       -       0
189 High_Fly_Writes         0x003a   001   001   000    Old_age   Always       -       1050
190 Airflow_Temperature_Cel 0x0022   055   052   045    Old_age   Always       -       45 (Min/Max 27/48)
191 G-Sense_Error_Rate      0x0032   100   100   000    Old_age   Always       -       0
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       66
193 Load_Cycle_Count        0x0032   100   100   000    Old_age   Always       -       71
194 Temperature_Celsius     0x0022   045   048   000    Old_age   Always       -       45 (0 15 0 0 0)
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       33

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%     12115         -
# 2  Extended offline    Aborted by host               90%     12072         -
# 3  Short offline       Completed without error       00%     12072         -
# 4  Extended offline    Interrupted (host reset)      00%     12058         -
# 5  Short offline       Completed without error       00%     12057         -
# 6  Short offline       Completed without error       00%      7620         -
# 7  Short offline       Completed without error       00%      7596         -
# 8  Short offline       Completed without error       00%      7572         -
# 9  Short offline       Completed without error       00%      7563         -
#10  Short offline       Completed without error       00%      7540         -
#11  Extended offline    Completed without error       00%      7525         -
#12  Short offline       Completed without error       00%      7515         -
#13  Short offline       Completed without error       00%      7491         -
#14  Short offline       Completed without error       00%      7467         -
#15  Short offline       Completed without error       00%      7443         -
#16  Short offline       Completed without error       00%      7413         -
#17  Short offline       Completed without error       00%      7388         -
#18  Extended offline    Completed without error       00%      7375         -
#19  Short offline       Completed without error       00%      7364         -
#20  Short offline       Completed without error       00%      7340         -
#21  Short offline       Completed without error       00%      7316         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
 

ethereal

Guru
Joined
Sep 10, 2012
Messages
762
  1. 199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 33
probably a problem with your cable - make sure it is seated correctly or replace it
 

IanWorthington

Contributor
Joined
Sep 13, 2013
Messages
144
  1. 199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 33
probably a problem with your cable - make sure it is seated correctly or replace it

Aye, Eric spotted that. I replaced it and scrubbed: the previous post was the result of that. The UDMA_CRC count stayed at 33 so I assume the new cable is ok.
 

ethereal

Guru
Joined
Sep 10, 2012
Messages
762
potentially a bad drive then with your CKSUM errors.

some people have had positive results with removing the bad drive - deleting it's contents then replacing in the pool after re-silvering the problem is gone.

i would use badblocks if i were you
 
Last edited:
Status
Not open for further replies.
Top