SOLVED One or more devices has experienced an error resulting in data corruption (my fault) Suggestions?

AndrewParsons

Dabbler
Joined
Jun 14, 2016
Messages
40
Hello Community,

First let me thank you for helping me out in the past from building my first freenas setup trouble shooting it.

Below is what I am running before I get into details on what I did and if their is any way I can fix this besides backing up and rebuilding the pool.

System:
- FreeNAS-11.3-U3.2 main pool is running in raidz1
- Processor (Intel Xeon E3-1230V3 Haswell, 3.3GHz, 8MB L3 Cache LGA 1150, 80W Quad-Core Server Processor BX80646E31230V3)
- Motherboard (Supermicro X10SL7-F uATX DDR3 1600 LGA 1150)
-Crucial 32GB Kit (8GBx4) DDR3/DDR3L-1600MT/s (PC3-12800) DR x8 ECC UDIMM Server Memory CT2KIT102472BD160B/CT2CP102472BD160B Unbufferd)
- Hard Drives x6 (HGST Deskstar NAS 3.5-Inch 4TB 7200RPM SATA III 64MB Cache Internal Hard Drive Kit (0S03664)
- Computer Case (Antec One Computer Case)
- Power Supply (EVGA SuperNOVA 750 G1 80+ GOLD, 750W Fully Modular 10 Year Warranty Power Supply 120-G1-0750-XR)
- SSD for ZFS (Kingston Digital 120GB UV400 SSD C2C 2.5" SUV400S37/120G)

So I was replacing some failing case fans in the server. When completed I fired back up the server and all was good, or so I thought, I guess I was running it in a degraded I must have damaged one of the sata cables because the next day I was getting
Code:
(ada3ahcich5:0:0:0): CAM status: ATA status Error

See the picture for more deails on the error.

I then replaced the Sata cable on ada3 and all was well in the world. Once I booted back into freenas the pool automatically began to resilver the drive I had broke the sata cable on.

Stupidly during this time and a bit before I was doing some pretty heavy file transfers. I completely drew a blank that I was resilvering.

I was even monitoring the resilver process and all was working well till checked back in on with zpool status | more
Code:
root@freenas:~ # zpool status | more
  pool: freenas-boot
state: ONLINE
  scan: scrub repaired 0 in 0 days 00:01:21 with 0 errors on Sun Nov  8 03:46:21 2020
config:

        NAME                                          STATE     READ WRITE CKSUM
        freenas-boot                                  ONLINE       0     0     0
          gptid/4062447f-39d5-11e6-b6f5-0cc47aaa6350  ONLINE       0     0     0

errors: No known data errors

  pool: jailvolume
state: ONLINE
status: Some supported features are not enabled on the pool. The pool can
        still be used, but some features are unavailable.
action: Enable all features using 'zpool upgrade'. Once this is done,
        the pool may no longer be accessible by software that does not support
        the features. See zpool-features(7) for details.
  scan: scrub repaired 0 in 0 days 00:02:23 with 0 errors on Sun Oct 11 00:02:23 2020
config:

        NAME                                          STATE     READ WRITE CKSUM
        jailvolume                                    ONLINE       0     0     0
          gptid/6e52b29a-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0
          gptid/6e6a1f11-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0

errors: No known data errors

  pool: mediavolume
state: DEGRADED
status: One or more devices has experienced an error resulting in data
        corruption.  Applications may be affected.
action: Restore the file in question if possible.  Otherwise restore the
        entire pool from backup.
   see: http://illumos.org/msg/ZFS-8000-8A
  scan: resilvered 63.1G in 0 days 01:07:54 with 0 errors on Tue Nov 10 13:28:34 2020
config:

        NAME                                            STATE     READ WRITE CKSUM
        mediavolume                                     DEGRADED     0     0     0
          raidz1-0                                      DEGRADED     0     0     0
            gptid/0eee7657-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0
            gptid/0f4ba8db-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0
            gptid/dff0074e-3381-11e7-b363-0cc47aaa6350  ONLINE       0     0     0
            gptid/10135bd5-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0
            gptid/c313f53c-bb99-11e8-8f76-0cc47aaa6350  ONLINE       0     0     0
            gptid/10da2a03-3a8c-11e6-b068-0cc47aaa6350  DEGRADED     0     0 2.20M  too many errors
            gptid/113daac7-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0
            gptid/11a059d7-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0
            gptid/12053b6c-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0

errors: 1209 data errors, use '-v' for a list


So then I did
Code:
root@freenas:~ # zpool status -v
  pool: freenas-boot
state: ONLINE
  scan: scrub repaired 0 in 0 days 00:01:21 with 0 errors on Sun Nov  8 03:46:21 2020
config:

        NAME                                          STATE     READ WRITE CKSUM
        freenas-boot                                  ONLINE       0     0     0
          gptid/4062447f-39d5-11e6-b6f5-0cc47aaa6350  ONLINE       0     0     0

errors: No known data errors

  pool: jailvolume
state: ONLINE
status: Some supported features are not enabled on the pool. The pool can
        still be used, but some features are unavailable.
action: Enable all features using 'zpool upgrade'. Once this is done,
        the pool may no longer be accessible by software that does not support
        the features. See zpool-features(7) for details.
  scan: scrub repaired 0 in 0 days 00:02:23 with 0 errors on Sun Oct 11 00:02:23 2020
config:

        NAME                                          STATE     READ WRITE CKSUM
        jailvolume                                    ONLINE       0     0     0
          gptid/6e52b29a-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0
          gptid/6e6a1f11-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0

errors: No known data errors

  pool: mediavolume
state: DEGRADED
status: One or more devices has experienced an error resulting in data
        corruption.  Applications may be affected.
action: Restore the file in question if possible.  Otherwise restore the
        entire pool from backup.
   see: http://illumos.org/msg/ZFS-8000-8A
  scan: resilvered 63.1G in 0 days 01:07:54 with 0 errors on Tue Nov 10 13:28:34 2020
config:

        NAME                                            STATE     READ WRITE CKSUM
        mediavolume                                     DEGRADED     0     0     0
          raidz1-0                                      DEGRADED     0     0     0
            gptid/0eee7657-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0
            gptid/0f4ba8db-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0
            gptid/dff0074e-3381-11e7-b363-0cc47aaa6350  ONLINE       0     0     0
            gptid/10135bd5-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0
            gptid/c313f53c-bb99-11e8-8f76-0cc47aaa6350  ONLINE       0     0     0
            gptid/10da2a03-3a8c-11e6-b068-0cc47aaa6350  DEGRADED     0     0 2.20M  too many errors
            gptid/113daac7-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0
            gptid/11a059d7-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0
            gptid/12053b6c-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0

errors: Permanent errors have been detected in the following files:

        /var/db/system/syslog-f36704f2fe794cb6a75657843255655d/log/mdnsresponder.log
        /var/db/system/syslog-f36704f2fe794cb6a75657843255655d/log/samba4/log.nmbd
        /var/db/system/syslog-f36704f2fe794cb6a75657843255655d/log/cron
        /var/db/system/rrd-f36704f2fe794cb6a75657843255655d/localhost/disk-ada1/disk_octets.rrd
        mediavolume/media@auto-20201110.1100-4w:/4K TV/Game.of.Thrones.S05.2160p.UHD.BluRay.REMUX.HDR.HEVC.Atmos-EPSiLON-xpost/abc.xyz.1b1cc729805fe7.mkv
        mediavolume/media@auto-20201110.1100-4w:/4K TV/Game.of.Thrones.S05.2160p.UHD.BluRay.REMUX.HDR.HEVC.Atmos-EPSiLON-xpost/abc.xyz.bb38aae875b3ac.mkv
        mediavolume/media@auto-20201110.1100-4w:/4K TV/Game.of.Thrones.S05.2160p.UHD.BluRay.REMUX.HDR.HEVC.Atmos-EPSiLON-xpost/abc.xyz.0464ff6fb0bb6e.mkv
        mediavolume/media@auto-20201110.1100-4w:/4K TV/Game.of.Thrones.S05.2160p.UHD.BluRay.REMUX.HDR.HEVC.Atmos-EPSiLON-xpost/abc.xyz.a92b274e1852fe.mkv
        mediavolume/media@auto-20201110.1100-4w:/4K TV/Game.of.Thrones.S05.2160p.UHD.BluRay.REMUX.HDR.HEVC.Atmos-EPSiLON-xpost/abc.xyz.a04ec95bd3ca60.mkv
        mediavolume/media@auto-20201110.1100-4w:/4K TV/Game.of.Thrones.S05.2160p.UHD.BluRay.REMUX.HDR.HEVC.Atmos-EPSiLON-xpost/abc.xyz.dafb99f3c41d12.mkv
        mediavolume/media@auto-20201110.1100-4w:/4K TV/Game.of.Thrones.S05.2160p.UHD.BluRay.REMUX.HDR.HEVC.Atmos-EPSiLON-xpost/abc.xyz.0ee782e74fef14.mkv
        mediavolume/media@auto-20201110.1200-4w:<0x0>
        mediavolume/media@auto-20201110.1200-4w:/TempDownload/Game.of.Thrones.S03.2160p.UHD.BluRay.REMUX.HDR.HEVC.Atmos-EPSiLON-xpost/abc.xyz.f6c78c2a01883a.1.mkv
        mediavolume/media@auto-20201110.1200-4w:/4K TV/Game.of.Thrones.S02.2160p.UHD.BluRay.REMUX.HDR.HEVC.Atmos-EPSiLON-xpost/abc.xyz.1cf8798fde96ed.mkv
        mediavolume/media@auto-20201110.1200-4w:/4K TV/Game.of.Thrones.S02.2160p.UHD.BluRay.REMUX.HDR.HEVC.Atmos-EPSiLON-xpost/abc.xyz.3886cdb827e5d0.mkv
        mediavolume/media@auto-20201110.1200-4w:/TempDownload/Game.of.Thrones.S03.2160p.UHD.BluRay.REMUX.HDR.HEVC.Atmos-EPSiLON-xpost/abc.xyz.f6c78c2a01883a.mkv
        mediavolume/media@auto-20201110.1200-4w:/4K TV/Game.of.Thrones.S02.2160p.UHD.BluRay.REMUX.HDR.HEVC.Atmos-EPSiLON-xpost/abc.xyz.8a62e89cb21f5f.mkv
        mediavolume/media@auto-20201110.1200-4w:/4K TV/Game.of.Thrones.S02.2160p.UHD.BluRay.REMUX.HDR.HEVC.Atmos-EPSiLON-xpost/abc.xyz.9d8b3cb2f8d764.mkv
        mediavolume/media@auto-20201110.1200-4w:/4K TV/Game.of.Thrones.S02.2160p.UHD.BluRay.REMUX.HDR.HEVC.Atmos-EPSiLON-xpost/abc.xyz.0b1ba0ea87336d.mkv
        mediavolume/media@auto-20201110.1200-4w:/4K TV/Game.of.Thrones.S02.2160p.UHD.BluRay.REMUX.HDR.HEVC.Atmos-EPSiLON-xpost/abc.xyz.881d50eb74900f.mkv
        mediavolume/media@auto-20201110.1200-4w:/4K TV/Game.of.Thrones.S02.2160p.UHD.BluRay.REMUX.HDR.HEVC.Atmos-EPSiLON-xpost/abc.xyz.f0ecca1d1f111b.mkv
        mediavolume/media@auto-20201110.1200-4w:/4K TV/Game.of.Thrones.S02.2160p.UHD.BluRay.REMUX.HDR.HEVC.Atmos-EPSiLON-xpost/abc.xyz.80b6c83c139bb8.mkv
        mediavolume/media@auto-20201110.1200-4w:/4K TV/Game.of.Thrones.S02.2160p.UHD.BluRay.REMUX.HDR.HEVC.Atmos-EPSiLON-xpost/abc.xyz.1caa4d84467c5a.mkv
        mediavolume/media@auto-20201110.1200-4w:/TempDownload/Game.of.Thrones.S03.2160p.UHD.BluRay.REMUX.HDR.HEVC.Atmos-EPSiLON-xpost/abc.xyz.158414a42565cc.mkv
        mediavolume/media@auto-20201110.1200-4w:/4K TV/Game.of.Thrones.S02.2160p.UHD.BluRay.REMUX.HDR.HEVC.Atmos-EPSiLON-xpost/abc.xyz.3fb8eea904f762.mkv


Yes I did buy Game of Thrones 4k on the 4th when I was released, just saying.

So I know the drive is not failing or so I think.

What is my best course of action? Is there anything else you need from me?

Any help would be much appreciated.

With much respect,
Andrew
 
Last edited:

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,703
mediavolume/media@auto-20201110.1100-4w
All of the corrupt media files are part of the snapshot, not the current versions, so just destroy that snapshot.

The other files are on your boot pool, so you're in for a rebuild and config restore if you want to put that right.

You'll also want to do a zpool clear and see if those checksum errors return with a scrub before you're really in the clear.
 
Joined
May 10, 2017
Messages
838
Stupidly during this time and a bit before I was doing some pretty heavy file transfers. I completely drew a blank that I was resilvering.

While it's good to avoid heavy I/O during a resilver so it finishes as fast as possible, that by itself should never cause data corruption.
 

AndrewParsons

Dabbler
Joined
Jun 14, 2016
Messages
40
Thank you so much team. So I went back to a previous snapshot and did a scrub. The scrub went well will no errors. So I went ahead and did a zpool clear. about 12 hours later the pool went back into a degraded state with the same drive having issues. Do I need to fix the boot pool first? Could it be the reason for the errors in my main pool? I also did a smart test on the drive in question with the errors.

root@freenas:~ # smartctl -t short /dev/da4 smartctl 7.0 2018-12-30 r4883 [FreeBSD 11.3-RELEASE-p14 amd64] (local build) Copyright (C) 2002-18, Bruce Allen, Christian Franke, www.smartmontools.org === START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION === Sending command: "Execute SMART Short self-test routine immediately in off-line mode". Drive command "Execute SMART Short self-test routine immediately in off-line mode" successful. Testing has begun. Please wait 2 minutes for test to complete. Test will complete after Wed Nov 11 22:28:20 2020 Use smartctl -X to abort test. root@freenas:~ # smartctl -l selftest /dev/da4 smartctl 7.0 2018-12-30 r4883 [FreeBSD 11.3-RELEASE-p14 amd64] (local build) Copyright (C) 2002-18, Bruce Allen, Christian Franke, www.smartmontools.org === START OF READ SMART DATA SECTION === SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Completed without error 00% 38356 - # 2 Short offline Completed without error 00% 38195 - # 3 Short offline Completed without error 00% 37956 - # 4 Extended offline Completed without error 00% 37871 - # 5 Short offline Completed without error 00% 37788 - # 6 Short offline Completed without error 00% 37620 - # 7 Extended offline Completed without error 00% 37535 - # 8 Short offline Completed without error 00% 37452 - # 9 Short offline Completed without error 00% 37236 - #10 Extended offline Completed without error 00% 37152 - #11 Short offline Completed without error 00% 37069 - #12 Short offline Completed without error 00% 36901 - #13 Extended offline Completed without error 00% 36816 - #14 Short offline Completed without error 00% 36733 - #15 Short offline Completed without error 00% 36493 - #16 Extended offline Completed without error 00% 36409 - #17 Short offline Completed without error 00% 36325 - #18 Short offline Completed without error 00% 36157 - #19 Extended offline Completed without error 00% 36073 - #20 Short offline Completed without error 00% 35990 - #21 Short offline Completed without error 00% 35750 -

Thank you for your help thus far. Any direction from here would be much appreciated.

Andrew
 

AndrewParsons

Dabbler
Joined
Jun 14, 2016
Messages
40
and here is the zpool status now.
Code:
root@freenas:~ # zpool status -v
  pool: freenas-boot
 state: ONLINE
  scan: scrub repaired 0 in 0 days 00:01:21 with 0 errors on Sun Nov  8 03:46:21 2020
config:

        NAME                                          STATE     READ WRITE CKSUM
        freenas-boot                                  ONLINE       0     0     0
          gptid/4062447f-39d5-11e6-b6f5-0cc47aaa6350  ONLINE       0     0     0

errors: No known data errors

  pool: jailvolume
 state: ONLINE
status: Some supported features are not enabled on the pool. The pool can
        still be used, but some features are unavailable.
action: Enable all features using 'zpool upgrade'. Once this is done,
        the pool may no longer be accessible by software that does not support
        the features. See zpool-features(7) for details.
  scan: scrub repaired 0 in 0 days 00:02:23 with 0 errors on Sun Oct 11 00:02:23 2020
config:

        NAME                                          STATE     READ WRITE CKSUM
        jailvolume                                    ONLINE       0     0     0
          gptid/6e52b29a-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0
          gptid/6e6a1f11-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0

errors: No known data errors

  pool: mediavolume
 state: DEGRADED
status: One or more devices are faulted in response to persistent errors.
        Sufficient replicas exist for the pool to continue functioning in a
        degraded state.
action: Replace the faulted device, or use 'zpool clear' to mark the device
        repaired.
  scan: scrub repaired 20K in 0 days 10:33:52 with 0 errors on Wed Nov 11 02:57:39 2020
config:

        NAME                                            STATE     READ WRITE CKSUM
        mediavolume                                     DEGRADED     0     0     0
          raidz1-0                                      DEGRADED     0     0     0
            gptid/0eee7657-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0
            gptid/0f4ba8db-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0
            gptid/dff0074e-3381-11e7-b363-0cc47aaa6350  ONLINE       0     0     0
            gptid/10135bd5-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0
            gptid/c313f53c-bb99-11e8-8f76-0cc47aaa6350  ONLINE       0     0     0
            gptid/10da2a03-3a8c-11e6-b068-0cc47aaa6350  FAULTED      6    75     0  too many errors
            gptid/113daac7-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0
            gptid/11a059d7-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0
            gptid/12053b6c-3a8c-11e6-b068-0cc47aaa6350  ONLINE       0     0     0

errors: No known data errors
root@freenas:~ #


Again, thank you for your help thus far.
Andrew
 

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,703
Showing us the bit when you launched the smartctl test is good, but we can't help based on that output...

Normally, you launch a test, wait the specified time (usually 3 minutes for a short, 20 for a long) then run smartctl -a /dev/da... to get the results.

Also, post the output in CODE tags, not inline code.
 

AndrewParsons

Dabbler
Joined
Jun 14, 2016
Messages
40
I am very sorry, I will use CODE tags going forward.

I started a smartctrl -t long /dev/da4 and it estimated it would take 540 minutes. That is a lot longer than 20 minutes and seems strange to me. I will post the results when it concludes.

Thank you for your help.
Andrew
 

danb35

Hall of Famer
Joined
Aug 16, 2011
Messages
15,504
estimated it would take 540 minutes. That is a lot longer than 20 minutes and seems strange to me.
That sounds about right for a 4 TB disk. What made you think it should be closer to 20 minutes?
 

AndrewParsons

Dabbler
Joined
Jun 14, 2016
Messages
40
oh sorry, I must have miss read/understood sretalla last msg. Thank you for clarifying and putting my mind at ease.
 

danb35

Hall of Famer
Joined
Aug 16, 2011
Messages
15,504
Normally, you launch a test, wait the specified time (usually 3 minutes for a short, 20 for a long)
"usually 20 minutes for a long" test? On, what, a 100 GB disk? Multiple hours is the norm for anything I've seen.
 

AndrewParsons

Dabbler
Joined
Jun 14, 2016
Messages
40
First let me thank you all again for your help thus far.

Okay So smartctrl -t long /dev/da4 finished with no errors. Please see below.

Code:
root@freenas:~ # smartctl -a /dev/da4
smartctl 7.0 2018-12-30 r4883 [FreeBSD 11.3-RELEASE-p14 amd64] (local build)
Copyright (C) 2002-18, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Red
Device Model:     WDC WD40EFRX-68WT0N0
Serial Number:    WD-WCC4E0ETNF2R
LU WWN Device Id: 5 0014ee 20cb0ba60
Firmware Version: 82.00A82
User Capacity:    4,000,787,030,016 bytes [4.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Thu Nov 12 18:39:40 2020 MST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (53460) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 534) minutes.
Conveyance self-test routine
recommended polling time:        (   5) minutes.
SCT capabilities:              (0x703d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time            0x0027   183   183   021    Pre-fail  Always       -       7825
  4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       76
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   048   048   000    Old_age   Always       -       38376
 10 Spin_Retry_Count        0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count 0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       76
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       74
193 Load_Cycle_Count        0x0032   200   200   000    Old_age   Always       -       2374
194 Temperature_Celsius     0x0022   118   098   000    Old_age   Always       -       34
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   159   000    Old_age   Always       -       80
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%     38376         -
# 2  Short offline       Completed without error       00%     38356         -
# 3  Short offline       Completed without error       00%     38195         -
# 4  Short offline       Completed without error       00%     37956         -
# 5  Extended offline    Completed without error       00%     37871         -
# 6  Short offline       Completed without error       00%     37788         -
# 7  Short offline       Completed without error       00%     37620         -
# 8  Extended offline    Completed without error       00%     37535         -
# 9  Short offline       Completed without error       00%     37452         -
#10  Short offline       Completed without error       00%     37236         -
#11  Extended offline    Completed without error       00%     37152         -
#12  Short offline       Completed without error       00%     37069         -
#13  Short offline       Completed without error       00%     36901         -
#14  Extended offline    Completed without error       00%     36816         -
#15  Short offline       Completed without error       00%     36733         -
#16  Short offline       Completed without error       00%     36493         -
#17  Extended offline    Completed without error       00%     36409         -
#18  Short offline       Completed without error       00%     36325         -
#19  Short offline       Completed without error       00%     36157         -
#20  Extended offline    Completed without error       00%     36073         -
#21  Short offline       Completed without error       00%     35990         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.



So the scrub went well and the the drive in question passed a long smart test, so I am at a loss as to what is going on. Any suggestions would be most welcome at this point.

Thank you in advance,
Andrew
 

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,703
"usually 20 minutes for a long" test? On, what, a 100 GB disk? Multiple hours is the norm for anything I've seen.
OK, so for some reason i thought I had seen it shown as shorter (maybe it was an SSD)... in any case, it tells you how long to wait.
 

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,703

AndrewParsons

Dabbler
Joined
Jun 14, 2016
Messages
40
So use a different sata cable and possibly a different port? If I place that hd in a different sata port will the pool be okay? Is there something I have to do if I do use a different port?

If a new cable and a new port do not work than it has to be the connection on the HD itself?

Thank you sretalla and all for your help

Andrew
 

sretalla

Powered by Neutrality
Moderator
Joined
Jan 1, 2016
Messages
9,703
No issue with using different ports, pool member disks are identified by the gptid, so arriving from a different port is handled properly.

It could be the connection on the HD side, but that's rather rare.
 

AndrewParsons

Dabbler
Joined
Jun 14, 2016
Messages
40
I just wanted to follow up. I replaced another Sata cable and I have not had an error for some time. Thank you to all that helped.

Andrew
 
Top