hunter
Explorer
- Joined
- Nov 24, 2013
- Messages
- 94
After months of having a ZFS mirror of two drives work fine, the last week my pool seems to continually disengage one of the two drives (the same one) and shortly afterwards begin resilvering it. The resilver completes. Then in 1 -2 days the same thing seems to happen again. Some of the error log entries I found are below. I looked at SMART report for the drive that keeps getting disengaged and did not see errors in the stored runs of SMART.
Can anyone suggest how to solve this problem? The drive that keeps disengaging is a 6 Tb drive and I don't have that size as a spare, and can't tell if anything is wrong with the drive.
Jul 16 06:41:33 freenas smartd[2489]: Device: /dev/ada0, Temperature 51 Celsius reached critical limit of 41 Celsius (Min/Max 48/54)
reached critical limit of 41 Celsius (Min/Max 48/54)
Jul 16 13:41:33 freenas smartd[2489]: Device: /dev/ada1, Temperature 50 Celsius reached critical limit of 41 Celsius (Min/Max 45/51)
Jul 16 13:54:14 freenas ada1 at ahcich4 bus 0 scbus4 target 0 lun 0
Jul 16 13:54:14 freenas ada1: <HGST HDN726060ALE610 APGNT517> s/n NCGTRHJS detached
Jul 16 13:54:14 freenas devd: Executing '[ -e /tmp/.sync_disk_done ] && LD_LIBRARY_PATH=/usr/local/lib /usr/local/bin/python /usr/local/www/freenasUI/tools/sync_disks.py && LD_LIBRARY_PATH=/usr/local/lib /usr/local/bin/python /usr/local/www/freenasUI/tools/smart_alert.py -d ada1'
Jul 16 13:54:14 freenas (ada1:ahcich4:0:0:0): Periph destroyed
Jul 16 13:54:15 freenas devd: Executing 'logger -p kern.notice -t ZFS 'vdev is removed, pool_guid=17897178385600871224 vdev_guid=2516134913956804389''
Jul 16 13:54:15 freenas ZFS: vdev is removed, pool_guid=17897178385600871224 vdev_guid=2516134913956804389
Jul 16 13:54:26 freenas ada1 at ahcich4 bus 0 scbus4 target 0 lun 0
Jul 16 13:54:26 freenas ada1: <HGST HDN726060ALE610 APGNT517> ACS-2 ATA SATA 3.x device
Jul 16 13:54:26 freenas ada1: Serial Number NCGTRHJS
Jul 16 13:54:26 freenas ada1: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes)
Jul 16 13:54:26 freenas ada1: Command Queueing enabled
Jul 16 13:54:26 freenas ada1: 5723166MB (11721045168 512 byte sectors)
Jul 16 13:54:26 freenas ada1: Previously was known as ad12
Jul 16 13:54:26 freenas devd: Executing '[ -e /tmp/.sync_disk_done ] && LD_LIBRARY_PATH=/usr/local/lib /usr/local/bin/python /usr/local/www/freenasUI/tools/sync_disks.py ada1'
Jul 16 13:54:26 freenas devd: Executing 'logger -p kern.notice -t ZFS 'vdev state changed, pool_guid=17897178385600871224 vdev_guid=2516134913956804389''
Jul 16 13:54:26 freenas ZFS: vdev state changed, pool_guid=17897178385600871224 vdev_guid=2516134913956804389
Jul 16 14:11:33 freenas smartd[2489]: Device: /dev/ada0, Temperature 51 Celsius reached critical limit of 41 Celsius (Min/Max 48/54)
Jul 16 14:11:33 freenas smartd[2489]: Device: /dev/ada0, Temperature 51 Celsius reached critical limit of 41 Celsius (Min/Max 48/54)
Jul 16 14:11:33 freenas smartd[2489]: Device: /dev/ada1, Temperature 50 Celsius reached critical limit of 41 Celsius (Min/Max 45/51)
Jul 16 14:16:42 freenas notifier: shutdown: [pid 48089]
Jul 16 14:16:42 freenas notifier: Shutdown NOW!
Jul 16 14:16:42 freenas shutdown: reboot by root:
Jul 16 14:16:42 freenas notifier: Shutdown NOW!
Jul 16 14:16:42 freenas notifier:
Jul 16 14:16:42 freenas notifier: System shutdown time has arrived^G^G
Jul 16 14:19:30 freenas VT(vga): resolution 640x480
Jul 16 14:19:30 freenas CPU: Intel(R) Xeon(R) CPU E3-1230 V2 @ 3.30GHz (3300.10-MHz K8-class CPU)
Jul 16 14:19:30 freenas Origin="GenuineIntel" Id=0x306a9 Family=0x6 Model=0x3a Stepping=9
Jul 16 14:19:30 freenas Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE>
Jul 16 14:19:30 freenas Features2=0x7fbae3ff<SSE3,PCLMULQDQ,DTES64,MON,DS_CPL,VMX,SMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,PCID,SSE4.1,SSE4.2,x2APIC,POPCNT,TSCDLT,AESNI,XSAVE,OSXSAVE,AVX,F16C,RDRAND>
Jul 16 14:19:30 freenas AMD Features=0x28100800<SYSCALL,NX,RDTSCP,LM>
Jul 16 14:19:30 freenas AMD Features2=0x1<LAHF>
Jul 16 14:19:30 freenas Structured Extended Features=0x281<FSGSBASE,SMEP,ERMS>
Jul 16 14:19:30 freenas XSAVE Features=0x1<XSAVEOPT>
Jul 16 14:19:30 freenas VT-x: PAT,HLT,MTF,PAUSE,EPT,UG,VPID
Jul 16 14:19:30 freenas TSC: P-state invariant, performance statistics
Jul 16 14:19:30 freenas real memory = 17985175552 (17152 MB)
Jul 16 14:19:30 freenas avail memory = 16557699072 (15790 MB)
Jul 16 14:19:30 freenas Event timer "LAPIC" quality 600
Jul 16 14:19:30 freenas ACPI APIC Table: <SUPERM SMCI--MB>
Jul 16 14:19:30 freenas FreeBSD/SMP: Multiprocessor System Detected: 8 CPUs
Jul 16 14:19:30 freenas FreeBSD/SMP: 1 package(s) x 4 core(s) x 2 SMT threads
Jul 16 14:19:30 freenas Timecounter "HPET" frequency 14318180 Hz quality 950
Jul 16 14:19:30 freenas Event timer "HPET" frequency 14318180 Hz quality 550
Jul 16 14:19:30 freenas ada0: Previously was known as ad8
Jul 16 14:19:30 freenas ada1 at ahcich4 bus 0 scbus4 target 0 lun 0
Jul 16 14:19:30 freenas ada1: <HGST HDN726060ALE610 APGNT517> ACS-2 ATA SATA 3.x device
Jul 16 14:19:30 freenas ada1: Serial Number NCGTRHJS
Jul 16 14:19:30 freenas ada1: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes)
Jul 16 14:19:30 freenas ada1: Command Queueing enabled
Jul 16 14:19:30 freenas ada1: 5723166MB (11721045168 512 byte sectors)
Jul 16 14:19:30 freenas ada1: Previously was known as ad12
Jul 16 14:19:30 freenas ada2 at ahcich5 bus 0 scbus5 target 0 lun 0
Jul 16 14:19:30 freenas ada2: <WDC WD30EFRX-68EUZN0 80.00A80> ACS-2 ATA SATA 3.x device
Jul 16 14:19:30 freenas ada2: Serial Number WD-WMC4N2323512
Jul 16 14:19:30 freenas ada2: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes)
Jul 16 14:19:30 freenas ada2: Command Queueing enabled
Jul 16 14:19:30 freenas ada2: 2861588MB (5860533168 512 byte sectors)
Jul 16 14:19:30 freenas ada2: quirks=0x1<4K>
Jul 16 14:19:30 freenas ada2: Previously was known as ad14
Jul 16 14:49:38 freenas smartd[2490]: Device: /dev/ada0, Temperature 50 Celsius reached critical limit of 41 Celsius (Min/Max 50/50)
Jul 16 14:49:38 freenas smartd[2490]: Device: /dev/ada0, Temperature 50 Celsius reached critical limit of 41 Celsius (Min/Max 50/50)
Jul 16 14:49:38 freenas smartd[2490]: Device: /dev/ada1, Temperature 49 Celsius reached critical limit of 41 Celsius (Min/Max 49/49)
Jul 16 15:19:38 freenas smartd[2490]: Device: /dev/ada0, Temperature 50 Celsius reached critical limit of 41 Celsius (Min/Max 50/50)
Jul 16 16:19:38 freenas smartd[2490]: Device: /dev/ada0, Temperature 50 Celsius reached critical limit of 41 Celsius (Min/Max 50/50)
Jul 16 16:19:38 freenas smartd[2490]: Device: /dev/ada0, Temperature 50 Celsius reached critical limit of 41 Celsius (Min/Max 50/50)
Jul 16 16:19:38 freenas smartd[2490]: Device: /dev/ada1, Temperature 49 Celsius reached critical limit of 41 Celsius (Min/Max 49/49)
Jul 16 16:42:17 freenas notifier: Stopping smartd.
Jul 16 16:42:17 freenas notifier: Waiting for PIDS: 2490.
Jul 16 16:42:17 freenas notifier: smartd not running? (check /var/run/smartd.pid).
Jul 16 16:42:17 freenas notifier: Starting smartd.
Jul 16 18:59:57 freenas ada1 at ahcich4 bus 0 scbus4 target 0 lun 0
Jul 16 18:59:57 freenas ada1: <HGST HDN726060ALE610 APGNT517> s/n NCGTRHJS detached
Jul 16 18:59:57 freenas devd: Executing '[ -e /tmp/.sync_disk_done ] && LD_LIBRARY_PATH=/usr/local/lib /usr/local/bin/python /usr/local/www/freenasUI/tools/sync_disks.py && LD_LIBRARY_PATH=/usr/local/lib /usr/local/bin/python /usr/local/www/freenasUI/tools/smart_alert.py -d ada1'
Jul 16 18:59:57 freenas GEOM_ELI: Device ada1p1.eli destroyed.
Jul 16 18:59:57 freenas GEOM_ELI: Detached ada1p1.eli on last close.
Jul 16 18:59:57 freenas (ada1:ahcich4:0:0:0): Periph destroyed
Jul 16 18:59:58 freenas devd: Executing 'logger -p kern.notice -t ZFS 'vdev is removed, pool_guid=17897178385600871224 vdev_guid=2516134913956804389''
Jul 16 18:59:58 freenas ZFS: vdev is removed, pool_guid=17897178385600871224 vdev_guid=2516134913956804389
Jul 16 19:00:09 freenas ada1 at ahcich4 bus 0 scbus4 target 0 lun 0
Jul 16 19:00:09 freenas ada1: <HGST HDN726060ALE610 APGNT517> ACS-2 ATA SATA 3.x device
Jul 16 19:00:09 freenas ada1: Serial Number NCGTRHJS
Jul 16 19:00:09 freenas ada1: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes)
Jul 16 19:00:09 freenas ada1: Command Queueing enabled
Jul 16 19:00:09 freenas ada1: 5723166MB (11721045168 512 byte sectors)
Jul 16 19:00:09 freenas ada1: Previously was known as ad12
Jul 16 19:00:09 freenas devd: Executing '[ -e /tmp/.sync_disk_done ] && LD_LIBRARY_PATH=/usr/local/lib /usr/local/bin/python /usr/local/www/freenasUI/tools/sync_disks.py ada1'
Jul 16 19:00:10 freenas devd: Executing 'logger -p kern.notice -t ZFS 'vdev state changed, pool_guid=17897178385600871224 vdev_guid=2516134913956804389''
Jul 16 19:00:10 freenas ZFS: vdev state changed, pool_guid=17897178385600871224 vdev_guid=2516134913956804389
Jul 16 20:16:13 freenas notifier: Stopping smartd.
Jul 16 20:16:13 freenas notifier: Waiting for PIDS: 29537.
Stop refresh
Can anyone suggest how to solve this problem? The drive that keeps disengaging is a 6 Tb drive and I don't have that size as a spare, and can't tell if anything is wrong with the drive.
Jul 16 06:41:33 freenas smartd[2489]: Device: /dev/ada0, Temperature 51 Celsius reached critical limit of 41 Celsius (Min/Max 48/54)
reached critical limit of 41 Celsius (Min/Max 48/54)
Jul 16 13:41:33 freenas smartd[2489]: Device: /dev/ada1, Temperature 50 Celsius reached critical limit of 41 Celsius (Min/Max 45/51)
Jul 16 13:54:14 freenas ada1 at ahcich4 bus 0 scbus4 target 0 lun 0
Jul 16 13:54:14 freenas ada1: <HGST HDN726060ALE610 APGNT517> s/n NCGTRHJS detached
Jul 16 13:54:14 freenas devd: Executing '[ -e /tmp/.sync_disk_done ] && LD_LIBRARY_PATH=/usr/local/lib /usr/local/bin/python /usr/local/www/freenasUI/tools/sync_disks.py && LD_LIBRARY_PATH=/usr/local/lib /usr/local/bin/python /usr/local/www/freenasUI/tools/smart_alert.py -d ada1'
Jul 16 13:54:14 freenas (ada1:ahcich4:0:0:0): Periph destroyed
Jul 16 13:54:15 freenas devd: Executing 'logger -p kern.notice -t ZFS 'vdev is removed, pool_guid=17897178385600871224 vdev_guid=2516134913956804389''
Jul 16 13:54:15 freenas ZFS: vdev is removed, pool_guid=17897178385600871224 vdev_guid=2516134913956804389
Jul 16 13:54:26 freenas ada1 at ahcich4 bus 0 scbus4 target 0 lun 0
Jul 16 13:54:26 freenas ada1: <HGST HDN726060ALE610 APGNT517> ACS-2 ATA SATA 3.x device
Jul 16 13:54:26 freenas ada1: Serial Number NCGTRHJS
Jul 16 13:54:26 freenas ada1: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes)
Jul 16 13:54:26 freenas ada1: Command Queueing enabled
Jul 16 13:54:26 freenas ada1: 5723166MB (11721045168 512 byte sectors)
Jul 16 13:54:26 freenas ada1: Previously was known as ad12
Jul 16 13:54:26 freenas devd: Executing '[ -e /tmp/.sync_disk_done ] && LD_LIBRARY_PATH=/usr/local/lib /usr/local/bin/python /usr/local/www/freenasUI/tools/sync_disks.py ada1'
Jul 16 13:54:26 freenas devd: Executing 'logger -p kern.notice -t ZFS 'vdev state changed, pool_guid=17897178385600871224 vdev_guid=2516134913956804389''
Jul 16 13:54:26 freenas ZFS: vdev state changed, pool_guid=17897178385600871224 vdev_guid=2516134913956804389
Jul 16 14:11:33 freenas smartd[2489]: Device: /dev/ada0, Temperature 51 Celsius reached critical limit of 41 Celsius (Min/Max 48/54)
Jul 16 14:11:33 freenas smartd[2489]: Device: /dev/ada0, Temperature 51 Celsius reached critical limit of 41 Celsius (Min/Max 48/54)
Jul 16 14:11:33 freenas smartd[2489]: Device: /dev/ada1, Temperature 50 Celsius reached critical limit of 41 Celsius (Min/Max 45/51)
Jul 16 14:16:42 freenas notifier: shutdown: [pid 48089]
Jul 16 14:16:42 freenas notifier: Shutdown NOW!
Jul 16 14:16:42 freenas shutdown: reboot by root:
Jul 16 14:16:42 freenas notifier: Shutdown NOW!
Jul 16 14:16:42 freenas notifier:
Jul 16 14:16:42 freenas notifier: System shutdown time has arrived^G^G
Jul 16 14:19:30 freenas VT(vga): resolution 640x480
Jul 16 14:19:30 freenas CPU: Intel(R) Xeon(R) CPU E3-1230 V2 @ 3.30GHz (3300.10-MHz K8-class CPU)
Jul 16 14:19:30 freenas Origin="GenuineIntel" Id=0x306a9 Family=0x6 Model=0x3a Stepping=9
Jul 16 14:19:30 freenas Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE>
Jul 16 14:19:30 freenas Features2=0x7fbae3ff<SSE3,PCLMULQDQ,DTES64,MON,DS_CPL,VMX,SMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,PCID,SSE4.1,SSE4.2,x2APIC,POPCNT,TSCDLT,AESNI,XSAVE,OSXSAVE,AVX,F16C,RDRAND>
Jul 16 14:19:30 freenas AMD Features=0x28100800<SYSCALL,NX,RDTSCP,LM>
Jul 16 14:19:30 freenas AMD Features2=0x1<LAHF>
Jul 16 14:19:30 freenas Structured Extended Features=0x281<FSGSBASE,SMEP,ERMS>
Jul 16 14:19:30 freenas XSAVE Features=0x1<XSAVEOPT>
Jul 16 14:19:30 freenas VT-x: PAT,HLT,MTF,PAUSE,EPT,UG,VPID
Jul 16 14:19:30 freenas TSC: P-state invariant, performance statistics
Jul 16 14:19:30 freenas real memory = 17985175552 (17152 MB)
Jul 16 14:19:30 freenas avail memory = 16557699072 (15790 MB)
Jul 16 14:19:30 freenas Event timer "LAPIC" quality 600
Jul 16 14:19:30 freenas ACPI APIC Table: <SUPERM SMCI--MB>
Jul 16 14:19:30 freenas FreeBSD/SMP: Multiprocessor System Detected: 8 CPUs
Jul 16 14:19:30 freenas FreeBSD/SMP: 1 package(s) x 4 core(s) x 2 SMT threads
Jul 16 14:19:30 freenas Timecounter "HPET" frequency 14318180 Hz quality 950
Jul 16 14:19:30 freenas Event timer "HPET" frequency 14318180 Hz quality 550
Jul 16 14:19:30 freenas ada0: Previously was known as ad8
Jul 16 14:19:30 freenas ada1 at ahcich4 bus 0 scbus4 target 0 lun 0
Jul 16 14:19:30 freenas ada1: <HGST HDN726060ALE610 APGNT517> ACS-2 ATA SATA 3.x device
Jul 16 14:19:30 freenas ada1: Serial Number NCGTRHJS
Jul 16 14:19:30 freenas ada1: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes)
Jul 16 14:19:30 freenas ada1: Command Queueing enabled
Jul 16 14:19:30 freenas ada1: 5723166MB (11721045168 512 byte sectors)
Jul 16 14:19:30 freenas ada1: Previously was known as ad12
Jul 16 14:19:30 freenas ada2 at ahcich5 bus 0 scbus5 target 0 lun 0
Jul 16 14:19:30 freenas ada2: <WDC WD30EFRX-68EUZN0 80.00A80> ACS-2 ATA SATA 3.x device
Jul 16 14:19:30 freenas ada2: Serial Number WD-WMC4N2323512
Jul 16 14:19:30 freenas ada2: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes)
Jul 16 14:19:30 freenas ada2: Command Queueing enabled
Jul 16 14:19:30 freenas ada2: 2861588MB (5860533168 512 byte sectors)
Jul 16 14:19:30 freenas ada2: quirks=0x1<4K>
Jul 16 14:19:30 freenas ada2: Previously was known as ad14
Jul 16 14:49:38 freenas smartd[2490]: Device: /dev/ada0, Temperature 50 Celsius reached critical limit of 41 Celsius (Min/Max 50/50)
Jul 16 14:49:38 freenas smartd[2490]: Device: /dev/ada0, Temperature 50 Celsius reached critical limit of 41 Celsius (Min/Max 50/50)
Jul 16 14:49:38 freenas smartd[2490]: Device: /dev/ada1, Temperature 49 Celsius reached critical limit of 41 Celsius (Min/Max 49/49)
Jul 16 15:19:38 freenas smartd[2490]: Device: /dev/ada0, Temperature 50 Celsius reached critical limit of 41 Celsius (Min/Max 50/50)
Jul 16 16:19:38 freenas smartd[2490]: Device: /dev/ada0, Temperature 50 Celsius reached critical limit of 41 Celsius (Min/Max 50/50)
Jul 16 16:19:38 freenas smartd[2490]: Device: /dev/ada0, Temperature 50 Celsius reached critical limit of 41 Celsius (Min/Max 50/50)
Jul 16 16:19:38 freenas smartd[2490]: Device: /dev/ada1, Temperature 49 Celsius reached critical limit of 41 Celsius (Min/Max 49/49)
Jul 16 16:42:17 freenas notifier: Stopping smartd.
Jul 16 16:42:17 freenas notifier: Waiting for PIDS: 2490.
Jul 16 16:42:17 freenas notifier: smartd not running? (check /var/run/smartd.pid).
Jul 16 16:42:17 freenas notifier: Starting smartd.
Jul 16 18:59:57 freenas ada1 at ahcich4 bus 0 scbus4 target 0 lun 0
Jul 16 18:59:57 freenas ada1: <HGST HDN726060ALE610 APGNT517> s/n NCGTRHJS detached
Jul 16 18:59:57 freenas devd: Executing '[ -e /tmp/.sync_disk_done ] && LD_LIBRARY_PATH=/usr/local/lib /usr/local/bin/python /usr/local/www/freenasUI/tools/sync_disks.py && LD_LIBRARY_PATH=/usr/local/lib /usr/local/bin/python /usr/local/www/freenasUI/tools/smart_alert.py -d ada1'
Jul 16 18:59:57 freenas GEOM_ELI: Device ada1p1.eli destroyed.
Jul 16 18:59:57 freenas GEOM_ELI: Detached ada1p1.eli on last close.
Jul 16 18:59:57 freenas (ada1:ahcich4:0:0:0): Periph destroyed
Jul 16 18:59:58 freenas devd: Executing 'logger -p kern.notice -t ZFS 'vdev is removed, pool_guid=17897178385600871224 vdev_guid=2516134913956804389''
Jul 16 18:59:58 freenas ZFS: vdev is removed, pool_guid=17897178385600871224 vdev_guid=2516134913956804389
Jul 16 19:00:09 freenas ada1 at ahcich4 bus 0 scbus4 target 0 lun 0
Jul 16 19:00:09 freenas ada1: <HGST HDN726060ALE610 APGNT517> ACS-2 ATA SATA 3.x device
Jul 16 19:00:09 freenas ada1: Serial Number NCGTRHJS
Jul 16 19:00:09 freenas ada1: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes)
Jul 16 19:00:09 freenas ada1: Command Queueing enabled
Jul 16 19:00:09 freenas ada1: 5723166MB (11721045168 512 byte sectors)
Jul 16 19:00:09 freenas ada1: Previously was known as ad12
Jul 16 19:00:09 freenas devd: Executing '[ -e /tmp/.sync_disk_done ] && LD_LIBRARY_PATH=/usr/local/lib /usr/local/bin/python /usr/local/www/freenasUI/tools/sync_disks.py ada1'
Jul 16 19:00:10 freenas devd: Executing 'logger -p kern.notice -t ZFS 'vdev state changed, pool_guid=17897178385600871224 vdev_guid=2516134913956804389''
Jul 16 19:00:10 freenas ZFS: vdev state changed, pool_guid=17897178385600871224 vdev_guid=2516134913956804389
Jul 16 20:16:13 freenas notifier: Stopping smartd.
Jul 16 20:16:13 freenas notifier: Waiting for PIDS: 29537.
Stop refresh