Resilvering keeps restarting

FlyingPersian

Patron
Joined
Jan 27, 2014
Messages
237
Hello,

as posted in this thread, I've started replacing my WD Red 8TB (WD80EFAX) drives with Seagate Exos X X18 18 TB drives. I started on Thursday at 10 pm (German time) and since then the resilvering has restarted multiple times.

This screenshot is from Saturday 10 am:

1000136578-jpg.73492


This is from today, Sunday, 12 pm:

Code:
root@truenas[~]# zpool status -v
  pool: Data
 state: DEGRADED
status: One or more devices is currently being resilvered.  The pool will
        continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
  scan: resilver in progress since Sun Dec 17 09:54:05 2023
        7.56T scanned at 1017M/s, 5.12T issued at 690M/s, 42.3T total
        890G resilvered, 12.11% done, 15:42:19 to go
config:

        NAME                                              STATE     READ WRITE CKSUM
        Data                                              DEGRADED     0     0     0
          raidz2-0                                        DEGRADED     0     0     0
            replacing-0                                   DEGRADED     0     0     0
              gptid/111dd78e-103b-11e9-b74e-0cc47a406253  OFFLINE      0     0     0
              gptid/f16a1008-9ac4-11ee-91ae-0cc47a406253  ONLINE       0     0     0  (resilvering)
            gptid/f862c9ea-00b4-11ea-9daf-0cc47a406253    ONLINE       0     0     0
            gptid/e79d9e0e-f731-11e9-9ef4-0cc47a406253    ONLINE       0     0     0
            gptid/ed4cf2d0-e4eb-11e8-bded-0cc47a406253    ONLINE       0     0     0
            ada3p2                                        ONLINE       0     0     0
            gptid/b9c40f77-68f3-11e8-a08d-0cc47a406253    ONLINE       0     0     0  (resilvering)

errors: No known data errors


I'm opening a separate thread because most people are probably not reading up the old one where I asked for hardware recommendations.

Is this normal behavior or is something going wrong? What can I do?
 

FlyingPersian

Patron
Joined
Jan 27, 2014
Messages
237
It restarted again:

Code:
root@truenas[~]# zpool status -v
  pool: Data
 state: DEGRADED
status: One or more devices is currently being resilvered.  The pool will
        continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
  scan: resilver in progress since Sun Dec 17 20:14:34 2023
        2.83T scanned at 47.6G/s, 27.7G issued at 465M/s, 42.3T total
        4.56G resilvered, 0.06% done, 1 days 02:30:30 to go
config:

        NAME                                              STATE     READ WRITE C                                                       KSUM
        Data                                              DEGRADED     0     0                                                            0
          raidz2-0                                        DEGRADED     0     0                                                            0
            replacing-0                                   DEGRADED     0     0                                                            0
              gptid/111dd78e-103b-11e9-b74e-0cc47a406253  OFFLINE      0     0                                                            0
              gptid/f16a1008-9ac4-11ee-91ae-0cc47a406253  ONLINE       0     0                                                            0  (resilvering)
            gptid/f862c9ea-00b4-11ea-9daf-0cc47a406253    ONLINE       0     0                                                            0
            gptid/e79d9e0e-f731-11e9-9ef4-0cc47a406253    ONLINE       0     0                                                            0
            gptid/ed4cf2d0-e4eb-11e8-bded-0cc47a406253    ONLINE       0     0                                                            0
            ada3p2                                        ONLINE       0     0                                                            0
            gptid/b9c40f77-68f3-11e8-a08d-0cc47a406253    ONLINE       0     0                                                            0

errors: No known data errors


What I noticed in the console in the GUI that there was a message right before the resilvering re-started:

Code:
Dec 17 20:14:04 truenas ada1 at ahcich1 bus 0 scbus1 target 0 lun 0
Dec 17 20:14:04 truenas ada1: <WDC WD80EFZX-68UW8N0 83.H0A83> ACS-2 ATA SATA 3.x device
Dec 17 20:14:04 truenas ada1: Serial Number xxx1
Dec 17 20:14:04 truenas ada1: 600.000MB/s transfers (SATA 3.x, UDMA6, PIO 8192bytes)
Dec 17 20:14:04 truenas ada1: Command Queueing enabled
Dec 17 20:14:04 truenas ada1: 7630885MB (15628053168 512 byte sectors)
Dec 17 20:14:04 truenas ada1 at ahcich1 bus 0 scbus1 target 0 lun 0
Dec 17 20:14:04 truenas ada1: <WDC WD80EFZX-68UW8N0 83.H0A83> s/n xxx1 detached
Dec 17 20:14:04 truenas (ada1:ahcich1:0:0:0): Periph destroyed


Seems like that disk (ada1) keeps detaching. That message kept repeating over and over. Seems to be an issue with that disk? Maybe a loose cable?

How do I stop the resilvering?
 
Top