Resilvering keeps restarting

FlyingPersian · Dec 17, 2023

Hello,

as posted in this thread, I've started replacing my WD Red 8TB (WD80EFAX) drives with Seagate Exos X X18 18 TB drives. I started on Thursday at 10 pm (German time) and since then the resilvering has restarted multiple times.

This screenshot is from Saturday 10 am:

This is from today, Sunday, 12 pm:

Code:

root@truenas[~]# zpool status -v
  pool: Data
 state: DEGRADED
status: One or more devices is currently being resilvered.  The pool will
        continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
  scan: resilver in progress since Sun Dec 17 09:54:05 2023
        7.56T scanned at 1017M/s, 5.12T issued at 690M/s, 42.3T total
        890G resilvered, 12.11% done, 15:42:19 to go
config:

        NAME                                              STATE     READ WRITE CKSUM
        Data                                              DEGRADED     0     0     0
          raidz2-0                                        DEGRADED     0     0     0
            replacing-0                                   DEGRADED     0     0     0
              gptid/111dd78e-103b-11e9-b74e-0cc47a406253  OFFLINE      0     0     0
              gptid/f16a1008-9ac4-11ee-91ae-0cc47a406253  ONLINE       0     0     0  (resilvering)
            gptid/f862c9ea-00b4-11ea-9daf-0cc47a406253    ONLINE       0     0     0
            gptid/e79d9e0e-f731-11e9-9ef4-0cc47a406253    ONLINE       0     0     0
            gptid/ed4cf2d0-e4eb-11e8-bded-0cc47a406253    ONLINE       0     0     0
            ada3p2                                        ONLINE       0     0     0
            gptid/b9c40f77-68f3-11e8-a08d-0cc47a406253    ONLINE       0     0     0  (resilvering)

errors: No known data errors

I'm opening a separate thread because most people are probably not reading up the old one where I asked for hardware recommendations.

Is this normal behavior or is something going wrong? What can I do?

FlyingPersian · Dec 17, 2023

It restarted again:

Code:

root@truenas[~]# zpool status -v
  pool: Data
 state: DEGRADED
status: One or more devices is currently being resilvered.  The pool will
        continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
  scan: resilver in progress since Sun Dec 17 20:14:34 2023
        2.83T scanned at 47.6G/s, 27.7G issued at 465M/s, 42.3T total
        4.56G resilvered, 0.06% done, 1 days 02:30:30 to go
config:

        NAME                                              STATE     READ WRITE C                                                       KSUM
        Data                                              DEGRADED     0     0                                                            0
          raidz2-0                                        DEGRADED     0     0                                                            0
            replacing-0                                   DEGRADED     0     0                                                            0
              gptid/111dd78e-103b-11e9-b74e-0cc47a406253  OFFLINE      0     0                                                            0
              gptid/f16a1008-9ac4-11ee-91ae-0cc47a406253  ONLINE       0     0                                                            0  (resilvering)
            gptid/f862c9ea-00b4-11ea-9daf-0cc47a406253    ONLINE       0     0                                                            0
            gptid/e79d9e0e-f731-11e9-9ef4-0cc47a406253    ONLINE       0     0                                                            0
            gptid/ed4cf2d0-e4eb-11e8-bded-0cc47a406253    ONLINE       0     0                                                            0
            ada3p2                                        ONLINE       0     0                                                            0
            gptid/b9c40f77-68f3-11e8-a08d-0cc47a406253    ONLINE       0     0                                                            0

errors: No known data errors

What I noticed in the console in the GUI that there was a message right before the resilvering re-started:

Code:

Dec 17 20:14:04 truenas ada1 at ahcich1 bus 0 scbus1 target 0 lun 0
Dec 17 20:14:04 truenas ada1: <WDC WD80EFZX-68UW8N0 83.H0A83> ACS-2 ATA SATA 3.x device
Dec 17 20:14:04 truenas ada1: Serial Number xxx1
Dec 17 20:14:04 truenas ada1: 600.000MB/s transfers (SATA 3.x, UDMA6, PIO 8192bytes)
Dec 17 20:14:04 truenas ada1: Command Queueing enabled
Dec 17 20:14:04 truenas ada1: 7630885MB (15628053168 512 byte sectors)
Dec 17 20:14:04 truenas ada1 at ahcich1 bus 0 scbus1 target 0 lun 0
Dec 17 20:14:04 truenas ada1: <WDC WD80EFZX-68UW8N0 83.H0A83> s/n xxx1 detached
Dec 17 20:14:04 truenas (ada1:ahcich1:0:0:0): Periph destroyed

Seems like that disk (ada1) keeps detaching. That message kept repeating over and over. Seems to be an issue with that disk? Maybe a loose cable?

How do I stop the resilvering?

Important Announcement for the TrueNAS Community.

Resilvering keeps restarting

FlyingPersian

Patron

FlyingPersian

Patron

Similar threads

Important Announcement for the TrueNAS Community.

Resilvering keeps restarting

FlyingPersian

Patron

FlyingPersian

Patron

Important Announcement for the TrueNAS Community.

Related topics on forums.truenas.com for thread: "Resilvering keeps restarting"

Similar threads