Pool says Unhealthy and Not seeing why

rob90033 · Feb 12, 2021

Noob here
System AMD Ryzen 3900x with asrock motherboard, 32 GB ECC Memory and 14 10tb and 2 6TB NAS Drives and one SSD for boot. I am not showing any that are faulted or failed. Status is Online yet the pool says unhealthy.

If I need to provide something else please let me know.

Samuel Tai · Feb 12, 2021

Please provide the output of zpool status -v <name of your pool>.

rob90033 · Feb 12, 2021

Warning: settings changed through the CLI are not written to
the configuration database and will be reset on reboot.

root@truenas[~]# zpool status -v RobPool
pool: RobPool
state: ONLINE
status: One or more devices has experienced an unrecoverable error. An
attempt was made to correct the error. Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
using 'zpool clear' or replace the device with 'zpool replace'.
see: https://openzfs.github.io/openzfs-docs/msg/ZFS-8000-9P
scan: resilvered 2.06M in 00:00:00 with 0 errors on Fri Feb 12 11:50:30 2021
config:

NAME STATE READ WRITE CKSUM
RobPool ONLINE 0 0 0
raidz1-0 ONLINE 0 0 0
gptid/2b06a3de-4c31-11eb-a6ad-a8a1591008b1 ONLINE 0 0 0
gptid/2bb2947e-4c31-11eb-a6ad-a8a1591008b1 ONLINE 0 0 0
gptid/2bc00cef-4c31-11eb-a6ad-a8a1591008b1 ONLINE 0 0 0
gptid/2bdbb4d8-4c31-11eb-a6ad-a8a1591008b1 ONLINE 0 0 0
gptid/2c109440-4c31-11eb-a6ad-a8a1591008b1 ONLINE 0 0 0
raidz1-1 ONLINE 0 0 0
gptid/20ec90c2-5de7-11eb-88c0-a8a1591008b1 ONLINE 0 0 0
gptid/a946e568-522a-11eb-a6ad-a8a1591008b1 ONLINE 0 0 0
gptid/a9595cf6-522a-11eb-a6ad-a8a1591008b1 ONLINE 0 022

Hellione · Feb 12, 2021

the last drive has 22 checksum errors ^^

Samuel Tai · Feb 12, 2021

Your disk gptid/a9595cf6-522a-11eb-a6ad-a8a1591008b1 is showing 22 checksum errors. This is typically a sign the disk will fail soon. To figure out which disk this is, run midclt call pool.query | jq, which will output JSON code. One of the blocks will have this gptid, and will state which disk (e.g., ada4) this is. Once you've located the disk, run the procedure to replace a bad disk.

10. Storage — FreeNAS®11.3-U5 User Guide Table of Contents

www.ixsystems.com

HoneyBadger · Feb 12, 2021

While the steps to ID the disk are accurate, before jumping to "replace the disk" I would suggest "check the output of smartctl -a /dev/adaX for signs of failing hardware" and maybe "replace the cable" first, especially if it has signs of sharp bends or potential internal stress points.

Important Announcement for the TrueNAS Community.

Pool says Unhealthy and Not seeing why

rob90033

Dabbler

Samuel Tai

Never underestimate your own stupidity

rob90033

Dabbler

Hellione

Explorer

Samuel Tai

Never underestimate your own stupidity

10. Storage — FreeNAS®11.3-U5 User Guide Table of Contents

HoneyBadger

actually does care

Similar threads

Important Announcement for the TrueNAS Community.

Pool says Unhealthy and Not seeing why

rob90033

Dabbler

Samuel Tai

Never underestimate your own stupidity

rob90033

Dabbler

Hellione

Explorer

Samuel Tai

Never underestimate your own stupidity

10. Storage — FreeNAS®11.3-U5 User Guide Table of Contents

HoneyBadger

actually does care

Important Announcement for the TrueNAS Community.

Related topics on forums.truenas.com for thread: "Pool says Unhealthy and Not seeing why"

Similar threads