Pool says Unhealthy and Not seeing why

rob90033

Dabbler
Joined
Jan 1, 2015
Messages
20
Noob here
System AMD Ryzen 3900x with asrock motherboard, 32 GB ECC Memory and 14 10tb and 2 6TB NAS Drives and one SSD for boot. I am not showing any that are faulted or failed. Status is Online yet the pool says unhealthy.

If I need to provide something else please let me know.
 

Samuel Tai

Never underestimate your own stupidity
Moderator
Joined
Apr 24, 2020
Messages
5,399
Please provide the output of zpool status -v <name of your pool>.
 

rob90033

Dabbler
Joined
Jan 1, 2015
Messages
20
Warning: settings changed through the CLI are not written to
the configuration database and will be reset on reboot.

root@truenas[~]# zpool status -v RobPool
pool: RobPool
state: ONLINE
status: One or more devices has experienced an unrecoverable error. An
attempt was made to correct the error. Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
using 'zpool clear' or replace the device with 'zpool replace'.
see: https://openzfs.github.io/openzfs-docs/msg/ZFS-8000-9P
scan: resilvered 2.06M in 00:00:00 with 0 errors on Fri Feb 12 11:50:30 2021
config:

NAME STATE READ WRITE CKSUM
RobPool ONLINE 0 0 0
raidz1-0 ONLINE 0 0 0
gptid/2b06a3de-4c31-11eb-a6ad-a8a1591008b1 ONLINE 0 0 0
gptid/2bb2947e-4c31-11eb-a6ad-a8a1591008b1 ONLINE 0 0 0
gptid/2bc00cef-4c31-11eb-a6ad-a8a1591008b1 ONLINE 0 0 0
gptid/2bdbb4d8-4c31-11eb-a6ad-a8a1591008b1 ONLINE 0 0 0
gptid/2c109440-4c31-11eb-a6ad-a8a1591008b1 ONLINE 0 0 0
raidz1-1 ONLINE 0 0 0
gptid/20ec90c2-5de7-11eb-88c0-a8a1591008b1 ONLINE 0 0 0
gptid/a946e568-522a-11eb-a6ad-a8a1591008b1 ONLINE 0 0 0
gptid/a9595cf6-522a-11eb-a6ad-a8a1591008b1 ONLINE 0 022
 

Hellione

Explorer
Joined
Jan 23, 2021
Messages
55
the last drive has 22 checksum errors ^^
 

Samuel Tai

Never underestimate your own stupidity
Moderator
Joined
Apr 24, 2020
Messages
5,399
Your disk gptid/a9595cf6-522a-11eb-a6ad-a8a1591008b1 is showing 22 checksum errors. This is typically a sign the disk will fail soon. To figure out which disk this is, run midclt call pool.query | jq, which will output JSON code. One of the blocks will have this gptid, and will state which disk (e.g., ada4) this is. Once you've located the disk, run the procedure to replace a bad disk.

 

HoneyBadger

actually does care
Administrator
Moderator
iXsystems
Joined
Feb 6, 2014
Messages
5,112
While the steps to ID the disk are accurate, before jumping to "replace the disk" I would suggest "check the output of smartctl -a /dev/adaX for signs of failing hardware" and maybe "replace the cable" first, especially if it has signs of sharp bends or potential internal stress points.
 
Top