Pool Online (Unhealthy)

mx2004

Cadet
Joined
Nov 5, 2022
Messages
4
Im quite new to TrueNAS, but managed to get a NAS up and running. But now after about 5 months of sometimes being turned on it now shows the pool as "Online (Unhealthy)", all drives are online with no read/write/checksum errors.
AMD Ryzen 5 3400G with stock cooler
4x 16GB G.Skill Aegis DDR4-3000 CL16
ASRock B450M Pro4 R2.0
Fujitsue 9211-8i SAS HBA
960GB Kingston A400
HP 560SFP+ dual ethernet card 10Gb
Sharkoon 550W WPM Gold Zero
Is that normal or should i worry?
 

Redcoat

MVP
Joined
Feb 18, 2014
Messages
2,925
Where do i find this?

In the Forum Rules (in red on Masthead) you'll find the following:

"When discussing a piece of software, please provide the exact build number. For TrueNAS, this can be found at System ‣ Information. You'll also want to post the amount of RAM in the system. If additional hardware information is needed, you will be asked to attach a debug file. Debug files are created with the System ‣ Advanced ‣ Save Debug menu entry.

It's important to remember that hardware information is extremely important when diagnosing problems so please make sure to include this information if you need help troubleshooting a problem. This includes:
  • Motherboard make and model
  • CPU make and model
  • RAM quantity
  • Hard drives, quantity, model numbers, and RAID configuration, including boot drives
  • Hard disk controllers
  • Network cards"

Provide that, plus open the Shell in TrueNAS. On the command line typezpool status -v and screen copy the response and paste in your response. post here.

If you have SSH set up (though seems unlikely from your post so far...) use SSH to get the response to the command's entry and paste the code in here, preferably in code tags.
 
Last edited:

mx2004

Cadet
Joined
Nov 5, 2022
Messages
4


Hardware:
ASRock B450M Pro4 R2.0
AMD Ryzen 5 3400G with stock cooler
4x 16GB G.Skill Aegis DDR4-3000 CL16
8x 960GB Kingston A400 in RaidZ2
1x Crucial BX500 240GB
Fujitsu 9211-8i SAS HBA
HP 560SFP+ dual ethernet card 10Gb
 

Attachments

  • debug-truenas-20231002191812.tgz
    889.4 KB · Views: 67

mx2004

Cadet
Joined
Nov 5, 2022
Messages
4


Hardware:
ASRock B450M Pro4 R2.0
AMD Ryzen 5 3400G with stock cooler
4x 16GB G.Skill Aegis DDR4-3000 CL16
8x 960GB Kingston A400 in RaidZ2
1x Crucial BX500 240GB
Fujitsu 9211-8i SAS HBA
HP 560SFP+ dual ethernet card 10Gb
Code:
root@truenas[~]# zpool status -v
  pool: Vault
 state: ONLINE
status: One or more devices has experienced an error resulting in data
        corruption.  Applications may be affected.
action: Restore the file in question if possible.  Otherwise restore the
        entire pool from backup.
   see: https://openzfs.github.io/openzfs-docs/msg/ZFS-8000-8A
  scan: scrub repaired 236K in 00:05:25 with 4 errors on Sun Sep 10 00:05:25 2023
config:

        NAME                                            STATE     READ WRITE CKSUM
        Vault                                           ONLINE       0     0 0
          raidz2-0                                      ONLINE       0     0 0
            gptid/d37fec24-0c8f-11ee-bd1d-1402ec895b28  ONLINE       0     0 0
            gptid/d36913ed-0c8f-11ee-bd1d-1402ec895b28  ONLINE       0     0 0
            gptid/d36ac4f1-0c8f-11ee-bd1d-1402ec895b28  ONLINE       0     0 0
            gptid/d3896404-0c8f-11ee-bd1d-1402ec895b28  ONLINE       0     0 0
            gptid/d36c2110-0c8f-11ee-bd1d-1402ec895b28  ONLINE       0     0 0
            gptid/d384c3ec-0c8f-11ee-bd1d-1402ec895b28  ONLINE       0     0 0
            gptid/d37d3227-0c8f-11ee-bd1d-1402ec895b28  ONLINE       0     0 0
            gptid/d3825afe-0c8f-11ee-bd1d-1402ec895b28  ONLINE       0     0 0

errors: Permanent errors have been detected in the following files:

        Vault:<0x88129>
        Vault:<0x2bc>
        Vault:<0x1bd>
        Vault:<0x881be>

  pool: boot-pool
 state: ONLINE
  scan: scrub repaired 0B in 00:00:12 with 0 errors on Sun Sep 10 03:45:12 2023
config:

        NAME        STATE     READ WRITE CKSUM
        boot-pool   ONLINE       0     0     0
          ada0p2    ONLINE       0     0     0

errors: No known data errors


This somehow got lost before replying
 
Joined
Jan 7, 2015
Messages
1,155
Lots of times it will have files there and a path to the offending path. That looks like a block addresses thats hosed up. Id try to clear any zfs errors as such and do a rescrub to verify.
 

NugentS

MVP
Joined
Apr 16, 2020
Messages
2,947
The normal repsonse to errors like that is that they are metadata errors, and unrecoverable. Thus requiring a destroy pool and restore from backup.
However there is normally a reason for them - and currently I don't see one.

Are you (the NAS) on a UPS?
Has there been a power event that could have corrupted the pool?
Does the system uptime (on the dashboard) show a figure that you think is reasonable?

What PSU are you using?

I suggest running a memtest - for at least 1 pass, preferably more
Oh, and make sure you have a backup - you do have a backup don't you?
 

planedrop

Dabbler
Joined
Jun 28, 2021
Messages
26
I think what @NugentS is asking are all the right questions, seems like maybe a power loss event happened or something along those lines.
 
Top