I'm having a strange issue - hoping someone smarter than me can figure it out:
I've had TrueNAS running on the same hardware for the past 3 years (specs below). Recently, I've been getting occasional checksum errors. They all appear to be corrected without issue, and a subsequent scrub finds no faults. More recently, the scrub started to hang during execution. After that, the system started rebooting during the scrub, with page faults in the dmesg log. When I start the system in safe mode, the scrub completes successfully, but as soon as I restart into normal mode, I get the same behavior: a hang during the scrub followed by a reboot. I have taken the NAS down, run memtest86 for 48 hours without issue, run stress for 24 hours without issue, and am currently finishing up a set of tests on the drives (sequential and random access), so far without issue.
I guess I'd like to know:
* What could be causing this behavior?
* What does BSD/TrueNAS do differently in Safe Mode vs Normal?
* How can I stop / correct / prevent this behavior?
My Hardware:
AMD Ryzen 5600
ASRock Rack X570D4U-2L2T
32 GB Micron ECC
8x Seagate IronWolf 8 TB 7200 RPM
I can provide logs or any other info on request. My current HD tests should be done by the end of 05/24/2023. Thanks for any and all help!
Best,
Chris