Rendermandan
Dabbler
- Joined
- Sep 17, 2022
- Messages
- 20
I'm still trying to get my system up and running, but I keep hitting errors and issues and could use some advice. I'm using TrueNAS Scale and I'm not very familiar with everything, but I'm learning. So please go easy on me. :)
I'm getting random disk read I/O errors on both the boot pool and the RAIDZ2 storage. They're random, but they usually lock the system up. A reboot seems to fix the issue for a few hours, but then the issues come back, usually not with the exact same error. I have been able to set up Samba shares and start copying my movie files to the pool, but that is when the errors start occurring. I've tried to research this as much as I can and tried different solutions, but I haven't had much luck.
Sometimes when I reboot and go to Disks, it shows write errors but no checksum errors. Other times I see checksum errors, and then after a reboot they disappear. Maybe I'm just not understanding something.
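Since the counters seem to vanish after a reboot, one way to keep a record of them would be to parse the device table from `zpool status` and save the non-zero rows somewhere. This is only a minimal sketch, assuming the usual NAME / STATE / READ / WRITE / CKSUM column layout. The sample text and the pool and device names in it are made up for illustration and are not from my system.

```python
import re

# One device row of the `zpool status` config table, assumed to be laid out as:
#   NAME  STATE  READ  WRITE  CKSUM
ROW = re.compile(
    r"^\s*(\S+)\s+(ONLINE|DEGRADED|FAULTED|OFFLINE|UNAVAIL|REMOVED)"
    r"\s+(\S+)\s+(\S+)\s+(\S+)"
)

def parse_counters(status_text):
    """Return {device_name: (read, write, cksum)} for every device row.

    Counters are kept as strings because large values can be abbreviated
    (for example with a K suffix).
    """
    counters = {}
    for line in status_text.splitlines():
        match = ROW.match(line)
        if match:
            name, _state, read, write, cksum = match.groups()
            counters[name] = (read, write, cksum)
    return counters

def devices_with_errors(counters):
    """Keep only the devices where at least one counter is not "0"."""
    return {name: c for name, c in counters.items() if any(v != "0" for v in c)}

# Hypothetical sample output, for illustration only.
SAMPLE = """
  pool: tank
 state: ONLINE
config:

	NAME        STATE     READ WRITE CKSUM
	tank        ONLINE       0     0     0
	  raidz2-0  ONLINE       0     0     0
	    sda     ONLINE       0     3     0
	    sdb     ONLINE       0     0     0
"""

if __name__ == "__main__":
    print(devices_with_errors(parse_counters(SAMPLE)))
```

On a live system the text could come from something like `subprocess.run(["zpool", "status"], capture_output=True, text=True).stdout` instead of the sample string, run on a schedule so the history survives a lockup.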
So far, here is what I've tried:
The original set of RAIDZ2 disks were SMR. They were old and some failed SMART tests. At first I thought those were the problem, so I changed those out.
Swapped the brand new mirrored SSD boot disks for a new SSD, then reinstalled Scale and set up the pool again. Still got random boot I/O errors, so I went back to the mirrored SSDs.
Purchased an entire set of new CMR 6TB Red drives for the array. Ran SMART tests on each disk: long, short and conveyance. All came back successful.
Ran SMART tests on the boot SSDs. All came back successful.
Checked all cable and drive connections.
Reseated the controller card.
Reseated the memory.
Cleared the controller card and reflashed it to IT mode.
Deleted my pool and made a new pool with two vdevs instead of one large one, both RAIDZ2.
Lastly, I restored the firmware on the controller and installed Ubuntu Server. Everything ran fine, but getting Samba sharing working on it is just too much work and is a HUGE pain in the ass, so I'd really like to use TrueNAS Scale.
The only thing I haven't tried yet is a new controller. Would that even help? I figured it couldn't hurt, but I really can't afford to keep throwing money at this!!!
I'm running Memtest86 as I type. So far there are no errors, but with 192GB of RAM it's taking a long time, so I can't pull any error logs right now. Sorry.
Here is my current system hardware:
Dell R720XD with dual processors.
192GB ECC RAM: (8) 8GB sticks and (8) 16GB sticks, running in optimized mode.
Mirrored Kingston SSD boot drives in the backplane.
12 brand new 6TB CMR WD Red Plus hard drives.
H710P Mini monolithic controller in IT mode.
Dual 750 W PSUs.
TrueNAS Scale 22.02.4
I really appreciate any help! Thanks.