TN Version:TrueNAS-13.0-U2
CPU: 2 x Intel(R) Xeon(R) CPU E5-2697 v2 @ 2.70GHz
Motherboard: Dell PowerEdge R720 16-Bay 2.5" 2U
Video: Whatever comes on that board
Boot: SAMSUNG SSD SM841N 256GB connected to motherboard.
RAID: 16x Samsung SSD 870 3.64 TiB
RAM: 12x 64GB ECC - not sure on brand it was selected to max the ram from the online shop we got it from. 768 GB total
PERC H310 controller reflashed for IT mode (passthrough)
Setup - 2 RAIDZ2 vdevs mirrored (hard to recall what I did, but think it mirrored. Doesn't say on the ui that I'm seeing) - for like 40 TB or something ish.
What happens:
Seems fine for at least a week. But over time something happens that causes it to "page fault" and reboot when clicking the rollback buttons on a snapshot for a zvol.
We run 3 vm server hosts with 8 or so vm's on each talking iscsi back to the shared zvols on truenas. The truenas has 2 backups nas' being streamed also.
Everything is fine usually. But after a month or an unknown amount of time. If you go in to truenas and attempt a rollback on a zvol disk, what you end up with instead is a reboot of the entire nas. -_- and everything goes down for 10 minutes.
Immediately following the reboot, the rollback looks like it worked. Typically I need to go back further because it either didn't actually work or I didn't go back far enough...sometimes hard to tell. But rollbacks work fine after that for at least a week or so.
Anyway, we just bought this setup as an upgrade from basically the same setup but with smaller Dell SSD's and an unflashed H310 that was the correct version and didn't need to be flashed. It had the same issue. Rollbacks cause the "page fault" error left in the crash logs and a reboot occurs.
We setup the new server as a backup and then made it the live. So it was a fresh install but the config was restored.
Anyone heard of this or experienced it? Fixes? Thoughts? Should I post some of these crash logs somehow? Let me know. I'm fearful of rollbacks now that this has happened 3 times on 2 different hardwares. (Not super different, I mean they were both dell 720's)
CPU: 2 x Intel(R) Xeon(R) CPU E5-2697 v2 @ 2.70GHz
Motherboard: Dell PowerEdge R720 16-Bay 2.5" 2U
Video: Whatever comes on that board
Boot: SAMSUNG SSD SM841N 256GB connected to motherboard.
RAID: 16x Samsung SSD 870 3.64 TiB
RAM: 12x 64GB ECC - not sure on brand it was selected to max the ram from the online shop we got it from. 768 GB total
PERC H310 controller reflashed for IT mode (passthrough)
Setup - 2 RAIDZ2 vdevs mirrored (hard to recall what I did, but think it mirrored. Doesn't say on the ui that I'm seeing) - for like 40 TB or something ish.
What happens:
Seems fine for at least a week. But over time something happens that causes it to "page fault" and reboot when clicking the rollback buttons on a snapshot for a zvol.
We run 3 vm server hosts with 8 or so vm's on each talking iscsi back to the shared zvols on truenas. The truenas has 2 backups nas' being streamed also.
Everything is fine usually. But after a month or an unknown amount of time. If you go in to truenas and attempt a rollback on a zvol disk, what you end up with instead is a reboot of the entire nas. -_- and everything goes down for 10 minutes.
Immediately following the reboot, the rollback looks like it worked. Typically I need to go back further because it either didn't actually work or I didn't go back far enough...sometimes hard to tell. But rollbacks work fine after that for at least a week or so.
Anyway, we just bought this setup as an upgrade from basically the same setup but with smaller Dell SSD's and an unflashed H310 that was the correct version and didn't need to be flashed. It had the same issue. Rollbacks cause the "page fault" error left in the crash logs and a reboot occurs.
We setup the new server as a backup and then made it the live. So it was a fresh install but the config was restored.
Anyone heard of this or experienced it? Fixes? Thoughts? Should I post some of these crash logs somehow? Let me know. I'm fearful of rollbacks now that this has happened 3 times on 2 different hardwares. (Not super different, I mean they were both dell 720's)