TrueNAS users, developers, et alia:
I have a use case whereby a user has a dedicated TrueNAS SCALE server (an IBM x3650 "server grade server" with 48GB of RAM, 2 Xeon 4-core processors, and 8 SAS drives) that serves NFS mounts to several smaller servers (NUCs) running Proxmox in a cluster. If the TrueNAS server crashes, the VMs can crash too (they are non-critical VMs), because the TrueNAS server is currently the only file server servicing the Proxmox cluster of NUCs. I am beginning to think there might be wisdom in replacing the TrueNAS server with a Ceph environment, seeing how the TrueNAS SCALE environment went belly up some 6 or so times in 1 day, and Ceph is a more distributed system that can survive any one server crashing in most cases. However, for the good of the TrueNAS SCALE community, I'd like to know what steps are recommended so I can determine whether this was a hardware-based problem rather than some bug I might have run into that is worthy of being remediated.
Generally speaking, I know how to troubleshoot Linux when it crashes, but I know TrueNAS (CORE and SCALE) is viewed as an appliance and is heavily modified, which is why I am posting this question.
Stuart