anubiousL2596^!
Cadet
- Joined
- May 8, 2020
- Messages
- 7
Hello.
I have (2) Freenas boxes. One box has 40 TB and the other 80 TB. Performance when its working is amazing.
Lately, the main NAS's (the 80 TB 2630v2 with 32 GB of RAM) NFS/SMB access will lock up and the box cannot be reset or powered off. I can still log on to the freenas and I can browse all of my data sets.... I do not understand what's going on or even how to trouble shoot it. Again, NFS/SMB/SYSLOG/DMESG/Terminal screen put out no log files or even a hint of an issue. I cannot even predict when the problem starts because its random, always happens during the night, and usually when im pushing ~500MB/s traffic through it. The interface is a Mellanox ConnectX-3 with 40Gb fiber. I tried upgrading the host to 64GB ram and the problem still occurs.
- I tried switching the NIC out for another known working ConnectX3 40Gb. No dice.
- I tried switching the nic to a quad port Intel NIC and that made problems worse (the system was not stable unless all types of offloading were disabled or else the driver would crash). This brought my throughput from ~8Gbps to about ~983Mbps. I've also order a quad port chelsio to replace this just in case. The Mellanox adapter remains till then.
- The troubled nas is in a completely closed VLAN with jumbo mtu's enabled and verified working. This vlan has no route point and is only meant as a storage backbone for servers to access the NAS's.
- I completely wiped freenas and started from a fresh install on fresh enterprise SSD's. problem persists.
- I removed all of my 40Gb related sysctl tunings. No dice.
This happened all started happening when I selected "Upgrade ZFS pool", which I will never do again. The other box (who did not have there ZFS upgraded) is behaving as expected. I'm at my wits end with this.
Please help. Or if you want to tell me to RTFM for this problem, please send me the URL and I'll look right away.
I have (2) Freenas boxes. One box has 40 TB and the other 80 TB. Performance when its working is amazing.
Lately, the main NAS's (the 80 TB 2630v2 with 32 GB of RAM) NFS/SMB access will lock up and the box cannot be reset or powered off. I can still log on to the freenas and I can browse all of my data sets.... I do not understand what's going on or even how to trouble shoot it. Again, NFS/SMB/SYSLOG/DMESG/Terminal screen put out no log files or even a hint of an issue. I cannot even predict when the problem starts because its random, always happens during the night, and usually when im pushing ~500MB/s traffic through it. The interface is a Mellanox ConnectX-3 with 40Gb fiber. I tried upgrading the host to 64GB ram and the problem still occurs.
- I tried switching the NIC out for another known working ConnectX3 40Gb. No dice.
- I tried switching the nic to a quad port Intel NIC and that made problems worse (the system was not stable unless all types of offloading were disabled or else the driver would crash). This brought my throughput from ~8Gbps to about ~983Mbps. I've also order a quad port chelsio to replace this just in case. The Mellanox adapter remains till then.
- The troubled nas is in a completely closed VLAN with jumbo mtu's enabled and verified working. This vlan has no route point and is only meant as a storage backbone for servers to access the NAS's.
- I completely wiped freenas and started from a fresh install on fresh enterprise SSD's. problem persists.
- I removed all of my 40Gb related sysctl tunings. No dice.
This happened all started happening when I selected "Upgrade ZFS pool", which I will never do again. The other box (who did not have there ZFS upgraded) is behaving as expected. I'm at my wits end with this.
Please help. Or if you want to tell me to RTFM for this problem, please send me the URL and I'll look right away.