kurtc
Dabbler
- Joined
- Dec 17, 2017
- Messages
- 39
We have a recently deployed FreeNAS 11.1 system that is serving up data to a single Windows host over iSCSI from zvol extents/targets. The original NIC was an Intel X520-DA2 with 10G-SR fiber connection through an HP 2920 switch to the host. We were seeing that after heavy load for 20-45 minutes (4-5Gbit per second of continuous sequential iSCSI traffic), the connection would completely drop and both the FreeNAS system and the HP 2920 switch would log the interface going offline and then shortly thereafter coming back online. Almost like the NIC driver crashed. This obviously wreaks havoc with the VM running on that host when the storage is ripped out from underneath it.
Our first troubleshooting steps were to replace fiber cables, then SFP+ modules at both ends, then the 2920 Switch (there are two stacked to choose from), and finally replacing the NIC altogether with an X710-DA2 that we also had in stock. We can repeat the issue on demand within that referenced time window. We tried switching media from fiber to copper and replacing the NIC with an Intel X540-T2 into the same switches, with the same behavior. I don't have any other brands like Chelsio on hand to test with. We also tried disabling TSO and the drop/reset/whatever is happening, still happens.
It almost seems like this issue referenced here https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919 but I don't know enough about .diff's and compiling to even try it.
Any help would be greatly appreciated.
Thanks,
Kurt
Our first troubleshooting steps were to replace fiber cables, then SFP+ modules at both ends, then the 2920 Switch (there are two stacked to choose from), and finally replacing the NIC altogether with an X710-DA2 that we also had in stock. We can repeat the issue on demand within that referenced time window. We tried switching media from fiber to copper and replacing the NIC with an Intel X540-T2 into the same switches, with the same behavior. I don't have any other brands like Chelsio on hand to test with. We also tried disabling TSO and the drop/reset/whatever is happening, still happens.
It almost seems like this issue referenced here https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=221919 but I don't know enough about .diff's and compiling to even try it.
Any help would be greatly appreciated.
Thanks,
Kurt