gravely (Cadet, joined Jul 4, 2016, 6 messages)
I'm running FreeNAS 11.1-U2 on an i3-4130 @ 3.4GHz with 28GB of memory and a 4x10TB raidz2 GELI-encrypted volume. Tunables are enabled.
I'm intermittently hitting a soft lockup in the Rancher VM that I installed using the UI per the docs. I've configured Rancher to mount my FreeNAS volumes over NFS in cloud-config.yml. I've assigned the VM all 4 CPUs (I know there are only 2 real cores; I've tried giving it 1, 2 and 4 with no luck) and 16GB of memory. I'm running 7 of the usual-suspect Docker containers: CrashPlan Pro, Plex, Transmission, etc., as well as the native Rancher Proxy stack w/ Let's Encrypt and HAProxy. I really like Rancher's Prometheus stack, but it can be taxing on the system, so I've left it disabled while troubleshooting this problem.
When I find that none of the containers are responding, I cu into the console, confirm that it's another soft lockup, and power cycle the VM. I've tried doing less in Rancher, such as keeping the aforementioned Rancher Prometheus stack disabled and also disabling CrashPlan Pro, with no luck. I'm unsure how to even troubleshoot which thread is locking the CPU in Rancher, what I can do to give the VM more resources, or how to have it recover from this state.
Asides, but possibly related: FreeNAS swaps more than I expected. With this much memory I would have thought it would never swap. Also, none of this was ever a problem for me under Corral, or under FreeNAS 11.0 when I managed my own Rancher installation using bhyve after following advice from other posters on this forum. This only started after migrating my 11.0 Rancher container configs to 11.1.
Thanks in advance for any help!