Thomas NAS
Cadet
- Joined
- Jun 29, 2017
- Messages
- 3
Hi -
Using the FreeNAS-11-STABLE train and up to date, a box which had been working for years has suddenly started behaving very oddly.
Some time after boot, the console becomes non-responsive. Pressing 9 to get a shell will just display the nine ... and then nothing else happens. Around the same time, the box stops responding to SSH and to SMB requests.
Oddly, a Plex jail (installed with the current plug-in) continues to work just fine and a Centos VM also continues to work just fine. FWIW, the VM has disk mounted via NFS from the underlying machine and that is still readable/writable. It's almost like the FreeNAS instance ceases to be able to start new processes but existing ones are unaffected.
Any ideas here?
The box has been working with FreeNAS since ~2013, starting with 8 or so, continuing with Corral, earlier versions of FreeNAS 11. The only recent change was to add in a few additional hard discs and a controller card for same back into the box. These discs and the controller card had been in the box previously (circa 2015/early 2016) and were working perfectly under the production version of FreeNAS available at that time.
This has now happened twice. After being in this state for a few days, I shutdown the VM (either via SSH or VNC, I can't recall) and then hit the power button on the box a few minutes later. All ZFS pools came up on reboot without complaint. I chalked it up a cosmic ray or some similar one-off and hoped that was that. A few days later (after it had been up and running since that initial forced power off & reboot), the same symptoms have reappeared.
The only hint of a hardware problem has been sporadic complaints about one disc or another, but they're on the order of 1 per day per disc and this particular hardware has always displayed such errors. (Cheap SATA cables maybe? -- but it wasn't any different in the days and weeks before these symptoms started that it is now in the "working" period after a power off & reboot.)
Does anyone have advice on how I can get the system to provide useful info to diagnose the problem?
Thanks
Thomas
(Mods, please feel free to move if this isn't in the best place)
Using the FreeNAS-11-STABLE train and up to date, a box which had been working for years has suddenly started behaving very oddly.
Some time after boot, the console becomes non-responsive. Pressing 9 to get a shell will just display the nine ... and then nothing else happens. Around the same time, the box stops responding to SSH and to SMB requests.
Oddly, a Plex jail (installed with the current plug-in) continues to work just fine and a Centos VM also continues to work just fine. FWIW, the VM has disk mounted via NFS from the underlying machine and that is still readable/writable. It's almost like the FreeNAS instance ceases to be able to start new processes but existing ones are unaffected.
Any ideas here?
The box has been working with FreeNAS since ~2013, starting with 8 or so, continuing with Corral, earlier versions of FreeNAS 11. The only recent change was to add in a few additional hard discs and a controller card for same back into the box. These discs and the controller card had been in the box previously (circa 2015/early 2016) and were working perfectly under the production version of FreeNAS available at that time.
This has now happened twice. After being in this state for a few days, I shutdown the VM (either via SSH or VNC, I can't recall) and then hit the power button on the box a few minutes later. All ZFS pools came up on reboot without complaint. I chalked it up a cosmic ray or some similar one-off and hoped that was that. A few days later (after it had been up and running since that initial forced power off & reboot), the same symptoms have reappeared.
The only hint of a hardware problem has been sporadic complaints about one disc or another, but they're on the order of 1 per day per disc and this particular hardware has always displayed such errors. (Cheap SATA cables maybe? -- but it wasn't any different in the days and weeks before these symptoms started that it is now in the "working" period after a power off & reboot.)
Does anyone have advice on how I can get the system to provide useful info to diagnose the problem?
Thanks
Thomas
(Mods, please feel free to move if this isn't in the best place)