Apple Eater
Cadet
- Joined
- Aug 30, 2016
- Messages
- 5
Hey all,
TL;DR: I have 4 of 12 disks dropping out of my RaidZ1+RaidZ1 striped volume simultaneously. Will I lose data if I keep mounting it and dropping disks? Any thoughts why they're dropping (disks are external via E-sata with independent power supply)?
First of all, thanks for taking the time to read my post. Unfortunately the detail I provide will be limited as my system is currently offline out of fear of data loss. It will remain offline until I am certain that continued attempts to diagnose will not result in data loss.
I have a FreeNAS system that has run rock solid for years. I started with a single RaidZ1 and later added another to add capacity and currently have 12 disks in a striped RaidZ1 configuration. Due to space within my case, 4 of these drives are in an external enclosure (via E-sata). Herein lies my problem.
I recently moved across the country. I powered my system up and it ran without complaint for about a week (apart from a quick resilver of a single drive). However, starting last night at around midnight, the 4 drives in my external storage enclosure disappeared. Given my configuration, this is obviously an unrecoverable fault. This morning, I rebooted things as safely as possible, remounted, entered my encryption passphrase and hey presto! Everything was back in place. Then about 5 minutes passed, the drives disappeared, and I had painfully dismounted my volume AGAIN. Hoping the third time would be the charm, I just recently tried again and experienced the same result.
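To spell out why I call this unrecoverable: a RaidZ1 vdev tolerates one missing disk, and a striped pool needs every vdev available. Here is a minimal sketch of that reasoning in Python. The function names are my own illustration (nothing here comes from FreeNAS or ZFS tooling), and it ignores the actual vdev sizes, which I have not listed.

```python
from itertools import product

RAIDZ1_TOLERANCE = 1  # a RAIDZ1 vdev survives the loss of at most one disk


def pool_survives(missing_per_vdev):
    """A striped pool stays available only if every vdev is within its tolerance."""
    return all(missing <= RAIDZ1_TOLERANCE for missing in missing_per_vdev)


def splits(total_missing, vdev_count):
    """Every way to distribute the missing disks across the vdevs."""
    return [
        combo
        for combo in product(range(total_missing + 1), repeat=vdev_count)
        if sum(combo) == total_missing
    ]


# 4 disks missing, 2 RAIDZ1 vdevs: check every possible distribution.
surviving = [split for split in splits(4, 2) if pool_survives(split)]
print(surviving)
```

No split of 4 missing disks across 2 vdevs leaves both vdevs with at most one disk gone, so the pool faults no matter which vdev the external drives belong to.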
My first concern: Is my data likely to be at risk from repeated "hard dismounts" of the volume? Intuitively I would say "No", but I doubt this is a use-case that is often tested. Are other members of the forum worried about continually unplugging drives, failing a volume, and adding them back? I want to be able to continue troubleshooting the issue, but if I am taking even a small risk of data loss, I'll figure out something else.
Second: Diagnosing the fault. The biggest change (in my mind) from the move is that I am currently running without my UPS (still in transit). The power supply for the external enclosure is separate from the system's main power supply (it has its own power brick). I worry that slight disruptions in the local grid, not significant enough to shut down the FreeNAS server but enough to drop power to the drives, may be to blame (I know this sounds far-fetched, but I'm having trouble eliminating other components). Everything else has remained the same. I have no reason to believe my E-sata card or cable has failed. All drives seem to fail at once, so drive problems are unlikely. Then again, things worked fine for about a week after arriving at my new home. I am absolutely open to other suggestions -- particularly things that are easy to eliminate!
Sorry for the large wall of text, and the lack of any useful details about system version, configuration, etc. Once I am certain I won't make the problem worse, I'll happily collect any data that is requested for troubleshooting purposes.
EDIT: I have some error messages related to the "dropped" drives that I screenshotted from the server console. Please see the attached file -- sorry about the limited data.