Hi,
I have a 12 disk Z2 pool made up of 6 Seagate Ironwolf 4TB and 6 WD RED 4TB. I have a scheduled scrub every 2 weeks, along with frequent long and short SMART tests. No warning was ever given before this issue happened. I had noticed a scheduled scrub happening at the time a single drive went offline, no other drives dropped out at this time, but I had to abort the scrub shortly after by shutting down the system through the web GUI. Normally this system is up 24/7, but I wasn't going to have access to the system for the next few days and I didn't want to leave it running in this state it was currently in.
I have an encrypted pool that was setup initially when I created it, operating without issue until now. After this issue happened I am receiving the error [EFAULT] Pool could not be imported: 5 devices failed to decrypt. I have gone through my SMART logs and have noticed some troubling errors on some disks, specifically the RAW READ ERROR RATE and MULTI ZONE ERROR RATE. All 6 of the Seagate disks have the RAW READ ERROR RATE, and one of the WD disks has the MULTI ZONE ERROR RATE. These drives were all purchased new, from a few different retailers, and they were checked before putting into service using a burn in tool found on these forums. There was no warning given by the system anything was wrong before this incident happened. I have email alerts setup and verified working.
About the system: It is a IBM x3630 M3 with 48GB of ECC memory, dual power supplies, and two Xeon CPUS. It was originally running FreeNAS, before upgrading to TrueNAS-12.0-U4 a few months maybe, before this incident.
At this point, I do not know how to recover from this. I find it unusual and unlikely all 6 Seagate Ironwolf disks would go bad at the same time, but this is what I am seeing. Any suggestions on what to do or additional information I can provide to help diagnose the issue would be greatly appreciated.
I have a 12 disk Z2 pool made up of 6 Seagate Ironwolf 4TB and 6 WD RED 4TB. I have a scheduled scrub every 2 weeks, along with frequent long and short SMART tests. No warning was ever given before this issue happened. I had noticed a scheduled scrub happening at the time a single drive went offline, no other drives dropped out at this time, but I had to abort the scrub shortly after by shutting down the system through the web GUI. Normally this system is up 24/7, but I wasn't going to have access to the system for the next few days and I didn't want to leave it running in this state it was currently in.
I have an encrypted pool that was setup initially when I created it, operating without issue until now. After this issue happened I am receiving the error [EFAULT] Pool could not be imported: 5 devices failed to decrypt. I have gone through my SMART logs and have noticed some troubling errors on some disks, specifically the RAW READ ERROR RATE and MULTI ZONE ERROR RATE. All 6 of the Seagate disks have the RAW READ ERROR RATE, and one of the WD disks has the MULTI ZONE ERROR RATE. These drives were all purchased new, from a few different retailers, and they were checked before putting into service using a burn in tool found on these forums. There was no warning given by the system anything was wrong before this incident happened. I have email alerts setup and verified working.
About the system: It is a IBM x3630 M3 with 48GB of ECC memory, dual power supplies, and two Xeon CPUS. It was originally running FreeNAS, before upgrading to TrueNAS-12.0-U4 a few months maybe, before this incident.
At this point, I do not know how to recover from this. I find it unusual and unlikely all 6 Seagate Ironwolf disks would go bad at the same time, but this is what I am seeing. Any suggestions on what to do or additional information I can provide to help diagnose the issue would be greatly appreciated.