Charles Rhoades
Dabbler
- Joined
- Oct 27, 2015
- Messages
- 34
I've just built a new server and have started the HDD burn-in tests listed here: https://forums.freenas.org/index.php?threads/how-to-hard-drive-burn-in-testing.21451/#post-124942
The drives have passed the Conveyance, Short, and Long SMART tests, and I've kicked off the badblocks testing. I have six Seagate 4TB NAS drives set up in a RAIDZ2 volume. I started the badblocks test for each drive in turn from the Shell and made it through ada4. When I entered the command for ada5, the Shell window disappeared, and I cannot reopen it. Looking at Reporting for the drives, all appear to be running fine except ada5. Shortly afterward I received an Alert System error report: "CRITICAL: The volume Vol1 (ZFS) state is ONLINE: One or more devices has experienced an unrecoverable error. An attempt was made to correct the error. Applications are unaffected."
I haven't stopped the testing yet. It has been running for about 6 hours, and I expect it to take a couple of days to complete. How should I proceed? Should I let the tests finish on the "good" drives, or stop the badblocks testing immediately, identify the problem drive(s), and start the RMA on the bad drive(s)? And how do I safely stop the testing if I cannot reopen the Shell?