mervincm
Contributor
- Joined
- Mar 21, 2014
- Messages
- 157
I am in a bit of a situation and I am not quite sure how to proceed.
I have a 6-disk RAIDZ1, and last week I noticed some disk errors on a single drive, about 10 in total. I thought, no problem, I have a spare disk on the shelf, so I can just replace it. Then I noticed that my backup was still running (I have an old Synology NAS that pulls files via rsync). I had made slight changes to terabytes' worth of files, so this was going to take days. There were zero errors on the data vdev overall, so I felt confident it was worth letting the backup complete.
After a couple of days, I came back and there were now hundreds of errors on all of the other disks (the same number of errors on each disk), still no errors on the vdev, and the data still seemed to be accessible. This looked more like a controller failure than 5 disks all happening to fail simultaneously with equal bad sector counts, so I took a look and found a fan failure. Knowing it would kill my backup and I would have to start that again from scratch, I still decided to power down, fix the fan, and move all 6 HDDs to my LSI controller.
After powering up, the pool resilvered, and when it was done I had 2 errors on each of the 5 disks and hundreds on the single drive that originally had the failure. Again, there were still 0 errors on the vdev. Wondering if I was at the end of it, I ran a manual scrub/resilver again. This time the count incremented to 4 errors per disk on the 5, with still 0 errors on the data vdev.
I have all the data backed up on my Synology via Active Backup for Business. Even if I can't complete this backup, I can tolerate the loss of the most recent changes. However, I have never tried a restore of this magnitude (40TB).
I have two spare disks of the same size (14TB) as the 6 in my data vdev.
I also have all of my most critical data replicated to the cloud (OneDrive).
Any thoughts on how I can get from where I am to a stable setup would be appreciated.