Hi all,
We have a pair of FreeNAS Minis, made by iXsystems. One is primary, and the other one gets replicated to nightly . The current storage config on them is 4x6tb WD RED in a RAIDZ1, lz4 compression, no dedupe. We're running FreeNAS-9.3-STABLE-201512121950, and the Minis have 32GB of ram, and an Atom C2750. We had a disk go bad in one (the backup) and the resilver is taking a very long time (about five days so far):
zpool status
pool: backup
state: ONLINE
status: One or more devices is currently being resilvered. The pool will
continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
scan: resilver in progress since Tue Jan 5 17:45:18 2016
8.45T scanned out of 8.91T at 1.72M/s, 78h18m to go
2.11T resilvered, 94.80% done
Short SMART checks show the disks are good, and there have been no r/w/c errors during the rebuild. The CPU is loafing, and the load (as reported by top) is a pretty steady .5 . The system is using 6.5 TB and has 8.1 TB left.
Running iostat -xc 10 shows the %b (time busy?) on the three "good" disks as 100% a lot of the time, and the disk the is rebuilding is much lower. Does this indicate what I hope it doesn't, which is my disks are just working as fast they can, and there's nothing much I can do?
This isn't the first time a scrub or resilver has taken many days on one of these systems. Anyone have any insight into why, and what can be done to speed things up?
Warmest regards,
Jordan
We have a pair of FreeNAS Minis, made by iXsystems. One is primary, and the other one gets replicated to nightly . The current storage config on them is 4x6tb WD RED in a RAIDZ1, lz4 compression, no dedupe. We're running FreeNAS-9.3-STABLE-201512121950, and the Minis have 32GB of ram, and an Atom C2750. We had a disk go bad in one (the backup) and the resilver is taking a very long time (about five days so far):
zpool status
pool: backup
state: ONLINE
status: One or more devices is currently being resilvered. The pool will
continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
scan: resilver in progress since Tue Jan 5 17:45:18 2016
8.45T scanned out of 8.91T at 1.72M/s, 78h18m to go
2.11T resilvered, 94.80% done
Short SMART checks show the disks are good, and there have been no r/w/c errors during the rebuild. The CPU is loafing, and the load (as reported by top) is a pretty steady .5 . The system is using 6.5 TB and has 8.1 TB left.
Running iostat -xc 10 shows the %b (time busy?) on the three "good" disks as 100% a lot of the time, and the disk the is rebuilding is much lower. Does this indicate what I hope it doesn't, which is my disks are just working as fast they can, and there's nothing much I can do?
This isn't the first time a scrub or resilver has taken many days on one of these systems. Anyone have any insight into why, and what can be done to speed things up?
Warmest regards,
Jordan
Last edited: