NAS failing - anything I can do?

marcelmmx

Cadet
Joined
Jan 7, 2019
Messages
5
I have a raidz1 pool with 3 1.5TB drives, where one of the drives has developed bad sectors and offline uncorrectable sectors. New drives are on the way, but my plan is to replace all 3 drives. To do that I would need to get as much data from the old pool as possible. Currently the NAS behaviour is erratic - I am usually able to copy about 1GB of data, then it slows down almost to zero and it takes several minutes before normal functionality resumes (this repeats). Sometimes I have to skip or retry a file to continue. The pool has launched resilvering several times from what I have seen, but the described behavior remains. Is there anything I can do to improve my chances of copying the data from my NAS or is prayer now my only option? ATM it looks like the resilver process is in a loop that does not want to end.

zpool status:

  pool: FREENAS
 state: ONLINE
status: One or more devices is currently being resilvered.  The pool will
        continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
  scan: resilver in progress since Mon Jan 7 10:22:12 2019
        2.87T scanned at 154M/s, 2.76T issued at 3.66G/s, 3.13T total
        6.59G resilvered, 88.07% done, 0 days 00:01:44 to go
config:

        NAME                                            STATE     READ WRITE CKSUM
        FREENAS                                         ONLINE       0     0     0
          raidz1-0                                      ONLINE       0     0     0
            gptid/098d1aeb-feb0-11e7-a1de-d050992b2099  ONLINE     114   143     3  (resilvering)
            gptid/0c378eea-feb0-11e7-a1de-d050992b2099  ONLINE       0     0     0
            gptid/0e260f52-feb0-11e7-a1de-d050992b2099  ONLINE       0     0     0  (resilvering)

errors: No known data errors

  pool: freenas-boot
 state: ONLINE
  scan: scrub repaired 0 in 0 days 00:00:28 with 0 errors on Mon Jan 7 03:45:38 2019
config:

        NAME          STATE     READ WRITE CKSUM
        freenas-boot  ONLINE       0     0     0
          ada2p2      ONLINE       0     0     0

errors: No known data errors
 

marcelmmx

Cadet
Joined
Jan 7, 2019
Messages
5
What is your hardware? You should copy your data somewhere else as soon as possible.
Build FreeNAS-11.1-U6.3
Platform Intel(R) Celeron(R) CPU J1900 @ 1.99GHz
Memory 7855MB
System Time Mon, 7 Jan 2019 11:19:58 +0100
Uptime 11:19AM up 1:41, 0 users
Load Average 0.11, 0.24, 0.33

It's a system I built myself and I am copying data, but at this rate it will take weeks. Should I stop the resilver process (which seems to be in an endless loop at the moment)? Or can I do something else to speed up the copy rate?
 

sremick

Patron
Joined
Sep 24, 2014
Messages
323
You seem to have 2 disks failing.
This is why RAID5/RAIDZ1 is a bad idea.

It seems the OP didn't catch his first failing drive (do you have SMART tests and notifications set up?) and now has a second drive failing and so no redundancy. High chance of losing some/all data.
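As a rough illustration of catching a failing drive early, here is a minimal sketch that scans saved `zpool status` text and flags any device whose READ, WRITE or CKSUM counter is not zero. The function name `devices_with_errors` and the idea of feeding it captured output are assumptions made for this example, not something FreeNAS provides, and it is no substitute for scheduled SMART tests and email alerts.

```python
import re

# One device row of `zpool status`: name, state, then the READ/WRITE/CKSUM counters.
# Counters are captured as text because large values can be abbreviated (e.g. "1.2K").
ROW = re.compile(
    r"^\s*(\S+)\s+(ONLINE|DEGRADED|FAULTED|OFFLINE|UNAVAIL|REMOVED)"
    r"\s+(\S+)\s+(\S+)\s+(\S+)"
)

def devices_with_errors(status_text):
    """Return (name, read, write, cksum) for each row with a non-zero counter."""
    flagged = []
    for line in status_text.splitlines():
        match = ROW.match(line)
        if match is None:
            continue  # header, blank, or non-device line
        name, _state, read, write, cksum = match.groups()
        if (read, write, cksum) != ("0", "0", "0"):
            flagged.append((name, read, write, cksum))
    return flagged
```

Run against the output posted above, this would flag only the first gptid device (114 read, 143 write, 3 checksum errors); the other two members report zeros.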
 

Bytales

Dabbler
Joined
Dec 17, 2018
Messages
31
1.5 TB drives? They must be really old, really shitty drives, no?
Personally I went with an enterprise drive that will hopefully last me a long time, a 10TB WD Gold. Supposedly better than plebeian drives; we'll have to wait and see.
You'd better get some good drives this time....
 

Chris Moore

Hall of Famer
Joined
May 2, 2015
Messages
10,079
1.5 TB drives? They must be really old, really crappy drives, no?
Personally I went with an enterprise drive that will hopefully last me a long time, a 10TB WD Gold. Supposedly better than plebeian drives; we'll have to wait and see.
You'd better get some good drives this time....
You would be much better off with a mirrored pair of less expensive drives. A single drive provides no redundancy and data can't be corrected if any corruption is detected. Redundancy is more important than higher cost drives.
 