Problem replacing a bad drive with a new one in the same position

Status
Not open for further replies.

benamira

Explorer
Joined
Oct 12, 2011
Messages
61
Hi,
I have a problem trying to replace a hard drive that shows a SMART error with a new one in the same position.
Here you have the pictures of the config before replacing.
Captura de pantalla 2011-12-13 a la(s) 15.21.22.jpg

Captura de pantalla 2011-12-13 a la(s) 15.21.40.jpg

Here is the output from the cli: zpool status NFS

pool: NFS
state: DEGRADED
status: One or more devices could not be opened. Sufficient replicas exist for
the pool to continue functioning in a degraded state.
action: Attach the missing device and online it using 'zpool online'.
see: http://www.sun.com/msg/ZFS-8000-2Q
scrub: scrub completed after 9h9m with 0 errors on Tue Dec 13 12:10:50 2011
config:

NAME STATE READ WRITE CKSUM
NFS DEGRADED 0 0 0
raidz1 DEGRADED 0 0 0
gptid/127a73df-f4eb-11e0-a31f-f46d0473b996 UNAVAIL 3 1.15M 17 cannot open
gptid/132699fa-f4eb-11e0-a31f-f46d0473b996 ONLINE 0 0 0
gptid/13dddd61-f4eb-11e0-a31f-f46d0473b996 ONLINE 0 0 0
gptid/148d9aa8-f4eb-11e0-a31f-f46d0473b996 ONLINE 0 0 0
gptid/153d15cd-f4eb-11e0-a31f-f46d0473b996 ONLINE 0 0 0
gptid/15f92898-f4eb-11e0-a31f-f46d0473b996 ONLINE 0 0 0

It seems no data loss and raidz1 is working as expected, but when trying to replace the issues come in.
I shutdown the system, pull out the bad drive, and then insert in the same position the new one (same manufacturer, same size) because i dont have more ports available.
My guess was that i would be able to rebuild the volume with the new drive, but it seems it detects the new drive inside the Volume again, and each old drive takes a new name like: the current ada 1 becomes ada0, ada2 becomes ada 1 and so on.
I am not sure to do a "zpool replace NFS ada0" because ada0 now is one of the good drives, so i will miss data, wont i?

Any ideas about the procedure to follow to replace this drive??

Thanks a lot
 

benamira

Explorer
Joined
Oct 12, 2011
Messages
61
To give you more info. Of course i does not work gointg through the GUI because no option to Replace "IN PLACE". It asks for a member that does not exists...

After replacing the drive with the new one this is what i obatined after a zpool status:


pool: NFS
state: DEGRADED
status: One or more devices could not be opened. Sufficient replicas exist for
the pool to continue functioning in a degraded state.
action: Attach the missing device and online it using 'zpool online'.
see: http://www.sun.com/msg/ZFS-8000-2Q
scrub: none requested
config:

NAME STATE READ WRITE CKSUM
NFS DEGRADED 0 0 0
raidz1 DEGRADED 0 0 0
3482949819918333012 UNAVAIL 0 0 0 was /dev/gptid/127a73df-f4eb-11e0-a31f-f46d0473b996
gptid/132699fa-f4eb-11e0-a31f-f46d0473b996 ONLINE 0 0 0
gptid/13dddd61-f4eb-11e0-a31f-f46d0473b996 ONLINE 0 0 0
gptid/148d9aa8-f4eb-11e0-a31f-f46d0473b996 ONLINE 0 0 0
gptid/153d15cd-f4eb-11e0-a31f-f46d0473b996 ONLINE 0 0 0
gptid/15f92898-f4eb-11e0-a31f-f46d0473b996 ONLINE 0 0 0

Any suggestions??
 

William Grzybowski

Wizard
iXsystems
Joined
May 27, 2011
Messages
1,754
What if you go to Views Disks, click in Replace for the one that shows serial Unknown and select "ada0", wont that fix it?
 

benamira

Explorer
Joined
Oct 12, 2011
Messages
61
:-( i'm afraid i cant.
No option available.
If i click Replace i obtained a pop-up "Disk Replacement"
Replacing Disk None
Member Disk has no options in the drop-down menu.
I only can click in Replace Disk, but it says Member disk : This field is required.

So i thought to do in the CLI, but the problem is now if i see the drives the old ada1 is now ada0. See picture:

Captura de pantalla 2011-12-13 a la(s) 19.51.04.jpg

so i dont know if it is a good idea to do: "zpool replace NFS ada0", do u know what i mean??
Also the new drive "seems" to have the same uuid as the old one.
I am really confused and worried about a possible data loss if another drive fails before rebuildind the raidz1.

Thanks for your support.
 

William Grzybowski

Wizard
iXsystems
Joined
May 27, 2011
Messages
1,754
Ok, paste the output of "glabel status" and "sysctl kern.disks"

I think the system is not detecting the new drive
 

benamira

Explorer
Joined
Oct 12, 2011
Messages
61
Ok, paste the output of "glabel status"

Name Status Components
ufs/FreeNASs3 N/A da0s3
ufs/FreeNASs4 N/A da0s4
ufs/FreeNASs1a N/A da0s1a
gptid/132699fa-f4eb-11e0-a31f-f46d0473b996 N/A ada0p2
gptid/13dddd61-f4eb-11e0-a31f-f46d0473b996 N/A ada1p2
gptid/148d9aa8-f4eb-11e0-a31f-f46d0473b996 N/A ada2p2
gptid/153d15cd-f4eb-11e0-a31f-f46d0473b996 N/A ada3p2
gptid/15f92898-f4eb-11e0-a31f-f46d0473b996 N/A ada4p2



and "sysctl kern.disks"

kern.disks: ada4 ada3 ada2 ada1 ada0 da0

I think the system is not detecting the new drive

It seems you are right, what can i do??

thanks again...
 

William Grzybowski

Wizard
iXsystems
Joined
May 27, 2011
Messages
1,754
I don't know, did you take a look at the cables? are you sure the disk is spinning? reboot the machine? tested on another hardware? any weird messages on dmesg?
 

benamira

Explorer
Joined
Oct 12, 2011
Messages
61
Ok, just to update with the last info:
The "new" drive was also broken (bad luck), so i replaced again with a new one and now everithing is working as expected:


zpool status -v


pool: NFS
state: DEGRADED
status: One or more devices is currently being resilvered. The pool will
continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
scrub: resilver in progress for 1h53m, 21.32% done, 6h57m to go
config:

NAME STATE READ WRITE CKSUM
NFS DEGRADED 0 0 0
raidz1 DEGRADED 0 0 0
replacing DEGRADED 0 0 0
3482949819918333012 UNAVAIL 0 0 0 was /dev/gptid/127a73df-f4eb-11e0-a31f-f46d0473b996
ada0p2 ONLINE 0 0 0 280G resilvered
gptid/132699fa-f4eb-11e0-a31f-f46d0473b996 ONLINE 0 0 0
gptid/13dddd61-f4eb-11e0-a31f-f46d0473b996 ONLINE 0 0 0
gptid/148d9aa8-f4eb-11e0-a31f-f46d0473b996 ONLINE 0 0 0
gptid/153d15cd-f4eb-11e0-a31f-f46d0473b996 ONLINE 0 0 0
gptid/15f92898-f4eb-11e0-a31f-f46d0473b996 ONLINE 0 0 0

errors: No known data errors


So when it finish, do i have to do anything else like detach the old drive or reboot or....
 

peterh

Patron
Joined
Oct 19, 2011
Messages
315
how about "camcontrol rescan all"
( use manpages at freebsd.org for details)

( this was added before user reported that a fresh drive solved the problem
camcontrol is included and working in freenas and might help solve problems
like these)
 

benamira

Explorer
Joined
Oct 12, 2011
Messages
61
Yes, i see it.
I am very confident it is going to work now, and my data is safe again. :smile:
Thank you very much for your help.
 
Status
Not open for further replies.
Top