Replacing failed boot device

Status
Not open for further replies.

Revolution

Dabbler
Joined
Sep 8, 2015
Messages
39
Hello everyone,

Tonight, while scrubbing my boot pool, I got a message that one of my boot devices has 3 checksum errors.
How can I determine which one of my boot drives is the failing one? I have one 4GB and one 16GB flash drive.
They are from different manufacturers, so I would see the difference on the label as well. With normal drives I would look at the S/N, but with these I'm not sure.


Greetings
 

joeschmuck

Old Man
Moderator
Joined
May 28, 2011
Messages
10,996
You have a 4GB boot flash? What version of FreeNAS are you running?

EDIT: Also, post the email output.
 

Revolution

Dabbler
Joined
Sep 8, 2015
Messages
39
Yes, the 4GB one was my first boot device. Then I added the second one to mirror it. The latest FreeNAS version is running smoothly.

This is the first email:
Code:
 The boot volume state is ONLINE: One or more devices has experienced an unrecoverable error. An attempt was made to correct the error. Applications are unaffected.


And this is the second one:
Code:
Checking status of zfs pools:
NAME           SIZE  ALLOC   FREE  EXPANDSZ   FRAG    CAP  DEDUP  HEALTH  ALTROOT
Alles         10.9T  8.42T  2.46T         -    25%    77%  1.00x  ONLINE  /mnt
freenas-boot  3.72G  1.47G  2.25G         -      -    39%  1.00x  ONLINE  -

  pool: freenas-boot
 state: ONLINE
status: One or more devices has experienced an unrecoverable error.  An
	attempt was made to correct the error.  Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
	using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://illumos.org/msg/ZFS-8000-9P
  scan: scrub repaired 225K in 0h13m with 0 errors on Sat Jan  9 03:58:39 2016
config:

	NAME                                            STATE     READ WRITE CKSUM
	freenas-boot                                    ONLINE       0     0     0
	  mirror-0                                      ONLINE       0     0     0
	    da0p2                                       ONLINE       0     0     3
	    gptid/46649c00-a914-11e5-a3b3-d050997979c3  ONLINE       0     0     0

errors: No known data errors

-- End of daily output --
 

joeschmuck

Old Man
Moderator
Joined
May 28, 2011
Messages
10,996
I'd run a scrub on your freenas-boot pool again since it's still online. The suspect device is likely the 4GB one, because it was the first one installed and probably the one your system was configured to boot from. Of course, you could just check the size of the da0 drive and match it up with the device.
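
For anyone following along, the member to chase is whichever one shows a non-zero count in the CKSUM column of the status output. A rough sketch of pulling that out with awk is below. The function name cksum_errors is made up for illustration, and it assumes the usual five-column NAME/STATE/READ/WRITE/CKSUM layout seen in the email above.

```shell
# Hypothetical helper: list pool members with a non-zero CKSUM count.
# Reads `zpool status` output on stdin and assumes the usual
# NAME STATE READ WRITE CKSUM column layout.
cksum_errors() {
    # Keep only 5-field rows whose last field is a plain number above zero,
    # then print the device name and its checksum error count.
    awk 'NF == 5 && $5 ~ /^[0-9]+$/ && $5 + 0 > 0 { print $1, $5 }'
}

# On the NAS itself: zpool status freenas-boot | cksum_errors
```

With the status output posted above, this should leave only the da0p2 line, which is the device name to match against a physical stick.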
 

Revolution

Dabbler
Joined
Sep 8, 2015
Messages
39
I ran another scrub and this is the zpool status:
Code:
  pool: freenas-boot
 state: ONLINE
status: One or more devices has experienced an unrecoverable error.  An
        attempt was made to correct the error.  Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
        using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://illumos.org/msg/ZFS-8000-9P
  scan: scrub repaired 224K in 0h22m with 0 errors on Sun Jan 10 20:21:01 2016
config:

        NAME                                            STATE     READ WRITE CKSUM
        freenas-boot                                    ONLINE       0     0     0
          mirror-0                                      ONLINE       0     0     0
            da0p2                                       ONLINE       0     0     7
            gptid/46649c00-a914-11e5-a3b3-d050997979c3  ONLINE       0     0     0

errors: No known data errors


How can I see the size of the stick?
 

joeschmuck

Old Man
Moderator
Joined
May 28, 2011
Messages
10,996
In a shell type "gpart show | more" and it will list all the drive sizes, including partition info. Odds are the two boot devices will be the first listed.

Or you could type this instead (although it gives less information than the first one):
Code:
egrep 'da[0-9]' /var/run/dmesg.boot | more
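
If the full gpart listing is too noisy, something like the sketch below trims it to one line per disk. The function name disk_sizes is made up for illustration, the sample line in the comment uses illustrative numbers, and it assumes the usual gpart show layout where each disk's summary line starts with "=>".

```shell
# Hypothetical helper: print each disk and its size from `gpart show`.
# Summary lines start with "=>" and look roughly like this
# (numbers are illustrative):
#   =>  34  7831485  da0  GPT  (3.7G)
disk_sizes() {
    # Field 4 is the device name, field 6 the human-readable size.
    # Strip the parentheses around the size before printing.
    awk '$1 == "=>" { gsub(/[()]/, "", $6); print $4, $6 }'
}

# On the NAS itself: gpart show | disk_sizes
```

The second mirror member is listed by gptid rather than by device name, so you may also want "glabel status", which should show which daXpY partition that gptid belongs to.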
 

ethereal

Guru
Joined
Sep 10, 2012
Messages
762
gpart show

this should be what you need

pesky joeschmuck
 

joeschmuck

Old Man
Moderator
Joined
May 28, 2011
Messages
10,996
And I'm hardly on the internet today. Now it's time to watch the Redskins either fail or win. I figure they will fail.
 

Revolution

Dabbler
Joined
Sep 8, 2015
Messages
39
That is what I needed! Thanks! And yes, it is the small 4GB one that has the checksum errors. New USB sticks are on the way.
 

joeschmuck

Old Man
Moderator
Joined
May 28, 2011
Messages
10,996
Just messing with ethereal.

Glad you got it figured out though.
 