ZFS-8000-9P error

Status
Not open for further replies.

jal

Dabbler
Joined
Jan 12, 2015
Messages
14
Hello all,

I received my first error on my build.

Code:
# zpool status
  pool: freenas-boot
 state: ONLINE
  scan: scrub repaired 0 in 0h0m with 0 errors on Tue Nov 17 03:45:41 2015

config:


    NAME                                          STATE     READ WRITE CKSUM
    freenas-boot                                  ONLINE       0     0     0
      gptid/c2c02af1-e4db-11e4-803d-d05099642174  ONLINE       0     0     0



errors: No known data errors

  pool: void
 state: ONLINE
status: One or more devices has experienced an unrecoverable error.  An
    attempt was made to correct the error.  Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
    using 'zpool clear' or replace the device with 'zpool replace'.
  see: http://illumos.org/msg/ZFS-8000-9P
  scan: scrub repaired 0 in 16h15m with 0 errors on Tue Nov 17 16:15:40 2015

config:

    NAME                                                STATE     READ WRITE CKSUM
    void                                                ONLINE       0     0     0
      raidz2-0                                          ONLINE       0     0     0
        gptid/82201973-e78c-11e4-8165-d05099642174.eli  ONLINE       0     0     0
        gptid/82978f16-e78c-11e4-8165-d05099642174.eli  ONLINE       0     0     0
        gptid/830be023-e78c-11e4-8165-d05099642174.eli  ONLINE       0     0     1
        gptid/83812136-e78c-11e4-8165-d05099642174.eli  ONLINE       0     0     0
        gptid/83f5a381-e78c-11e4-8165-d05099642174.eli  ONLINE       0     0     0
        gptid/846a7b3a-e78c-11e4-8165-d05099642174.eli  ONLINE       0     0     0



errors: No known data errors


Since it is one checksum error, I'm tempted to clear and scrub again and see what happens. I have cold spares. I'm just unsure if I'm not taking this seriously enough or not.

Anyone have an opinion on how to proceed?

Thanks,

-j
 

Robert Trevellyan

Pony Wrangler
Joined
May 16, 2014
Messages
3,778
Since it is one checksum error, I'm tempted to clear and scrub again and see what happens.
Seems reasonable to me, but I would still take a look at the smartctl output for that drive.
 

jal

Dabbler
Joined
Jan 12, 2015
Messages
14
Thanks, Robert. I did end up clearing and scrubbing. Scrub finished early this morning, no errors. Have to chalk this one up to poorly aimed weather control rays.

And thanks for the reminder about smartctl - I did check and there were no reported problems.

My primary takeaway is once again wishing I had been patient enough to find an enclosure with hot swap bays while being happy I don't have to disassemble it right this minute.
 

Robert Trevellyan

Pony Wrangler
Joined
May 16, 2014
Messages
3,778
I assume you have SMART checks and SMART tests set up, with a tested email notification configuration?

Keep an eye on the drive for any changes in the number of pending, uncorrectable and reallocated sectors. If the counts start to creep up, consider replacing it.
 

Robert Trevellyan

Pony Wrangler
Joined
May 16, 2014
Messages
3,778
wishing I had been patient enough to find an enclosure with hot swap bays
It seems nice in theory, but I'm not convinced it's worth the added complexity and expense for a system with 6 drives. You really shouldn't have to replace a drive very often.
 

jal

Dabbler
Joined
Jan 12, 2015
Messages
14
It seems nice in theory, but I'm not convinced it's worth the added complexity and expense for a system with 6 drives. You really shouldn't have to replace a drive very often.

True. Part of that is that I'm so used to hot swap at work that I'm become little spoiled in that regard, so taking the machine apart to change a drive feels a bit like pulling the engine in a car to change the oil.
 
Status
Not open for further replies.
Top