Device disappear after scrub. Need help understanding logs.

Status
Not open for further replies.

audix

Dabbler
Joined
Jun 11, 2011
Messages
36
I have been running Freenas 8.0.1-Beta2 about 9-10 days and everything has worked perfectly (as far as I have been able to tell ;) ). There is one volume/pool, 2 datasets, 6 new disks, RAIDZ2.

Today I started a scrub ("zpool scrub tank") for the first time. Now when I do a status check I see a warning.

When I look at the disks in the GUI there is no device name besides one disk and it says "Unknown" in the serial number place.

Trying to run a smartctl for the device gives "No such file or directory".

I need help to understand exactly what the problem is and how an error like this is supposed to be reported/observed to/by me during normal operations. I thought I would get an email or some other "apearant" error message somewhere.

- What has happened? (Part of disk failure, a complete disk failure, or...?)
- How and where should one look for error reporting of this kind of failure?
- I have planned to run a scrub once a week as a cronjob. Is that a good idea?
- How is smart-errors supposed to be checked automatically? smartd?
- What else do I need to ask? :confused:


Here is the output:
Code:
# zpool status tank
  pool: tank

 state: ONLINE

status: One or more devices has experienced an unrecoverable error.  An

	attempt was made to correct the error.  Applications are unaffected.

action: Determine if the device needs to be replaced, and clear the errors

	using 'zpool clear' or replace the device with 'zpool replace'.

   see: http://www.sun.com/msg/ZFS-8000-9P

 scrub: scrub completed after 5h40m with 0 errors on Fri Jun 17 20:24:03 2011

config:



	NAME                                            STATE     READ WRITE CKSUM

	tank                                            ONLINE       0     0     0

	  raidz2                                        ONLINE       0     0     0

	    gptid/53b16438-9369-11e0-a633-f46d042635ac  ONLINE       0     0     0

	    gptid/5405edb0-9369-11e0-a633-f46d042635ac  ONLINE       0     0     0

	    gptid/54671eb7-9369-11e0-a633-f46d042635ac  ONLINE       0     0     0

	    gptid/54caadbb-9369-11e0-a633-f46d042635ac  ONLINE      82  940K     0

	    gptid/5522ca41-9369-11e0-a633-f46d042635ac  ONLINE       0     0     0

	    gptid/558f1630-9369-11e0-a633-f46d042635ac  ONLINE       0     0     0



errors: No known data errors


The end of "# dmesg" looks like this:
Code:
# dmesg
< deleted a bunch of lines >
ata3: SIGNATURE: 00000101

ata3: SIGNATURE: ffffffff

ata3: timeout waiting to issue command

ata3: error issuing ATA_IDENTIFY command

ata3: SIGNATURE: ffffffff

(ada3:ata3:0:0:0): lost device

ata3: SIGNATURE: ffffffff

ata3: timeout waiting to issue command

ata3: error issuing ATA_IDENTIFY command

ata3: SIGNATURE: ffffffff

ata3: timeout waiting to issue command

ata3: error issuing ATA_IDENTIFY command

ata3: SIGNATURE: ffffffff

ata3: SIGNATURE: ffffffff

ata3: timeout waiting to issue command

ata3: error issuing ATA_IDENTIFY command

ata3: SIGNATURE: ffffffff

ata3: timeout waiting to issue command

ata3: error issuing ATA_IDENTIFY command

ata3: SIGNATURE: ffffffff

ata3: SIGNATURE: ffffffff

ata3: timeout waiting to issue command

ata3: error issuing ATA_IDENTIFY command

ata3: SIGNATURE: ffffffff

ata3: timeout waiting to issue command

ata3: error issuing ATA_IDENTIFY command

ata3: SIGNATURE: ffffffff

swap_pager: I/O error - pageout failed; blkno 1572875,size 20480, error 6

xptioctl: pass driver is not in the kernel

xptioctl: put "device pass" in your kernel config file



Thanks :)
 

William Grzybowski

Wizard
iXsystems
Joined
May 27, 2011
Messages
1,754
Scrubs are already scheduled for a 30 day basis...

Your disk probably completely failed..

Probably your disks were already presenting errors that smart would show, however in BETA2 there is no smart gui yet, and to receive some kind of alert you should put '-m youremail' in the 'extra smart options' for each disk GUI..
 
Status
Not open for further replies.
Top