Hi guys!
Sorry for the long post :)
My FreeNAS setup has been running for about a year now. Here are the details:
Build FreeNAS-8.3.0-RELEASE-p1-x64 (r12825)
Platform AMD E-350 Processor
Memory 7774MB
HDD 5 x 2TB ( RAIDZ2 )
SSD 1 x 64GB ( cache )
The problem is that one of my HDDs seems to be dying, but the information from the GUI and from the command line doesn't match.
It all started with the GUI displaying an alert message:
Code:
WARNING: The volume share (ZFS) status is UNKNOWN: One or more devices has experienced an unrecoverable error. An attempt was made to correct the error. Applications are unaffected. Determine if the device needs to be replaced, and clear the errors using 'zpool clear' or replace the device with 'zpool replace'.
OK, so I go and check the Storage, View Disks and Volume Status tabs, and here is what I get:



Storage is reported as healthy, yet only 4 disks are displayed (instead of 6), and the Volume Status tab shows one disk and the SSD as null... That's the moment I decide to open an SSH connection to the machine and run zpool status:
Code:
[root@freenas] ~# zpool status
  pool: share
 state: ONLINE
status: One or more devices has experienced an unrecoverable error. An
        attempt was made to correct the error. Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
        using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://www.sun.com/msg/ZFS-8000-9P
  scan: scrub repaired 0 in 2h51m with 0 errors on Sun Mar 3 14:51:49 2013
config:

        NAME                                            STATE     READ WRITE CKSUM
        share                                           ONLINE       0     0     0
          raidz2-0                                      ONLINE       0     0     0
            gptid/a179b007-59c3-11e1-9ae0-14dae9686174  ONLINE       0     0     0
            gptid/a20198bf-59c3-11e1-9ae0-14dae9686174  ONLINE       0     0     0
            gptid/a2951acb-59c3-11e1-9ae0-14dae9686174  ONLINE       0     0     0
            gptid/a322af78-59c3-11e1-9ae0-14dae9686174  ONLINE       0     0     0
            gptid/a3ae295d-59c3-11e1-9ae0-14dae9686174  ONLINE      10 1.12K     0
        cache
          ada4p1                                        ONLINE       0 19.3M     0

errors: No known data errors
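Note that although every device is listed as ONLINE, the READ/WRITE columns are not all zero: the last raidz2 member shows 10 read and 1.12K write errors, and the cache device shows 19.3M write errors. To make that easier to spot, here is a rough Python sketch that picks out the rows with non-zero counters. It is only an illustration: `devices_with_errors` and `parse_count` are hypothetical helpers of my own, not anything shipped with FreeNAS, and the way the K/M suffixes are expanded (base 1024) is an assumption that only matters for the exact numbers, not for the zero/non-zero check.

```python
import re

# Assumed multipliers for abbreviated counters such as 1.12K or 19.3M.
SUFFIXES = {"K": 1024, "M": 1024**2, "G": 1024**3, "T": 1024**4}

# A device row looks like: <name> <STATE> <READ> <WRITE> <CKSUM>
ROW = re.compile(
    r"^\s*(\S+)\s+(ONLINE|DEGRADED|FAULTED|OFFLINE|UNAVAIL|REMOVED)"
    r"\s+(\S+)\s+(\S+)\s+(\S+)\s*$"
)


def parse_count(token):
    """Convert a counter such as '10', '1.12K' or '19.3M' to an integer."""
    if token[-1] in SUFFIXES:
        return int(round(float(token[:-1]) * SUFFIXES[token[-1]]))
    return int(token)


def devices_with_errors(status_text):
    """Return {device: (read, write, cksum)} for rows with non-zero counters."""
    flagged = {}
    for line in status_text.splitlines():
        match = ROW.match(line)
        if not match:
            continue  # headers, the 'cache' label, free text, ...
        try:
            counts = tuple(parse_count(t) for t in match.group(3, 4, 5))
        except ValueError:
            continue  # row matched the shape but the counters were not numeric
        if any(counts):
            flagged[match.group(1)] = counts
    return flagged


# Excerpt of the config section from the output above.
SAMPLE = """\
        NAME                                            STATE     READ WRITE CKSUM
        share                                           ONLINE       0     0     0
          raidz2-0                                      ONLINE       0     0     0
            gptid/a322af78-59c3-11e1-9ae0-14dae9686174  ONLINE       0     0     0
            gptid/a3ae295d-59c3-11e1-9ae0-14dae9686174  ONLINE      10 1.12K     0
        cache
          ada4p1                                        ONLINE       0 19.3M     0
"""

if __name__ == "__main__":
    for device, (read, write, cksum) in devices_with_errors(SAMPLE).items():
        print(device, "read:", read, "write:", write, "cksum:", cksum)
```

On the excerpt above this flags only the last raidz2 member and the cache device, which are the same two devices the GUI shows as null.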
Everything is shown as online, yet the GUI still displays the original error message. Since everything is online, and I tend to trust the command line a bit more than the GUI, I run zpool clear to get rid of the message. But nothing changes: the error message stays exactly as it was.
So after all that, I decide to order a new drive to replace the possibly dying one. I shut the machine down to swap the drive, but I forgot to note the serial number, so once the case is open I can't tell which drive to remove. I just clean out a bit of dust and restart the machine to get the info, and now everything is fully functional again: no more errors, everything online. Perfect... but wait, all I did was restart the machine? Never mind. I note the serial number of the "dying" disk but decide to leave it alone for now, since everything is back to normal.
But as you can guess, a couple of hours later I am back at the same point: the error message is there and everything is online, but not in all the views... So now I just want to know for sure whether the drive needs to be replaced. Are there any tools installed in FreeNAS to check that? Which should I trust, the GUI or the command line? And why doesn't zpool clear remove the error message when the pool is reported as online and healthy?
Thanks to those who read this far; I will take any advice you may have!