Upgrading Really Old FreeNAS Setup

JonnyAlpha

Contributor
Joined
Jul 12, 2012
Messages
128
Hi,

I have a really old FreeNAS setup and talked about upgrading it here. I never managed to get around to upgrading it and occasionally fire it up and access the data.
Having been off work recovering from an Op and started to use it again in an attempt to sort old all of my old (but much still required) data.

It's been working OK, but I know I do need to seriously look at my data storage solution and upgrade.

Today I went to fire it up and got the dreaded - your pool is in a degraded state, one drive is not being recognised - or something like that.
With regards to fixing this, where is the best place to post questions about an old system?

WIth regards to replacement options, I'll start looking at the info on these Forums, but not sure which path I will go - ready made NAS, Raspberry Pi NAS or PC based NAS? It's been a long time since I researched any of this so I'll have to start from the beginning again.

First stop I think is to get a 4TB Seagate or WD USB HDD and backup what I have on the FreeNAS box!
 

Etorix

Wizard
Joined
Dec 30, 2020
Messages
2,134
This looks like you are the one and lonely who has a case to post his thread under "FreeNAS (Legacy Software Releases)" (are you still on FreeNAS 8.3.0?) whereas just about everybody else ends up posting under the legacy section while they really should have scrolled down to the "TrueNAS CORE" or "SCALE" sections as appropiate. Congratulations!

An external backup is a very good idea.
Posting the output of zpool status -v and smartctl -a /dev/adaN (replacing 'adaN' by whatever moniker the failing drive goes by) would help assessing the situation. Please use CODE tags (</>).

As for the replacement, it depends how much you're willing to spend, in money and in time to manage a self-made NAS. A PC-based NAS running TrueNAS has the benefit that you could just plug in your remaining drives—but they probably all need to be ultimately replaced due to age. Do you, by any chance, have some unused-but-not-desperately-old PC lying around?
 

JonnyAlpha

Contributor
Joined
Jul 12, 2012
Messages
128
Not sure how you quote a post on this forum so I just replied.

Anyway, before I switched the NAS on today I took the cabinet side off, low and behold the power cable for one of the HDDs appears to have come undone? No idea how that may have happened.
Just booted up but it's still showing a yellow alert.

Here is the message:
Code:
WARNING: The volume FreeNAS (ZFS) status is UNKNOWN: One or more devices has experienced an unrecoverable error. An attempt was made to correct the error. Applications are unaffected.Determine if the device needs to be replaced, and clear the errors using 'zpool clear' or replace the device with 'zpool replace'.


zpool status -v

Code:
 state: ONLINE                                                                 
status: One or more devices has experienced an unrecoverable error.  An         
        attempt was made to correct the error.  Applications are unaffected.   
action: Determine if the device needs to be replaced, and clear the errors     
        using 'zpool clear' or replace the device with 'zpool replace'.         
   see: http://www.sun.com/msg/ZFS-8000-9P                                     
  scan: scrub repaired 0 in 2h22m with 0 errors on Sun Oct  7 02:22:26 2018     
config:                                                                         
                                                                                
        NAME                                            STATE     READ WRITE CKS
UM                                                                             
        FreeNAS                                         ONLINE       0     0   
 0                                                                             
          raidz1-0                                      ONLINE       0     0   
 0                                                                             
            gptid/ed8f8de0-4317-11e3-97f9-001d92268ec7  ONLINE       0     0   
 0                                                                             
            gptid/2aeeff5a-6248-11e2-b3f1-001d92268ec7  ONLINE       0     0   
 1                                                                             
            gptid/2ba31e04-6248-11e2-b3f1-001d92268ec7  ONLINE       0     0   
 0                                                                             
                                                                                
errors: No known data errors 


smartctl for ada2 (which I think was the disconnected one:

Code:
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -
       0                                                                       
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -
       0                                                                       
                                                                                
SMART Error Log Version: 1                                                     
No Errors Logged                                                               
                                                                                
SMART Self-test log structure revision number 1                                 
No self-tests have been logged.  [To run self-tests, use: smartctl -t]         
                                                                                
                                                                                
SMART Selective self-test log data structure revision number 1                 
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS                                   
    1        0        0  Not_testing                                           
    2        0        0  Not_testing                                           
    3        0        0  Not_testing                                           
    4        0        0  Not_testing                                           
    5        0        0  Not_testing                                           
Selective self-test flags (0x0):                                               
  After scanning selected spans, do NOT read-scan remainder of disk.           
If Selective self-test is pending on power-up, resume after 0 minute delay. 


And ada0:

Code:
       0                                                                       
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -
       0                                                                       
                                                                                
SMART Error Log Version: 1                                                     
No Errors Logged                                                               
                                                                                
SMART Self-test log structure revision number 1                                 
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA
_of_first_error                                                                 
# 1  Short offline       Completed without error       00%        33         - 
                                                                                
SMART Selective self-test log data structure revision number 1                 
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS                                   
    1        0        0  Not_testing                                           
    2        0        0  Not_testing                                           
    3        0        0  Not_testing                                           
    4        0        0  Not_testing                                           
    5        0        0  Not_testing                                           
Selective self-test flags (0x0):                                               
  After scanning selected spans, do NOT read-scan remainder of disk.           
If Selective self-test is pending on power-up, resume after 0 minute delay.


and ada1:

Code:
SMART Error Log Version: 1                                                     
No Errors Logged                                                               
                                                                                
SMART Self-test log structure revision number 1                                 
No self-tests have been logged.  [To run self-tests, use: smartctl -t]         
                                                                                
                                                                                
SMART Selective self-test log data structure revision number 1                 
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS                                   
    1        0        0  Not_testing                                           
    2        0        0  Not_testing                                           
    3        0        0  Not_testing                                           
    4        0        0  Not_testing                                           
    5        0        0  Not_testing                                           
Selective self-test flags (0x0):                                               
  After scanning selected spans, do NOT read-scan remainder of disk.           
If Selective self-test is pending on power-up, resume after 0 minute delay. 
 

Etorix

Wizard
Joined
Dec 30, 2020
Messages
2,134
To quote a post, click the "Reply" button at the end of it.

If it was "just" a dislodged power cable (vibrations?), maybe you're fine. The critical parts in SMART reports would be #5 and #197, pending and reallocated (bad) sectors. (Best log in via ssh to copy-paste from the terminla output; the GUI terminal is broken.) And you should be running regular SMART tests.
I suggest a scrub, zpool clear if it returns fine to clear the lone checksum error; and long SMART tests on all drives.
 

JonnyAlpha

Contributor
Joined
Jul 12, 2012
Messages
128
To quote a post, click the "Reply" button at the end of it.

If it was "just" a dislodged power cable (vibrations?), maybe you're fine. The critical parts in SMART reports would be #5 and #197, pending and reallocated (bad) sectors. (Best log in via ssh to copy-paste from the terminla output; the GUI terminal is broken.) And you should be running regular SMART tests.
I suggest a scrub, zpool clear if it returns fine to clear the lone checksum error; and long SMART tests on all drives.
Hmm, I ssh all the time into my Raspberry Pi's but never into the FreeNAS. What / how do I find the hostname or whatever its called that goes in front of the @<ip addrss>?
I did a
Code:
zpool clear
and think I ran a scrub from the terminal
Code:
scrub FreeNAS
, the warning was not there when I rebooted.

P.S. Sorry these probably aren't written here correctly as I had to search how to.
 
Last edited:

Davvo

MVP
Joined
Jul 12, 2022
Messages
3,222
Hmm, I ssh all the time into my Raspberry Pi's but never into the FreeNAS. What / how do I find the hostname or whatever its called that goes in front of the @<ip addrss>?
SSH usually works with user@hostname:port; ie root@truenas.local:22. the port is usually required if you don't use the default (22) one.
 
Top