Thank you FreeNAS/ZFS

Status
Not open for further replies.

CraigD

Patron
Joined
Mar 8, 2016
Messages
343
I hate it when drives fail or are about to fail, but it is great to know that it is going to happen

freeNAS is scrolling uncorrectable table sectors and unreadable (pending) sectors from one of my drives

RIP my oldest WD 2TB drive it only had 43000 hours on it...

ZFS saved my data

Have Fun

Code:
                                                                                                                                   
=== START OF READ SMART DATA SECTION ===                                                                                           
SMART Attributes Data Structure revision number: 16                                                                                 
Vendor Specific SMART Attributes with Thresholds:                                                                                   
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE                                   
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       1                                           
  3 Spin_Up_Time            0x0027   176   170   021    Pre-fail  Always       -       6183                                         
  4 Start_Stop_Count        0x0032   098   098   000    Old_age   Always       -       2581                                         
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0                                           
  7 Seek_Error_Rate         0x002e   100   253   000    Old_age   Always       -       0                                           
  9 Power_On_Hours          0x0032   041   041   000    Old_age   Always       -       43359                                       
10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0                                           
11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0                                           
12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       332                                         
192 Power-Off_Retract_Count 0x0032   200   200   000    Old_age   Always       -       198                                         
193 Load_Cycle_Count        0x0032   119   119   000    Old_age   Always       -       243014                                       
194 Temperature_Celsius     0x0022   125   102   000    Old_age   Always       -       25                                           
196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0                                           
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       1                                           
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       1                                           
199 UDMA_CRC_Error_Count    0x0032   200   197   000    Old_age   Always       -       98121                                       
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       48                                         
 

DrKK

FreeNAS Generalissimo
Joined
Oct 15, 2013
Messages
3,630
Having a spike on #199 is very often a bad SATA cable.
 

CraigD

Patron
Joined
Mar 8, 2016
Messages
343
I will change the cable

The ONE error in 197 and 198 and freeNAS is letting be know every 3 minutes or so is concerning me

Should I wait until the drive dies before replacing it or swap it out ASAP? (Drive is 25C so not overheating)

My pool in RAIDz2 still up and not DEGRADED and a scrub show no errors

Thanks
PS I hadn't build a PC in 25 years until this year when I build my freeNAS server. In this time nothing had greatly changed!
 

DrKK

FreeNAS Generalissimo
Joined
Oct 15, 2013
Messages
3,630
Well a couple things concern me. The drive has 43000 hours on it. That's elderly. 5 years, roughly, of 24/7. That's the service life for any drive, really, if you're lucky.

You've got 250000 load cycles on it. That means that either this never was an appropriate NAS drive (perhaps it's a WD green?), or, it had a previous life in an environment that wore the shit out of it.

And having any number but 0 in #200 is a general indication of failing health due to elderliness.

I would consider this drive at the end of its physical life, and getting ready to give out.

That being said, if the only thing wrong is a single sector reallocation, and nothing else is going wrong, and scrubs are completing without READ/WRITE/CKSUMs, then, meh? Who knows? Might be fine for a period of time.

I would say this drive has a 20% chance of dying, in any given month, and since (0.8)^12 = 6.8%, you are very unlikely to survive another year.

It's time. Your drive has performed admirable service, but he is ready for his retirement package.
 

CraigD

Patron
Joined
Mar 8, 2016
Messages
343
Yes it is a WD green, and more green drives are in the pool I know it is wrong but another US~$700 on drives was not going to happen at the time

I just created 8x2TB RAIDz2 pool with what I had on hand 3 different makes, and different models

I am adding planning another 8x4TB NAS Drive RAIDz2 vdev soon

Then the plan is to slowly replace the 2TB Drives with 8 TB or larger drives as they die/show errors or money allows this maybe 3-4 years away (a couple of the drives have 30000 hours on them)

The cost of hardware outside of America is insane

Thanks
Have Fun
Well a couple things concern me. The drive has 43000 hours on it. That's elderly. 5 years, roughly, of 24/7. That's the service life for any drive, really, if you're lucky.

You've got 250000 load cycles on it. That means that either this never was an appropriate NAS drive (perhaps it's a WD green?), or, it had a previous life in an environment that wore the **** out of it.

I would consider this drive at the end of its physical life, and getting ready to give out.

It's time. Your drive has performed admirable service, but he is ready for his retirement package.
 

pschatz100

Guru
Joined
Mar 30, 2014
Messages
1,184
High load cycle counts on WD Green drives can be mitigated by use of a utility called WDIDLE3. There are lots of old posts in this forum that discuss the utility. A quick internet search will also turn up a lot of useful advice. I have several WD Green drives that were re-purposed for NAS installations, and they have been as reliable as anything else.

If this were my system, I would start shopping for a replacement drive. You can probably take a little time to get exactly what you want, but start now. If you wait for the drive to fail, then you will have to work quickly with whatever drive you can get.
 

CraigD

Patron
Joined
Mar 8, 2016
Messages
343
RIP it died and I got this nasty little Email

The volume RaidA (ZFS) state is DEGRADED: One or more devices could not be opened. Sufficient replicas exist for the pool to continue functioning in a degraded state.

I was watching video from the server at the time and didn't even know until I checked my mail 2 hours later!

I got a warning of a problem Saturday, purchased a replacement WD RED Drive, then the old drive dies (5 days later) and I was Emailed

I have the replacement drive in hand but I'm not touching the server until later in the day as it is 0322

Have Fun
 

Stux

MVP
Joined
Jun 2, 2016
Messages
4,419
Refresh your backup before replacing the disk ;)
 
Status
Not open for further replies.
Top