How many SMART errors are ok

Status
Not open for further replies.

pclausen

Patron
Joined
Apr 19, 2015
Messages
267
I have 47 disks of mixed sizes 1,2 and 4 TB. Some are 7200 rpm and others are 5900 rpm. 18 of them have at least one SMART error. The ones that do are pretty old with power on time averaging about 6 years.

Currently these disks are NOT in VDevs in a zpool, but rather individual drives under Windows running Stablebit Drivepool and Scanner, and I'm using SnapRAID for parity.

I want to convert to FreeNAS but I'm concerned about the drives with SMART errors. Here's a list of the drives I have:

diskcounts.JPG


And here are the ones with SMART errors:

smarterrors.JPG


And here's how I was planning to setup my VDevs:

vdevs.JPG


Would I be able to use ANY of the drives with SMART errors, or should I ditch them all?

My usage is going to be media storage for streaming, mostly full quality BD rips, 720p TV shows and about 400k worth of music tracks, all ripped full quality from CD. I have 6 clients that will be sucking media from the server, including one that is 4K so far with more to follow.

Also have a collection of family photos, movies and some documents (this all amounts to only a few gigs).

Will I have issues mixing different RPM drives within VDevs and VDevs of different sizes within the zpool?
 

danb35

Hall of Famer
Joined
Aug 16, 2011
Messages
15,504
Can you use the drives with SMART errors? Sure you can. Is it a good idea? No--as @hugovsky said, they indicate defective drives. Why would you want to put your data on failing drives?

As to which numbers to be concerned with, the offline/uncorrectable sectors are definitely bad, especially with numbers in the hundreds. Reallocated sectors, especially in small numbers, wouldn't concern me too much--they indicate the drive electronics have found a bad sector, remapped it to a good one, and you can carry on. I'm not aware of spin retry being an issue.

Have you run long SMART tests on any of these disks recently? What was the result?
 

pclausen

Patron
Joined
Apr 19, 2015
Messages
267
Thanks that helps. I'll discard the drives with error counts highlighted in orange and plan to retire the rest of them as soon as possible.

I have not run long SMART tests on the questionable drives. I'll definitely do so before considering using them, even short term, for the migration.

My challenge is going to be how to migrate my existing 50TB of data off the existing pool onto the zpool. Right now my data is stored on 7 x 4TB + 14 x 2TB (2 4TB drives for parity). None of those have any SMART errors. My plan was to use the ones with SMART errors to assist with the migration and then retire them.

I might just bite the bullet and order 10 x 4TB WD Reds and set up a 32TB RAIDZ2 array which would allow me to move most data across, then add a 2nd RAIDZ2 array using the now empty 4TB Segate drives and copy the rest over from the 2TB drives and then add a 3rd RAIDZ2 using the empty 2TB drives.

Best practices is for each each VDev in a pool to have the same number of disks, correct?
 

danb35

Hall of Famer
Joined
Aug 16, 2011
Messages
15,504
Best practice is for each vdev to be pretty much identical--same number of disks, same speed, same capacity. It isn't essential, though, and pools can run with mismatches in all areas.

I'd recommend you run long SMART tests on all your disks--and really, before you'd put them into a production FreeNAS server, a few passes of badblocks as well. Badblocks is destructive, so you'd obviously need to have data off the drive first. Once you set up the FreeNAS server, you can schedule regular SMART tests for your drives.

You're talking about quite a lot of drives for a single system--how are you planning to connect them? "An LSI HBA and SAS expander" is a good answer. "A hardware RAID card and SATA port expander", not so much.
 

pclausen

Patron
Joined
Apr 19, 2015
Messages
267
Here's my current hardware setup:

2 x Supermicro 846 Chassis with SAS2 backplanes.
1 x Supermicro X7DWN+ w/ 32GB ECC PC2-6400 RAM and a pair of Xeon X5492 CPUs in one 846 chassis with a pair of IBM 1015 flashed to LSI9211-8i IT P16

I'm getting a 3rd 846 Chassis that was retired from work. I think it has a X8 mobo and SAS2 backplane. I do have a spare SAS2 backplane that I'll swap into in case it does not. I also have a spare 1015 on hand. The 846 I'm getting will have 24 1TB drives in it. Condition of drives unknown at this point.

I ordered the following a few days ago:

Supermicro X10SRL-F motherboard
Intel Xeon E5-1620 V2
2 x SAMSUNG 16GB DDR4 2133 ECC ran

So my plan is to pull the X8 mobo and install the X10 into the 3rd 846 chassis and get FreeNAS up and running using the 3rd 1015 card, thoroughly test RAM and everything else to ensure stability. Then start setting up my initial VDevs, test some more, and then begin migrating data over one disk at a time, to free them up.

Once everything is migrated over, pull the X7 out of the current "master" 846 chassis, and sell both the X7 and X8 mobos. I'll probably also sell 2 of my 1015 cards in favor of 9200-8e cards for a cleaner looking setup to connect to the external SAS expanders.

So at the end of the day, I'll have 72 hot swap bays. 72 / 9 = 8, so I'm thinking doing 9 disk VDevs from the get go makes sense. Over time I'll slowly upgrade the 1TB and 2TB drives within some of the VDevs to 4TB, and eventually have 224 TB. :D
 

danb35

Hall of Famer
Joined
Aug 16, 2011
Messages
15,504
OK, your hardware plans sound fine--we've seen enough folks do enough bad things that it was worth checking up front. You'll no doubt want more RAM in due course, but that board gives you plenty of room for expansion.

When you install FreeNAS, you'll want to make sure the SMART service is enabled, it's set up with an email address for notifications, and you schedule regular SMART tests on your drives. This will often give you early warning of disk failures, so you can replace failing disks before your pool is degraded (making the whole thing a much lower-stress process).
 

Ericloewe

Server Wrangler
Moderator
Joined
Feb 15, 2014
Messages
20,194
A few errors (~5 at most, for most people) are acceptable, given a decent pool layout. 100+ errors pretty much mean an unusable drive, though.
 

pclausen

Patron
Joined
Apr 19, 2015
Messages
267
Thanks. Yeah, I'll go ahead and replace those 1TB drives with high error counts before getting started. Those Seagate Enterprise ES.2 have been good to me overall I think being that they have lasted almost a decade. Time to start looking around for good deals on some 4TB Reds I suppose.

I started out with a Buslogic controller way back in the day, moved to 3Ware, then Areca, where I enjoyed my 1170 and then added and 1680 and life was good running several years with a pair of 24 drive RAID6 arrays. Then, after almost loosing 44TB of data (I lost a 3rd drive in the 24x2TB array during a rebuilt, but was able to coax one of them back to life long enough to get back to a degraded state, and then healthy. I then converted over the tRAID for a year, and then more recently SnapRAID with drivepool as mentioned above. But my SnapRaid syncs, scrubs and fix runs are leaving more and more unrecoverable errors, and this is after phasing out all the drives with SMART errors.

So I'm going to stand up this new FreeNAS server and hopefully find it to be my long term solution for my ever growing media collection. I have a lot of homework ahead of me getting Emby, Sickbeard and Sabnzbd. I'm also running JRiver on my current server, so I need to look for alternatives there, perhaps standing up a 1U box running windows for that particular duty.
 

pclausen

Patron
Joined
Apr 19, 2015
Messages
267
I pulled out the Barracuda's with high error counts and have a set of 10 that have relatively low SMART error counts. I ran the full suite of tests against them, following qwertymodo's excellent guide.

So I ran short, conveyance, long and badblocks. All 10 drives passed all 4 tests with no errors reported during testing. Took about 50 hours per drive. Ran badblock in parallel using screen.

The info for all 10 drives are identical and are as follows:

=== START OF INFORMATION SECTION ===
Model Family: Seagate Barracuda ES.2
Device Model: ST31000340NS
Firmware Version: SN06
User Capacity: 1,000,204,886,016 bytes [1.00 TB]
Sector Size: 512 bytes logical/physical
Rotation Rate: 7200 rpm
Device is: In smartctl database [for details use: -P show]
ATA Version is: ATA8-ACS T13/1699-D revision 4
SATA Version is: SATA 2.6, 3.0 Gb/s
Local Time is: Sat May 9 05:33:36 2015 EDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

And here are the smartctl -A results for each drive (DA10-19):

Code:
DA10
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000f  082  063  044  Pre-fail  Always  -  200965087
  3 Spin_Up_Time  0x0003  099  084  000  Pre-fail  Always  -  0
  4 Start_Stop_Count  0x0032  094  094  020  Old_age  Always  -  6315
  5 Reallocated_Sector_Ct  0x0033  100  100  036  Pre-fail  Always  -  20
  7 Seek_Error_Rate  0x000f  060  059  030  Pre-fail  Always  -  438188920521
  9 Power_On_Hours  0x0032  039  039  000  Old_age  Always  -  54220
10 Spin_Retry_Count  0x0013  100  100  097  Pre-fail  Always  -  182
12 Power_Cycle_Count  0x0032  100  037  020  Old_age  Always  -  361
184 End-to-End_Error  0x0032  100  100  099  Old_age  Always  -  0
187 Reported_Uncorrect  0x0032  100  100  000  Old_age  Always  -  0
188 Command_Timeout  0x0032  100  100  000  Old_age  Always  -  0
189 High_Fly_Writes  0x003a  100  100  000  Old_age  Always  -  0
190 Airflow_Temperature_Cel 0x0022  069  042  045  Old_age  Always  In_the_past 31 (3 120 33 29 0)
194 Temperature_Celsius  0x0022  031  058  000  Old_age  Always  -  31 (0 12 0 0 0)
195 Hardware_ECC_Recovered  0x001a  043  018  000  Old_age  Always  -  200965087
197 Current_Pending_Sector  0x0012  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0010  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x003e  200  200  000  Old_age  Always  -  0

DA11
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000f  081  063  006  Pre-fail  Always  -  159749386
  3 Spin_Up_Time  0x0003  099  084  000  Pre-fail  Always  -  0
  4 Start_Stop_Count  0x0032  094  094  020  Old_age  Always  -  6645
  5 Reallocated_Sector_Ct  0x0033  100  100  036  Pre-fail  Always  -  8
  7 Seek_Error_Rate  0x000f  078  077  030  Pre-fail  Always  -  8751495614
  9 Power_On_Hours  0x0032  033  033  000  Old_age  Always  -  58757
10 Spin_Retry_Count  0x0013  100  099  097  Pre-fail  Always  -  158
12 Power_Cycle_Count  0x0032  100  037  020  Old_age  Always  -  361
184 End-to-End_Error  0x0032  100  100  099  Old_age  Always  -  0
187 Reported_Uncorrect  0x0032  100  100  000  Old_age  Always  -  0
188 Command_Timeout  0x0032  100  100  000  Old_age  Always  -  0
189 High_Fly_Writes  0x003a  001  001  000  Old_age  Always  -  970
190 Airflow_Temperature_Cel 0x0022  066  050  045  Old_age  Always  -  34 (Min/Max 32/36)
194 Temperature_Celsius  0x0022  034  050  000  Old_age  Always  -  34 (0 12 0 0 0)
195 Hardware_ECC_Recovered  0x001a  039  017  000  Old_age  Always  -  159749386
197 Current_Pending_Sector  0x0012  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0010  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x003e  200  200  000  Old_age  Always  -  0

DA12
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000f  072  063  044  Pre-fail  Always  -  20432430
  3 Spin_Up_Time  0x0003  099  084  000  Pre-fail  Always  -  0
  4 Start_Stop_Count  0x0032  094  094  020  Old_age  Always  -  6315
  5 Reallocated_Sector_Ct  0x0033  100  100  036  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x000f  077  060  030  Pre-fail  Always  -  8692985671
  9 Power_On_Hours  0x0032  039  039  000  Old_age  Always  -  54220
10 Spin_Retry_Count  0x0013  100  100  097  Pre-fail  Always  -  279
12 Power_Cycle_Count  0x0032  100  037  020  Old_age  Always  -  361
184 End-to-End_Error  0x0032  100  100  099  Old_age  Always  -  0
187 Reported_Uncorrect  0x0032  100  100  000  Old_age  Always  -  0
188 Command_Timeout  0x0032  100  100  000  Old_age  Always  -  0
189 High_Fly_Writes  0x003a  100  100  000  Old_age  Always  -  0
190 Airflow_Temperature_Cel 0x0022  064  053  045  Old_age  Always  -  36 (Min/Max 34/39)
194 Temperature_Celsius  0x0022  036  047  000  Old_age  Always  -  36 (0 12 0 0 0)
195 Hardware_ECC_Recovered  0x001a  042  023  000  Old_age  Always  -  20432430
197 Current_Pending_Sector  0x0012  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0010  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x003e  200  200  000  Old_age  Always  -  0

DA13
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000f  078  063  044  Pre-fail  Always  -  77805818
  3 Spin_Up_Time  0x0003  099  084  000  Pre-fail  Always  -  0
  4 Start_Stop_Count  0x0032  095  095  020  Old_age  Always  -  5944
  5 Reallocated_Sector_Ct  0x0033  100  100  036  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x000f  064  060  030  Pre-fail  Always  -  124642314272
  9 Power_On_Hours  0x0032  042  042  000  Old_age  Always  -  51313
10 Spin_Retry_Count  0x0013  100  099  097  Pre-fail  Always  -  386
12 Power_Cycle_Count  0x0032  100  037  020  Old_age  Always  -  344
184 End-to-End_Error  0x0032  100  100  099  Old_age  Always  -  0
187 Reported_Uncorrect  0x0032  100  100  000  Old_age  Always  -  0
188 Command_Timeout  0x0032  100  100  000  Old_age  Always  -  0
189 High_Fly_Writes  0x003a  100  100  000  Old_age  Always  -  0
190 Airflow_Temperature_Cel 0x0022  068  060  045  Old_age  Always  -  32 (Min/Max 30/34)
194 Temperature_Celsius  0x0022  032  040  000  Old_age  Always  -  32 (0 10 0 0 0)
195 Hardware_ECC_Recovered  0x001a  045  016  000  Old_age  Always  -  77805818
197 Current_Pending_Sector  0x0012  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0010  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x003e  200  200  000  Old_age  Always  -  4

DA14
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000f  075  063  044  Pre-fail  Always  -  36209999
  3 Spin_Up_Time  0x0003  099  089  000  Pre-fail  Always  -  0
  4 Start_Stop_Count  0x0032  095  095  020  Old_age  Always  -  5635
  5 Reallocated_Sector_Ct  0x0033  100  100  036  Pre-fail  Always  -  1
  7 Seek_Error_Rate  0x000f  078  060  030  Pre-fail  Always  -  70462456
  9 Power_On_Hours  0x0032  045  045  000  Old_age  Always  -  48345
10 Spin_Retry_Count  0x0013  100  099  097  Pre-fail  Always  -  404
12 Power_Cycle_Count  0x0032  100  037  020  Old_age  Always  -  303
184 End-to-End_Error  0x0032  100  100  099  Old_age  Always  -  0
187 Reported_Uncorrect  0x0032  100  100  000  Old_age  Always  -  0
188 Command_Timeout  0x0032  100  100  000  Old_age  Always  -  4295032833
189 High_Fly_Writes  0x003a  100  100  000  Old_age  Always  -  0
190 Airflow_Temperature_Cel 0x0022  069  055  045  Old_age  Always  -  31 (Min/Max 29/32)
194 Temperature_Celsius  0x0022  031  045  000  Old_age  Always  -  31 (0 12 0 0 0)
195 Hardware_ECC_Recovered  0x001a  042  023  000  Old_age  Always  -  36209999
197 Current_Pending_Sector  0x0012  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0010  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x003e  200  200  000  Old_age  Always  -  0

DA15
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000f  081  063  006  Pre-fail  Always  -  166043302
  3 Spin_Up_Time  0x0003  099  084  000  Pre-fail  Always  -  0
  4 Start_Stop_Count  0x0032  094  094  020  Old_age  Always  -  6639
  5 Reallocated_Sector_Ct  0x0033  100  100  036  Pre-fail  Always  -  1
  7 Seek_Error_Rate  0x000f  078  074  030  Pre-fail  Always  -  8751031262
  9 Power_On_Hours  0x0032  033  033  000  Old_age  Always  -  59096
10 Spin_Retry_Count  0x0013  100  099  097  Pre-fail  Always  -  150
12 Power_Cycle_Count  0x0032  100  037  020  Old_age  Always  -  355
184 End-to-End_Error  0x0032  100  100  099  Old_age  Always  -  0
187 Reported_Uncorrect  0x0032  100  100  000  Old_age  Always  -  0
188 Command_Timeout  0x0032  100  100  000  Old_age  Always  -  0
189 High_Fly_Writes  0x003a  100  100  000  Old_age  Always  -  0
190 Airflow_Temperature_Cel 0x0022  067  051  045  Old_age  Always  -  33 (Min/Max 31/35)
194 Temperature_Celsius  0x0022  033  049  000  Old_age  Always  -  33 (0 13 0 0 0)
195 Hardware_ECC_Recovered  0x001a  041  018  000  Old_age  Always  -  166043302
197 Current_Pending_Sector  0x0012  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0010  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x003e  200  200  000  Old_age  Always  -  0

DA16
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000f  082  063  006  Pre-fail  Always  -  180458818
  3 Spin_Up_Time  0x0003  099  084  000  Pre-fail  Always  -  0
  4 Start_Stop_Count  0x0032  094  094  020  Old_age  Always  -  6650
  5 Reallocated_Sector_Ct  0x0033  100  100  036  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x000f  081  075  030  Pre-fail  Always  -  4435280436
  9 Power_On_Hours  0x0032  036  036  000  Old_age  Always  -  56136
10 Spin_Retry_Count  0x0013  100  099  097  Pre-fail  Always  -  177
12 Power_Cycle_Count  0x0032  100  037  020  Old_age  Always  -  347
184 End-to-End_Error  0x0032  100  100  099  Old_age  Always  -  0
187 Reported_Uncorrect  0x0032  100  100  000  Old_age  Always  -  0
188 Command_Timeout  0x0032  100  100  000  Old_age  Always  -  0
189 High_Fly_Writes  0x003a  100  100  000  Old_age  Always  -  0
190 Airflow_Temperature_Cel 0x0022  067  057  045  Old_age  Always  -  33 (Min/Max 31/35)
194 Temperature_Celsius  0x0022  033  043  000  Old_age  Always  -  33 (0 11 0 0 0)
195 Hardware_ECC_Recovered  0x001a  045  020  000  Old_age  Always  -  180458818
197 Current_Pending_Sector  0x0012  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0010  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x003e  200  200  000  Old_age  Always  -  0

DA17
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000f  083  058  006  Pre-fail  Always  -  208044956
  3 Spin_Up_Time  0x0003  099  084  000  Pre-fail  Always  -  0
  4 Start_Stop_Count  0x0032  094  094  020  Old_age  Always  -  6620
  5 Reallocated_Sector_Ct  0x0033  094  094  036  Pre-fail  Always  -  136
  7 Seek_Error_Rate  0x000f  075  072  030  Pre-fail  Always  -  17341597228
  9 Power_On_Hours  0x0032  033  033  000  Old_age  Always  -  59009
10 Spin_Retry_Count  0x0013  100  100  097  Pre-fail  Always  -  132
12 Power_Cycle_Count  0x0032  100  037  020  Old_age  Always  -  352
184 End-to-End_Error  0x0032  100  100  099  Old_age  Always  -  0
187 Reported_Uncorrect  0x0032  001  001  000  Old_age  Always  -  165
188 Command_Timeout  0x0032  100  097  000  Old_age  Always  -  12885098499
189 High_Fly_Writes  0x003a  100  100  000  Old_age  Always  -  0
190 Airflow_Temperature_Cel 0x0022  066  058  045  Old_age  Always  -  34 (Min/Max 32/36)
194 Temperature_Celsius  0x0022  034  042  000  Old_age  Always  -  34 (0 12 0 0 0)
195 Hardware_ECC_Recovered  0x001a  051  026  000  Old_age  Always  -  208044956
197 Current_Pending_Sector  0x0012  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0010  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x003e  200  200  000  Old_age  Always  -  0

DA18
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000f  073  055  044  Pre-fail  Always  -  22877827
  3 Spin_Up_Time  0x0003  098  084  000  Pre-fail  Always  -  0
  4 Start_Stop_Count  0x0032  094  094  020  Old_age  Always  -  6297
  5 Reallocated_Sector_Ct  0x0033  100  100  036  Pre-fail  Always  -  2
  7 Seek_Error_Rate  0x000f  080  060  030  Pre-fail  Always  -  104228391
  9 Power_On_Hours  0x0032  039  039  000  Old_age  Always  -  54222
10 Spin_Retry_Count  0x0013  100  099  097  Pre-fail  Always  -  191
12 Power_Cycle_Count  0x0032  100  037  020  Old_age  Always  -  356
184 End-to-End_Error  0x0032  100  100  099  Old_age  Always  -  0
187 Reported_Uncorrect  0x0032  100  100  000  Old_age  Always  -  0
188 Command_Timeout  0x0032  100  099  000  Old_age  Always  -  1
189 High_Fly_Writes  0x003a  096  096  000  Old_age  Always  -  4
190 Airflow_Temperature_Cel 0x0022  067  060  045  Old_age  Always  -  33 (Min/Max 31/35)
194 Temperature_Celsius  0x0022  033  040  000  Old_age  Always  -  33 (0 11 0 0 0)
195 Hardware_ECC_Recovered  0x001a  034  016  000  Old_age  Always  -  22877827
197 Current_Pending_Sector  0x0012  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0010  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x003e  200  200  000  Old_age  Always  -  0

DA19
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000f  083  063  044  Pre-fail  Always  -  208857009
  3 Spin_Up_Time  0x0003  098  084  000  Pre-fail  Always  -  0
  4 Start_Stop_Count  0x0032  094  094  020  Old_age  Always  -  6570
  5 Reallocated_Sector_Ct  0x0033  098  098  036  Pre-fail  Always  -  43
  7 Seek_Error_Rate  0x000f  081  060  030  Pre-fail  Always  -  137379283
  9 Power_On_Hours  0x0032  035  035  000  Old_age  Always  -  57100
10 Spin_Retry_Count  0x0013  100  099  097  Pre-fail  Always  -  171
12 Power_Cycle_Count  0x0032  100  037  020  Old_age  Always  -  397
184 End-to-End_Error  0x0032  100  100  099  Old_age  Always  -  0
187 Reported_Uncorrect  0x0032  100  100  000  Old_age  Always  -  0
188 Command_Timeout  0x0032  100  100  000  Old_age  Always  -  0
189 High_Fly_Writes  0x003a  100  100  000  Old_age  Always  -  0
190 Airflow_Temperature_Cel 0x0022  066  046  045  Old_age  Always  -  34 (Min/Max 32/37)
194 Temperature_Celsius  0x0022  034  054  000  Old_age  Always  -  34 (0 11 0 0 0)
195 Hardware_ECC_Recovered  0x001a  047  019  000  Old_age  Always  -  208857009
197 Current_Pending_Sector  0x0012  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0010  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x003e  200  200  000  Old_age  Always  -  0


So none of the drives have any Current_Pending_Sector or Offline_Uncorrectable lines errors

One drive has a 4 count on the UDMA_CRC_Error_Count

7 drives have Reallocated_Sector_Ct counts, but they are relatively low

Should I be worried about any of the other raw value counts?

Now I know you guys are recommending not using drives with ANY smart errors, but the fact that these all passed the badblock test with no errors, would that not provide *some* indication that they still have some life left in them?
 
Last edited:

Bidule0hm

Server Electronics Sorcerer
Joined
Aug 5, 2013
Messages
3,710
The CRC errors are usually because of a bad cable or a bad connection (reseating the connectors at both ends is a good idea), you might want to check that :)

Keep an eye on the drives with reallocated sectors (especially the reallocated sectors and pending sectors values), if the values start to rise significantly then the drive is pretty much dying. It's normal during the life of a drive to have a few bad sectors but not more than a few dozen.
 

Ericloewe

Server Wrangler
Moderator
Joined
Feb 15, 2014
Messages
20,194
The ECC errors bother me, but may just be internal stuff that is expected to grow tremendously.

8 reallocated sectors is just outside of my comfort comfort zone. 20 is troubling. 100 is unacceptable, in my opinion.
 

Bidule0hm

Server Electronics Sorcerer
Joined
Aug 5, 2013
Messages
3,710
This drive has more than 50 kh of usage so 20 reallocated sectors isn't that bad, plus there's no pending sectors. It's not perfect but the drive is still good enough to use it I think. Of course I don't think it'll last very long as it has already about 6 years of usage.

However I'd keep some spare drives to replace the drives at the first sign of failure just to be safe ;)
 

pclausen

Patron
Joined
Apr 19, 2015
Messages
267
Appreciate the feedback. Yes, I'll keep some spares on hand for sure. On the one with the CRC errors, I'm not sure there's anything I can do as far as checking cables since all the drives are directly connected to a SAS2 backplane with a SFF-8087 cable going to the IBM 1050 controller. I can pull the drive, inspect the connector and re-seat it, to see if that makes a difference.

I do plan to migrate this VDev over time to 4TB drives, so I'll get a few now and replaced the ones with the highest reallocated sectors and keep one more as a spare.

My set of 10 2TB drives completed testing overnight (no errors reported by badblocks on any of these either) and comparing the results to the 10 1TB drives is interesting.

8 of the drives are of this type:

=== START OF INFORMATION SECTION ===
Model Family: Hitachi Deskstar 7K2000
Device Model: Hitachi HDS722020ALA330
Firmware Version: JKAOA3MA
User Capacity: 2,000,398,934,016 bytes [2.00 TB]
Sector Size: 512 bytes logical/physical
Rotation Rate: 7200 rpm
Form Factor: 3.5 inches
Device is: In smartctl database [for details use: -P show]
ATA Version is: ATA8-ACS T13/1699-D revision 4
SATA Version is: SATA 2.6, 3.0 Gb/s
Local Time is: Sun May 10 05:16:12 2015 EDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

And the results for them are as follows:

Code:
DA0
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000b  100  100  016  Pre-fail  Always  -  0
  2 Throughput_Performance  0x0005  100  100  054  Pre-fail  Offline  -  0
  3 Spin_Up_Time  0x0007  127  127  024  Pre-fail  Always  -  632 (Average 509)
  4 Start_Stop_Count  0x0012  099  099  000  Old_age  Always  -  4739
  5 Reallocated_Sector_Ct  0x0033  100  100  005  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x000b  100  100  067  Pre-fail  Always  -  0
  8 Seek_Time_Performance  0x0005  100  100  020  Pre-fail  Offline  -  0
  9 Power_On_Hours  0x0012  095  095  000  Old_age  Always  -  36872
10 Spin_Retry_Count  0x0013  100  100  060  Pre-fail  Always  -  0
12 Power_Cycle_Count  0x0032  100  100  000  Old_age  Always  -  178
192 Power-Off_Retract_Count 0x0032  096  096  000  Old_age  Always  -  5791
193 Load_Cycle_Count  0x0012  096  096  000  Old_age  Always  -  5791
194 Temperature_Celsius  0x0002  166  166  000  Old_age  Always  -  36 (Min/Max 15/44)
196 Reallocated_Event_Count 0x0032  100  100  000  Old_age  Always  -  0
197 Current_Pending_Sector  0x0022  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0008  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x000a  200  200  000  Old_age  Always  -  0

DA1
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000b  100  100  016  Pre-fail  Always  -  0
  2 Throughput_Performance  0x0005  100  100  054  Pre-fail  Offline  -  0
  3 Spin_Up_Time  0x0007  130  130  024  Pre-fail  Always  -  621 (Average 489)
  4 Start_Stop_Count  0x0012  099  099  000  Old_age  Always  -  4735
  5 Reallocated_Sector_Ct  0x0033  100  100  005  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x000b  100  100  067  Pre-fail  Always  -  0
  8 Seek_Time_Performance  0x0005  100  100  020  Pre-fail  Offline  -  0
  9 Power_On_Hours  0x0012  095  095  000  Old_age  Always  -  36846
10 Spin_Retry_Count  0x0013  100  100  060  Pre-fail  Always  -  0
12 Power_Cycle_Count  0x0032  100  100  000  Old_age  Always  -  176
192 Power-Off_Retract_Count 0x0032  096  096  000  Old_age  Always  -  5717
193 Load_Cycle_Count  0x0012  096  096  000  Old_age  Always  -  5717
194 Temperature_Celsius  0x0002  171  171  000  Old_age  Always  -  35 (Min/Max 14/43)
196 Reallocated_Event_Count 0x0032  100  100  000  Old_age  Always  -  0
197 Current_Pending_Sector  0x0022  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0008  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x000a  200  200  000  Old_age  Always  -  0

DA2
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000b  100  100  016  Pre-fail  Always  -  0
  2 Throughput_Performance  0x0005  100  100  054  Pre-fail  Offline  -  0
  3 Spin_Up_Time  0x0007  130  130  024  Pre-fail  Always  -  632 (Average 479)
  4 Start_Stop_Count  0x0012  100  100  000  Old_age  Always  -  1552
  5 Reallocated_Sector_Ct  0x0033  100  100  005  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x000b  100  100  067  Pre-fail  Always  -  0
  8 Seek_Time_Performance  0x0005  100  100  020  Pre-fail  Offline  -  0
  9 Power_On_Hours  0x0012  095  095  000  Old_age  Always  -  38265
10 Spin_Retry_Count  0x0013  100  100  060  Pre-fail  Always  -  0
12 Power_Cycle_Count  0x0032  100  100  000  Old_age  Always  -  160
192 Power-Off_Retract_Count 0x0032  098  098  000  Old_age  Always  -  2646
193 Load_Cycle_Count  0x0012  098  098  000  Old_age  Always  -  2646
194 Temperature_Celsius  0x0002  166  166  000  Old_age  Always  -  36 (Min/Max 15/57)
196 Reallocated_Event_Count 0x0032  100  100  000  Old_age  Always  -  0
197 Current_Pending_Sector  0x0022  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0008  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x000a  200  200  000  Old_age  Always  -  0

DA3
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000b  100  100  016  Pre-fail  Always  -  0
  2 Throughput_Performance  0x0005  133  133  054  Pre-fail  Offline  -  101
  3 Spin_Up_Time  0x0007  118  118  024  Pre-fail  Always  -  616 (Average 612)
  4 Start_Stop_Count  0x0012  100  100  000  Old_age  Always  -  176
  5 Reallocated_Sector_Ct  0x0033  100  100  005  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x000b  100  100  067  Pre-fail  Always  -  0
  8 Seek_Time_Performance  0x0005  121  121  020  Pre-fail  Offline  -  35
  9 Power_On_Hours  0x0012  099  099  000  Old_age  Always  -  11352
10 Spin_Retry_Count  0x0013  100  100  060  Pre-fail  Always  -  0
12 Power_Cycle_Count  0x0032  100  100  000  Old_age  Always  -  62
192 Power-Off_Retract_Count 0x0032  100  100  000  Old_age  Always  -  564
193 Load_Cycle_Count  0x0012  100  100  000  Old_age  Always  -  564
194 Temperature_Celsius  0x0002  181  181  000  Old_age  Always  -  33 (Min/Max 16/44)
196 Reallocated_Event_Count 0x0032  100  100  000  Old_age  Always  -  0
197 Current_Pending_Sector  0x0022  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0008  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x000a  200  200  000  Old_age  Always  -  0

DA4
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000b  100  100  016  Pre-fail  Always  -  0
  2 Throughput_Performance  0x0005  133  133  054  Pre-fail  Offline  -  101
  3 Spin_Up_Time  0x0007  117  117  024  Pre-fail  Always  -  618 (Average 614)
  4 Start_Stop_Count  0x0012  100  100  000  Old_age  Always  -  772
  5 Reallocated_Sector_Ct  0x0033  100  100  005  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x000b  100  100  067  Pre-fail  Always  -  0
  8 Seek_Time_Performance  0x0005  121  121  020  Pre-fail  Offline  -  35
  9 Power_On_Hours  0x0012  099  099  000  Old_age  Always  -  11638
10 Spin_Retry_Count  0x0013  100  100  060  Pre-fail  Always  -  0
12 Power_Cycle_Count  0x0032  100  100  000  Old_age  Always  -  83
192 Power-Off_Retract_Count 0x0032  100  100  000  Old_age  Always  -  1019
193 Load_Cycle_Count  0x0012  100  100  000  Old_age  Always  -  1019
194 Temperature_Celsius  0x0002  193  193  000  Old_age  Always  -  31 (Min/Max 15/55)
196 Reallocated_Event_Count 0x0032  100  100  000  Old_age  Always  -  0
197 Current_Pending_Sector  0x0022  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0008  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x000a  200  200  000  Old_age  Always  -  0

DA6
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000b  100  100  016  Pre-fail  Always  -  65536
  2 Throughput_Performance  0x0005  100  100  054  Pre-fail  Offline  -  0
  3 Spin_Up_Time  0x0007  132  132  024  Pre-fail  Always  -  623 (Average 472)
  4 Start_Stop_Count  0x0012  099  099  000  Old_age  Always  -  4745
  5 Reallocated_Sector_Ct  0x0033  100  100  005  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x000b  100  100  067  Pre-fail  Always  -  0
  8 Seek_Time_Performance  0x0005  100  100  020  Pre-fail  Offline  -  0
  9 Power_On_Hours  0x0012  095  095  000  Old_age  Always  -  37051
10 Spin_Retry_Count  0x0013  100  100  060  Pre-fail  Always  -  0
12 Power_Cycle_Count  0x0032  100  100  000  Old_age  Always  -  187
192 Power-Off_Retract_Count 0x0032  096  096  000  Old_age  Always  -  5806
193 Load_Cycle_Count  0x0012  096  096  000  Old_age  Always  -  5806
194 Temperature_Celsius  0x0002  157  157  000  Old_age  Always  -  38 (Min/Max 15/46)
196 Reallocated_Event_Count 0x0032  100  100  000  Old_age  Always  -  0
197 Current_Pending_Sector  0x0022  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0008  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x000a  200  200  000  Old_age  Always  -  0

DA7
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000b  100  100  016  Pre-fail  Always  -  0
  2 Throughput_Performance  0x0005  132  132  054  Pre-fail  Offline  -  103
  3 Spin_Up_Time  0x0007  117  117  024  Pre-fail  Always  -  617 (Average 619)
  4 Start_Stop_Count  0x0012  100  100  000  Old_age  Always  -  239
  5 Reallocated_Sector_Ct  0x0033  100  100  005  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x000b  100  100  067  Pre-fail  Always  -  0
  8 Seek_Time_Performance  0x0005  121  121  020  Pre-fail  Offline  -  35
  9 Power_On_Hours  0x0012  099  099  000  Old_age  Always  -  12103
10 Spin_Retry_Count  0x0013  100  100  060  Pre-fail  Always  -  0
12 Power_Cycle_Count  0x0032  100  100  000  Old_age  Always  -  88
192 Power-Off_Retract_Count 0x0032  100  100  000  Old_age  Always  -  626
193 Load_Cycle_Count  0x0012  100  100  000  Old_age  Always  -  626
194 Temperature_Celsius  0x0002  181  181  000  Old_age  Always  -  33 (Min/Max 16/57)
196 Reallocated_Event_Count 0x0032  100  100  000  Old_age  Always  -  0
197 Current_Pending_Sector  0x0022  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0008  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x000a  200  200  000  Old_age  Always  -  0

DA9
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000b  100  100  016  Pre-fail  Always  -  0
  2 Throughput_Performance  0x0005  100  100  054  Pre-fail  Offline  -  0
  3 Spin_Up_Time  0x0007  100  100  024  Pre-fail  Always  -  470
  4 Start_Stop_Count  0x0012  099  099  000  Old_age  Always  -  4740
  5 Reallocated_Sector_Ct  0x0033  100  100  005  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x000b  100  100  067  Pre-fail  Always  -  0
  8 Seek_Time_Performance  0x0005  100  100  020  Pre-fail  Offline  -  0
  9 Power_On_Hours  0x0012  095  095  000  Old_age  Always  -  37099
10 Spin_Retry_Count  0x0013  100  100  060  Pre-fail  Always  -  0
12 Power_Cycle_Count  0x0032  100  100  000  Old_age  Always  -  180
192 Power-Off_Retract_Count 0x0032  096  096  000  Old_age  Always  -  5815
193 Load_Cycle_Count  0x0012  096  096  000  Old_age  Always  -  5815
194 Temperature_Celsius  0x0002  166  166  000  Old_age  Always  -  36 (Min/Max 15/44)
196 Reallocated_Event_Count 0x0032  100  100  000  Old_age  Always  -  0
197 Current_Pending_Sector  0x0022  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0008  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x000a  200  200  000  Old_age  Always  -  0


So 5 of them have about 35k hours on them and 2 are around 12k hours. It is interesting to note that they all have 0 for the Raw_Read_Error_rate, except for one, which has a value of 65,536. This value is still way less than any of the 1TB Seagates. That said, being that none of the other Deskstar drives a value here, I'll be sure to keep a close watch on this one.

The last 2 drives in this VDev set are these guys:

=== START OF INFORMATION SECTION ===
Model Family: Seagate NAS HDD
Device Model: ST2000VN000-1H3164
Firmware Version: SC42
User Capacity: 2,000,398,934,016 bytes [2.00 TB]
Sector Sizes: 512 bytes logical, 4096 bytes physical
Rotation Rate: 5900 rpm
Form Factor: 3.5 inches
Device is: In smartctl database [for details use: -P show]
ATA Version is: ACS-2, ACS-3 T13/2161-D revision 3b
SATA Version is: SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is: Sun May 10 05:21:48 2015 EDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

And the results after testing are as follows:

Code:
DA5
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000f  114  099  006  Pre-fail  Always  -  77369760
  3 Spin_Up_Time  0x0003  095  095  000  Pre-fail  Always  -  0
  4 Start_Stop_Count  0x0032  100  100  020  Old_age  Always  -  265
  5 Reallocated_Sector_Ct  0x0033  100  100  010  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x000f  071  060  030  Pre-fail  Always  -  14205777
  9 Power_On_Hours  0x0032  088  088  000  Old_age  Always  -  10532
10 Spin_Retry_Count  0x0013  100  100  097  Pre-fail  Always  -  0
12 Power_Cycle_Count  0x0032  100  100  020  Old_age  Always  -  96
184 End-to-End_Error  0x0032  100  100  099  Old_age  Always  -  0
187 Reported_Uncorrect  0x0032  100  100  000  Old_age  Always  -  0
188 Command_Timeout  0x0032  100  100  000  Old_age  Always  -  0
189 High_Fly_Writes  0x003a  039  039  000  Old_age  Always  -  61
190 Airflow_Temperature_Cel 0x0022  071  055  045  Old_age  Always  -  29 (Min/Max 27/32)
191 G-Sense_Error_Rate  0x0032  100  100  000  Old_age  Always  -  0
192 Power-Off_Retract_Count 0x0032  100  100  000  Old_age  Always  -  86
193 Load_Cycle_Count  0x0032  100  100  000  Old_age  Always  -  265
194 Temperature_Celsius  0x0022  029  045  000  Old_age  Always  -  29 (0 14 0 0 0)
197 Current_Pending_Sector  0x0012  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0010  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x003e  200  200  000  Old_age  Always  -  0

DA8
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME  FLAG  VALUE WORST THRESH TYPE  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate  0x000f  107  099  006  Pre-fail  Always  -  14196232
  3 Spin_Up_Time  0x0003  095  095  000  Pre-fail  Always  -  0
  4 Start_Stop_Count  0x0032  100  100  020  Old_age  Always  -  96
  5 Reallocated_Sector_Ct  0x0033  100  100  010  Pre-fail  Always  -  0
  7 Seek_Error_Rate  0x000f  069  060  030  Pre-fail  Always  -  8961338
  9 Power_On_Hours  0x0032  092  092  000  Old_age  Always  -  7200
10 Spin_Retry_Count  0x0013  100  100  097  Pre-fail  Always  -  0
12 Power_Cycle_Count  0x0032  100  100  020  Old_age  Always  -  62
184 End-to-End_Error  0x0032  100  100  099  Old_age  Always  -  0
187 Reported_Uncorrect  0x0032  100  100  000  Old_age  Always  -  0
188 Command_Timeout  0x0032  100  100  000  Old_age  Always  -  0
189 High_Fly_Writes  0x003a  001  001  000  Old_age  Always  -  175
190 Airflow_Temperature_Cel 0x0022  069  060  045  Old_age  Always  -  31 (Min/Max 28/33)
191 G-Sense_Error_Rate  0x0032  100  100  000  Old_age  Always  -  0
192 Power-Off_Retract_Count 0x0032  100  100  000  Old_age  Always  -  62
193 Load_Cycle_Count  0x0032  100  100  000  Old_age  Always  -  96
194 Temperature_Celsius  0x0022  031  040  000  Old_age  Always  -  31 (0 17 0 0 0)
197 Current_Pending_Sector  0x0012  100  100  000  Old_age  Always  -  0
198 Offline_Uncorrectable  0x0010  100  100  000  Old_age  Offline  -  0
199 UDMA_CRC_Error_Count  0x003e  200  200  000  Old_age  Always  -  0


So these Seagates are relatively new compared to the 1TB with between 7200 and 10.5K hours on them. But both have high Raw_Read_Error_Rate and Seek_Error_Rate counts, so that must be a Seagate thing.
 

Bidule0hm

Server Electronics Sorcerer
Joined
Aug 5, 2013
Messages
3,710
Raw_Read_Error_Rate and Seek_Error_Rate aren't directly human readable, see this page to understand how to read them ;)
 
Last edited:

pclausen

Patron
Joined
Apr 19, 2015
Messages
267
Thanks. Very interesting read!
 

SirMaster

Patron
Joined
Mar 19, 2014
Messages
241
For me reallocated sectors is more about if it keeps increasing rather than the absolute number.

I have a drive that I still use that one day suddenly got about 300 reallocated sectors. This was a single event and the drive has been used for 2 years since and the number has never increased since. 300 aws not enough to cause the VALUE to cross the THRESH.

If the reallocated sectors kept increasing every so often, then I would replace the drive because something is clearly wrong with it. If it was just an isolated incident then the drive is able to remap those sectors like it's supposed to and continue functioning. If it happens in even just 2 separate incidents then it's pretty suspicious to me.
 

Bidule0hm

Server Electronics Sorcerer
Joined
Aug 5, 2013
Messages
3,710
Reallocated sectors are just a small part of what you should watch. There's also the pending sectors and the uncorrectable sectors plus a few others attributes that are important ;)
 
Status
Not open for further replies.
Top