System Alert - Uncorrectable sectors

Status
Not open for further replies.

BeeoHat

Explorer
Joined
Oct 2, 2015
Messages
63
  • I received this system alert :

  • CRITICAL: Device: /dev/ada1, 158 Offline uncorrectable sectors
It says uncorrectable, so I assume I cannot fix/restore these sectors, but can I isolate them in some way, so that the rest of the volume is accessible and operational?
 

Inxsible

Guru
Joined
Aug 14, 2017
Messages
1,123
You should replace that drive soon depending on your storage configuration.

If you have SMART tests configured to run every so often, it should list what the raw values are. The important parameters to watch are

Offline_Uncorrectable
Current_Pending_Sector
Reallocated_Sector_Ct

Ideally you want them all to be 0. However, if they are atleast below threshold values, you should be fine but you should keep an eye on those numbers and if they ever go up, replace the drive asap. In fact, I would suggest go ahead and buy a new drive if you need to and replace it.
 

BeeoHat

Explorer
Joined
Oct 2, 2015
Messages
63
Just in the process of running SMART tests now but, expecting the worst, I've already ordered a replacement drive. You mentioned threshold values..where would I find these?
 

Inxsible

Guru
Joined
Aug 14, 2017
Messages
1,123
Just in the process of running SMART tests now but, expecting the worst, I've already ordered a replacement drive. You mentioned threshold values..where would I find these?
When you check the status of the drive after the SMART test is complete, the VALUE column is the threshold value and the RAW_VALUE is the actual value.

so if the threshold is at 200 (for eg.) and you have 158 in the RAW_VALUE, SMART will consider the test to pass even though you have 158 sectors that are uncorrectable.

See this thread for more info. That thread talks about hard drive burn in, but the same concept applies to existing drives which start to show signs of failure.
 

BeeoHat

Explorer
Joined
Oct 2, 2015
Messages
63
The following are the results of a short SMART test. which confirms the 158 Offline uncorrectable sectors. The other two parameters you referred to show a RAW value of 0. But all three are below the Threshold value of 200.

So, can I assume my volume is still safe to use, for now, but to keep an eye on any escalating results?
=== START OF READ SMART DATA SECTION ===

SMART Attributes Data Structure revision number: 16

Vendor Specific SMART Attributes with Thresholds:

ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE

1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 30

3 Spin_Up_Time 0x0027 182 179 021 Pre-fail Always - 5866

4 Start_Stop_Count 0x0032 092 092 000 Old_age Always - 8262

5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0

7 Seek_Error_Rate 0x002e 100 253 000 Old_age Always - 0

9 Power_On_Hours 0x0032 081 081 000 Old_age Always - 14528

10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0

11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0

12 Power_Cycle_Count 0x0032 092 092 000 Old_age Always - 8169

192 Power-Off_Retract_Count 0x0032 195 195 000 Old_age Always - 4096

193 Load_Cycle_Count 0x0032 192 192 000 Old_age Always - 24105

194 Temperature_Celsius 0x0022 119 101 000 Old_age Always - 31

196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0

197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0

198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 158

199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 1

200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 219
 

Inxsible

Guru
Joined
Aug 14, 2017
Messages
1,123
You can take that chance. I wouldn't.
 

BeeoHat

Explorer
Joined
Oct 2, 2015
Messages
63
I hear you, so I'll just swap out for the new one when it arrives. Thanks so much for all the help. This forum is great!!
 

Inxsible

Guru
Joined
Aug 14, 2017
Messages
1,123
Status
Not open for further replies.
Top