Unreadable Pending Sectors but HDD tests O.K.

Status
Not open for further replies.

shan81

Dabbler
Joined
Oct 11, 2011
Messages
21
Hi Everybody,
I'm having a problem with a drive in my array (4 X 4TB) and would like some advice about how to clear the error. The drive in question is HSGT Deskstar 7k4000 4TB Sata-3 Hard-Drive.
The Error: Some time ago, I had the unfortunate error of:
"Device: /dev/ada3 112 Currently unreadable (pending) sectors"
To correct this problem, I started following the guide detailed here: http://www.freebsddiary.org/smart-fixing-bad-sector.php
In paragraph four, where the guide says to check the results of the long off-line test for the first unreadable LBA sector - I do not get this error. The test completes (after many hours) without any errors. This means that I don't know where to try and correct the unreadable sector. I'm also unsure how to clear the error messages that keep appearing in my log.
I've thought about removing the drive from the pool and running the HGST - 'Windows Drive Fitness Tool' with the hopes that it may find something. Alternatively, could I organise an R.M.A as drive as it's still under warranty?
What would you recommend in this situation?
A copy my logs is detailed below and all data has been previously backed up to an external source.
Thanks!
[root@freenas] ~# smartctl -A /dev/ada3
smartctl 6.2 2013-07-26 r3841 [FreeBSD 9.2-RELEASE-p4 amd64] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_ FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 0
2 Throughput_Performance 0x0005 136 136 054 Pre-fail Offline - 81
3 Spin_Up_Time 0x0007 126 126 024 Pre-fail Always - 612 (Average 615)
4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 503
5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 5
7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0
8 Seek_Time_Performance 0x0005 114 114 020 Pre-fail Offline - 37
9 Power_On_Hours 0x0012 100 100 000 Old_age Always - 1857
10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 257
192 Power-Off_Retract_Count 0x0032 093 093 000 Old_age Always - 8841
193 Load_Cycle_Count 0x0012 093 093 000 Old_age Always - 8841
194 Temperature_Celsius 0x0002 176 176 000 Old_age Always - 34 (Min/Max 14/60)
196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 5
197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 112
198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0

[root@freenas] ~# smartctl -t short /dev/ada3
smartctl 6.2 2013-07-26 r3841 [FreeBSD 9.2-RELEASE-p4 amd64] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION ===
Sending command: "Execute SMART Short self-test routine immediately in off-line mode".
Drive command "Execute SMART Short self-test routine immediately in off-line mode" successful.
Testing has begun.
Please wait 1 minutes for test to complete.
Test will complete after Wed May 21 18:54:57 2014

Use smartctl -X to abort test.
[root@freenas] ~# smartctl -l selftest /dev/ada3
smartctl 6.2 2013-07-26 r3841 [FreeBSD 9.2-RELEASE-p4 amd64] (local build)
Copyright (C) 2002-13, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF READ SMART DATA SECTION ===
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Short offline Completed without error 00% 1857 -
# 2 Extended offline Completed without error 00% 1307 -
# 3 Short offline Completed without error 00% 1298 -
# 4 Extended offline Completed without error 00% 440 -
# 5 Extended offline Completed without error 00% 429 -

[root@freenas] ~#
 

Yatti420

Wizard
Joined
Aug 12, 2012
Messages
1,437
Can you execute smart tests again? Attempting to fix SMART alerts is a bad idea.. Your drive hours are so low that these weren’t even burned in.. You could have more drive flakeyness or failures resulting in pool failure.. This is why I didn't use Z1..
 

cyberjock

Inactive Account
Joined
Mar 25, 2012
Messages
19,526
You shouldn't continue to try to use that drive. It's got less than 2000 hours on it and is already failing. RMA the thing if you can or replace it. That's a dud drive.

In any event, are you really going to tell me you're going to go one by one and fix every single bad LBA address for 212 entries? You'd literally spend days or weeks doing that....
 

shan81

Dabbler
Joined
Oct 11, 2011
Messages
21
Hi Yatti420 & Cyberjock,

I'll try and RMA the faulty drive.

Should I include a copy of smartd result for proof of the drive failing?
 

toadman

Guru
Joined
Jun 4, 2013
Messages
619
While I've never RMAed an HGST drive, I'm sure if you just state "SMART reporting unreadable sectors" you're good to go. I've never had to give more info than that on a drive RMA. But if you have the smart test results handy I'm sure you can cut/paste into the RMA as a bonus.
 

shan81

Dabbler
Joined
Oct 11, 2011
Messages
21
Hello Everybody,

I sent the drive back to HGST and a new drive was returned in under a week.

Very quick and efficient service.

Cheers!
 

Ericloewe

Server Wrangler
Moderator
Joined
Feb 15, 2014
Messages
20,194
Hello Everybody,

I sent the drive back to HGST and a new drive was returned in under a week.

Very quick and efficient service.

Cheers!

Was it a new drive or some dubious refurb?
 

shan81

Dabbler
Joined
Oct 11, 2011
Messages
21
Was it a new drive or some dubious refurb?

As far as I can tell, the drive was brand-new. It came sealed in silver anti-static bag but it did not come with the retail packaging. I will check the smartd information and confirm.
 
Status
Not open for further replies.
Top