RMA or force reallocation

Status
Not open for further replies.

ShimadaRiku

Contributor
Joined
Aug 28, 2015
Messages
104
smart long selftest had "read failure 90% remaining" in one of my mirrored drive, but reallocated sector is still zero.

Should I be trying to RMA or force reallocation by offline the disk then re-write with

Code:
dd if=/dev/zero of=/dev/da1 conv=sync bs=4096 count=1 seek=4835180


Code:
Welcome to FreeNAS
[root@freenas] ~# smartctl -x /dev/da1
smartctl 6.3 2014-07-26 r3976 [FreeBSD 9.3-RELEASE-p26 amd64] (local build)
Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Green
Device Model:     WDC WD40EZRX-00SPEB0
Serial Number:    WD-WCC4EDHUHY0L
LU WWN Device Id: 5 0014ee 260468266
Firmware Version: 80.00A80
User Capacity:    4,000,787,030,016 bytes [4.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Sun Nov  1 06:28:52 2015 PST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:   Unavailable
APM feature is:   Unavailable
Rd look-ahead is: Enabled
Write cache is:   Enabled
ATA Security is:  Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      ( 121) The previous self-test completed having
                                        the read element of the test failed.
Total time to complete Offline
data collection:                (54480) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 545) minutes.
Conveyance self-test routine
recommended polling time:        (   5) minutes.
SCT capabilities:              (0x7035) SCT Status supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate     POSR-K   200   200   051    -    0
  3 Spin_Up_Time            POS--K   179   179   021    -    8016
  4 Start_Stop_Count        -O--CK   098   098   000    -    2603
  5 Reallocated_Sector_Ct   PO--CK   200   200   140    -    0
  7 Seek_Error_Rate         -OSR-K   200   200   000    -    0
  9 Power_On_Hours          -O--CK   093   093   000    -    5725
10 Spin_Retry_Count        -O--CK   100   100   000    -    0
11 Calibration_Retry_Count -O--CK   100   253   000    -    0
12 Power_Cycle_Count       -O--CK   100   100   000    -    81
192 Power-Off_Retract_Count -O--CK   200   200   000    -    35
193 Load_Cycle_Count        -O--CK   154   154   000    -    138531
194 Temperature_Celsius     -O---K   120   108   000    -    32
196 Reallocated_Event_Count -O--CK   200   200   000    -    0
197 Current_Pending_Sector  -O--CK   200   200   000    -    0
198 Offline_Uncorrectable   ----CK   200   200   000    -    0
199 UDMA_CRC_Error_Count    -O--CK   200   200   000    -    0
200 Multi_Zone_Error_Rate   ---R--   200   200   000    -    1
                            ||||||_ K auto-keep
                            |||||__ C event count
                            ||||___ R error rate
                            |||____ S speed/performance
                            ||_____ O updated online
                            |______ P prefailure warning

General Purpose Log Directory Version 1
SMART           Log Directory Version 1 [multi-sector log support]
Address    Access  R/W   Size  Description
0x00       GPL,SL  R/O      1  Log Directory
0x01           SL  R/O      1  Summary SMART error log
0x02           SL  R/O      5  Comprehensive SMART error log
0x03       GPL     R/O      6  Ext. Comprehensive SMART error log
0x06           SL  R/O      1  SMART self-test log
0x07       GPL     R/O      1  Extended self-test log
0x09           SL  R/W      1  Selective self-test log
0x10       GPL     R/O      1  NCQ Command Error log
0x11       GPL     R/O      1  SATA Phy Event Counters
0x80-0x9f  GPL,SL  R/W     16  Host vendor specific log
0xa0-0xa7  GPL,SL  VS      16  Device vendor specific log
0xa8-0xb6  GPL,SL  VS       1  Device vendor specific log
0xb7       GPL,SL  VS      39  Device vendor specific log
0xbd       GPL,SL  VS       1  Device vendor specific log
0xc0       GPL,SL  VS       1  Device vendor specific log
0xc1       GPL     VS      93  Device vendor specific log
0xe0       GPL,SL  R/W      1  SCT Command/Status
0xe1       GPL,SL  R/W      1  SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (6 sectors)
No Errors Logged

SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: read failure       90%      5724         38681440
# 2  Extended offline    Completed: read failure       90%      5722         38688272

SMART Selective self-test log data structure revision number 1
SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:                  3
SCT Version (vendor specific):       258 (0x0102)
SCT Support Level:                   1
Device State:                        Active (0)
Current Temperature:                    32 Celsius
Power Cycle Min/Max Temperature:     30/34 Celsius
Lifetime    Min/Max Temperature:      3/44 Celsius
Under/Over Temperature Limit Count:   0/0

SCT Temperature History Version:     2
Temperature Sampling Period:         1 minute
Temperature Logging Interval:        1 minute
Min/Max recommended Temperature:      0/60 Celsius
Min/Max Temperature Limit:           -41/85 Celsius
Temperature History Size (Index):    478 (349)

Index    Estimated Time   Temperature Celsius
350    2015-10-31 23:31    30  ***********
...    ..( 30 skipped).    ..  ***********
381    2015-11-01 00:02    30  ***********
382    2015-11-01 00:03    31  ************
...    ..( 10 skipped).    ..  ************
393    2015-11-01 00:14    31  ************
394    2015-11-01 00:15    32  *************
...    ..( 10 skipped).    ..  *************
405    2015-11-01 00:26    32  *************
406    2015-11-01 00:27    31  ************
407    2015-11-01 00:28    32  *************
408    2015-11-01 00:29    31  ************
...    ..(  4 skipped).    ..  ************
413    2015-11-01 00:34    31  ************
414    2015-11-01 00:35    32  *************
...    ..( 16 skipped).    ..  *************
431    2015-11-01 00:52    32  *************
432    2015-11-01 00:53    31  ************
...    ..( 29 skipped).    ..  ************
462    2015-11-01 01:23    31  ************
463    2015-11-01 01:24    32  *************
...    ..( 16 skipped).    ..  *************
   2    2015-11-01 01:41    32  *************
   3    2015-11-01 01:42    31  ************
...    ..( 18 skipped).    ..  ************
  22    2015-11-01 01:01    31  ************
  23    2015-11-01 01:02    30  ***********
...    ..(  2 skipped).    ..  ***********
  26    2015-11-01 01:05    30  ***********
  27    2015-11-01 01:06    31  ************
...    ..( 11 skipped).    ..  ************
  39    2015-11-01 01:18    31  ************
  40    2015-11-01 01:19    32  *************
...    ..(  4 skipped).    ..  *************
  45    2015-11-01 01:24    32  *************
  46    2015-11-01 01:25    31  ************
...    ..(  9 skipped).    ..  ************
  56    2015-11-01 01:35    31  ************
  57    2015-11-01 01:36    32  *************
...    ..(  4 skipped).    ..  *************
  62    2015-11-01 01:41    32  *************
  63    2015-11-01 01:42    31  ************
...    ..( 16 skipped).    ..  ************
  80    2015-11-01 01:59    31  ************
  81    2015-11-01 02:00    30  ***********
  82    2015-11-01 02:01    30  ***********
  83    2015-11-01 02:02    31  ************
...    ..(  8 skipped).    ..  ************
  92    2015-11-01 02:11    31  ************
  93    2015-11-01 02:12    32  *************
...    ..(113 skipped).    ..  *************
207    2015-11-01 04:06    32  *************
208    2015-11-01 04:07    31  ************
209    2015-11-01 04:08    32  *************
...    ..(  8 skipped).    ..  *************
218    2015-11-01 04:17    32  *************
219    2015-11-01 04:18    31  ************
...    ..( 13 skipped).    ..  ************
233    2015-11-01 04:32    31  ************
234    2015-11-01 04:33    30  ***********
235    2015-11-01 04:34    30  ***********
236    2015-11-01 04:35    31  ************
...    ..(  9 skipped).    ..  ************
246    2015-11-01 04:45    31  ************
247    2015-11-01 04:46    30  ***********
...    ..(101 skipped).    ..  ***********
349    2015-11-01 06:28    30  ***********

SCT Error Recovery Control command not supported

Device Statistics (GP Log 0x04) not supported

SATA Phy Event Counters (GP Log 0x11)
ID      Size     Value  Description
0x0001  2            0  Command failed due to ICRC error
0x0002  2            0  R_ERR response for data FIS
0x0003  2            0  R_ERR response for device-to-host data FIS
0x0004  2            0  R_ERR response for host-to-device data FIS
0x0005  2            0  R_ERR response for non-data FIS
0x0006  2            0  R_ERR response for device-to-host non-data FIS
0x0007  2            0  R_ERR response for host-to-device non-data FIS
0x0008  2            0  Device-to-host non-data FIS retries
0x0009  2            3  Transition from drive PhyRdy to drive PhyNRdy
0x000a  2            4  Device-to-host register FISes sent due to a COMRESET
0x000b  2            0  CRC errors within host-to-device FIS
0x000f  2            0  R_ERR response for host-to-device data FIS, CRC
0x0012  2            0  R_ERR response for host-to-device non-data FIS, CRC
0x8000  4       460741  Vendor specific
 
Last edited:

BigDave

FreeNAS Enthusiast
Joined
Oct 6, 2013
Messages
2,479

DrKK

FreeNAS Generalissimo
Joined
Oct 15, 2013
Messages
3,630
LCC is high, but the green drives have this issue with the heads parking after
just a few minutes. This can and should be adjusted with the wdidle utility
recommended in this forum post.
https://forums.freenas.org/index.php?threads/hacking-wd-greens-and-reds-with-wdidle3-exe.18171/

Since your drive is under warranty, I'd contact Western Digital and follow their advice about
the possibility of an RMA
Your drive is only 2/3 of a year old. RMA. This is a no-brainer.

Also, as Dave says, and as you can find in somewhere around 2 million posts in the forum, WD Greens are not recommended for NAS because of the way in which they park/load, resulting in unnecessary wear and tear in the form of loading cycles.
 

ShimadaRiku

Contributor
Joined
Aug 28, 2015
Messages
104
Your drive is only 2/3 of a year old. RMA. This is a no-brainer.

Also, as Dave says, and as you can find in somewhere around 2 million posts in the forum, WD Greens are not recommended for NAS because of the way in which they park/load, resulting in unnecessary wear and tear in the form of loading cycles.

Its only been used in a NAS for like 2 weeks with head parkting set to 300 seconds. But, was used as a normal windows desktop drive before that for half a year.

Is having just high LCC enough for WD to honor a warranty? it doesn't have any Reallocated_Sector_Ct or Raw_Read_Error_Rate
 

BigDave

FreeNAS Enthusiast
Joined
Oct 6, 2013
Messages
2,479
Code:
SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: read failure       90%      5724         38681440
# 2  Extended offline    Completed: read failure       90%      5722         38688272

This should get you an RMA ^^^^^^^^^^^^^^
It's like asking a REALLY pretty girl for a date, all they can do is say "no".
Don't be shy :p
 

DrKK

FreeNAS Generalissimo
Joined
Oct 15, 2013
Messages
3,630
Code:
SMART Extended Self-test Log Version: 1 (1 sectors)
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed: read failure       90%      5724         38681440
# 2  Extended offline    Completed: read failure       90%      5722         38688272

This should get you an RMA ^^^^^^^^^^^^^^
It's like asking a REALLY pretty girl for a date, all they can do is say "no".
Don't be shy :p
All the high LCC says is that you used the drive in conditions that were specifically advised against by Western Digital ;)

I would say, as Dave says, if your log shows failed SMART long's, that alone is sufficient for RMA.
 

Ericloewe

Server Wrangler
Moderator
Joined
Feb 15, 2014
Messages
20,194
Should I be trying to RMA or force reallocation by offline the disk then re-write with
To answer this question, for the general case:

A single bad sector doesn't bother me. Trying to reallocate it is pointless, though. Drives will have problems, that's what ZFS is designed to deal with and fix. A manual scrub might be a good idea, but manually messing with the drive directly most certainly isn't.
An RMA probably wouldn't be granted for a single bad sector. In this particular case, the SMART test failed, so it's probably going to be accepted.
 

ShimadaRiku

Contributor
Joined
Aug 28, 2015
Messages
104
To answer this question, for the general case:

A single bad sector doesn't bother me. Trying to reallocate it is pointless, though. Drives will have problems, that's what ZFS is designed to deal with and fix. A manual scrub might be a good idea, but manually messing with the drive directly most certainly isn't.
An RMA probably wouldn't be granted for a single bad sector. In this particular case, the SMART test failed, so it's probably going to be accepted.

Resilvering atm zzzz, since I am going to RMA that drive I'll play around with it for a bit. :D
 

cyberjock

Inactive Account
Joined
Mar 25, 2012
Messages
19,526
Also noteworthy is the fact that you've gotten two failed long tests, and the test stops on the first error. There could be a million bad sectors after the first error, and you won't know. In your case, it's very likely you have at least 2 bad sectors though.... 38681440 and 38688272.

Just do the RMA. :P
 
Status
Not open for further replies.
Top