Replaced drive, FreeNAS reports sector reallocation that's not there

Status
Not open for further replies.
Joined
Feb 22, 2017
Messages
29
I'm a tad confused here.

I recently RMA'd a Seagate 2.5" drive and got the replacement today. I pulled out the WD that was filling in for the bad drive, plopped in the new one, and ran zpool replace on it. It's resilvering now.

While I was eating dinner, my FreeNAS box sent me this:

Code:
Device: /dev/ada3, Failed SMART usage Attribute: 5 Reallocated_Sector_Ct.


The console shows this:

Code:
Mar 21 15:58:49 iscsi notifier: Waiting for PIDS: 2508.
Mar 21 16:24:16 iscsi smartd[29511]: Device: /dev/ada3, Failed SMART usage Attribute: 5 Reallocated_Sector_Ct.
Mar 21 16:24:16 iscsi smartd[29511]: Device: /dev/ada3, Failed SMART usage Attribute: 5 Reallocated_Sector_Ct.
Mar 21 17:24:15 iscsi smartd[29511]: Device: /dev/ada3, Failed SMART usage Attribute: 5 Reallocated_Sector_Ct.
Mar 21 17:24:15 iscsi smartd[29511]: Device: /dev/ada3, Failed SMART usage Attribute: 5 Reallocated_Sector_Ct.
Mar 21 18:24:15 iscsi smartd[29511]: Device: /dev/ada3, Failed SMART usage Attribute: 5 Reallocated_Sector_Ct.
Mar 21 18:24:16 iscsi smartd[29511]: Device: /dev/ada3, Failed SMART usage Attribute: 5 Reallocated_Sector_Ct.
Mar 21 18:27:21 iscsi notifier: Performing sanity check on openssh configuration.


Eh wha? Brand spanky new drive is already failing?

So when I got home, I ran a short SMART test:

Code:
[root@iscsi] ~# smartctl -t short /dev/ada3
smartctl 6.5 2016-05-07 r4318 [FreeBSD 10.3-STABLE amd64] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION ===
Sending command: "Execute SMART Short self-test routine immediately in off-line mode".
Drive command "Execute SMART Short self-test routine immediately in off-line mode" successful.
Testing has begun.
Please wait 1 minutes for test to complete.
Test will complete after Tue Mar 21 18:31:08 2017

Use smartctl -X to abort test.
[root@iscsi] ~# zpool status
  pool: CCTV_iSCSI
state: ONLINE
  scan: scrub repaired 0 in 4h17m with 0 errors on Sun Mar 19 04:17:20 2017
config:

		NAME											STATE	 READ WRITE CKSUM
		CCTV_iSCSI									  ONLINE	   0	 0	 0
		  mirror-0									  ONLINE	   0	 0	 0
			gptid/eec919ff-ea74-11e6-83f6-001517b8e52a  ONLINE	   0	 0	 0
			gptid/efaf0627-ea74-11e6-83f6-001517b8e52a  ONLINE	   0	 0	 0

errors: No known data errors

  pool: Static_iSCSI
state: DEGRADED
status: One or more devices is currently being resilvered.  The pool will
		continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
  scan: resilver in progress since Tue Mar 21 15:57:37 2017
		414G scanned out of 1.15T at 46.2M/s, 4h42m to go
		98.6G resilvered, 35.11% done
config:

		NAME											STATE	 READ WRITE CKSUM
		Static_iSCSI									DEGRADED	 0	 0	 0
		  raidz1-0									  DEGRADED	 0	 0	 0
			gptid/48b69161-e48f-11e6-90db-001517b8e52a  ONLINE	   0	 0	 0
			gptid/49c1a8a3-e48f-11e6-90db-001517b8e52a  ONLINE	   0	 0	 0
			gptid/4ad5bb15-e48f-11e6-90db-001517b8e52a  ONLINE	   0	 0	 0
			replacing-3								 REMOVED	  0	 0	 0
			  5511836415938567715					   REMOVED	  0	 0	 0  was /dev/ada3/old
			  ada3									  ONLINE	   0	 0	 0  (resilvering)

errors: No known data errors

  pool: VM_iSCSI
state: ONLINE
  scan: scrub repaired 0 in 1h49m with 0 errors on Sat Mar 18 01:49:54 2017
config:

		NAME											STATE	 READ WRITE CKSUM
		VM_iSCSI										ONLINE	   0	 0	 0
		  raidz1-0									  ONLINE	   0	 0	 0
			gptid/29d7c4e5-007d-11e7-be90-001517b8e52a  ONLINE	   0	 0	 0
			gptid/2a63e72f-007d-11e7-be90-001517b8e52a  ONLINE	   0	 0	 0
			gptid/2af41408-007d-11e7-be90-001517b8e52a  ONLINE	   0	 0	 0
			gptid/2b97ebe4-007d-11e7-be90-001517b8e52a  ONLINE	   0	 0	 0
		  raidz1-1									  ONLINE	   0	 0	 0
			gptid/2c2b3621-007d-11e7-be90-001517b8e52a  ONLINE	   0	 0	 0
			gptid/2cb8d114-007d-11e7-be90-001517b8e52a  ONLINE	   0	 0	 0
			gptid/2d404114-007d-11e7-be90-001517b8e52a  ONLINE	   0	 0	 0
			gptid/2dd34ade-007d-11e7-be90-001517b8e52a  ONLINE	   0	 0	 0

errors: No known data errors

  pool: freenas-boot
state: ONLINE
  scan: scrub repaired 0 in 0h0m with 0 errors on Sat Mar 18 03:45:07 2017
config:

		NAME		STATE	 READ WRITE CKSUM
		freenas-boot  ONLINE	   0	 0	 0
		  ada4p2	ONLINE	   0	 0	 0

errors: No known data errors
[root@iscsi] ~# smartctl -a /dev/ada3
smartctl 6.5 2016-05-07 r4318 [FreeBSD 10.3-STABLE amd64] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Device Model:	 ST1000LM035-1RK172
Serial Number:	WES195LK
LU WWN Device Id: 5 000c50 09d0b6958
Firmware Version: SBM3
User Capacity:	1,000,204,886,016 bytes [1.00 TB]
Sector Sizes:	 512 bytes logical, 4096 bytes physical
Rotation Rate:	5400 rpm
Form Factor:	  2.5 inches
Device is:		Not in smartctl database [for details use: -P showall]
ATA Version is:   ACS-3 T13/2161-D revision 3b
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:	Tue Mar 21 18:34:01 2017 EDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
										was never started.
										Auto Offline Data Collection: Disabled.
Self-test execution status:	  ( 249) Self-test routine in progress...
										90% of test remaining.
Total time to complete Offline
data collection:				(	0) seconds.
Offline data collection
capabilities:					(0x71) SMART execute Offline immediate.
										No Auto Offline data collection support.
										Suspend Offline collection upon new
										command.
										No Offline surface scan supported.
										Self-test supported.
										Conveyance Self-test supported.
										Selective Self-test supported.
SMART capabilities:			(0x0003) Saves SMART data before entering
										power-saving mode.
										Supports SMART auto save timer.
Error logging capability:		(0x01) Error logging supported.
										General Purpose Logging supported.
Short self-test routine
recommended polling time:		(   1) minutes.
Extended self-test routine
recommended polling time:		( 172) minutes.
Conveyance self-test routine
recommended polling time:		(   2) minutes.
SCT capabilities:			  (0x3035) SCT Status supported.
										SCT Feature Control supported.
										SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME		  FLAG	 VALUE WORST THRESH TYPE	  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate	 0x000f   083   077   006	Pre-fail  Always	   -	   212446258
  3 Spin_Up_Time			0x0003   100   100   000	Pre-fail  Always	   -	   0
  4 Start_Stop_Count		0x0032   100   100   020	Old_age   Always	   -	   1
  5 Reallocated_Sector_Ct   0x0033   100   100   036	Pre-fail  Always	   -	   0
  7 Seek_Error_Rate		 0x000f   100   253   045	Pre-fail  Always	   -	   700594
  9 Power_On_Hours		  0x0032   100   100   000	Old_age   Always	   -	   2 (191 83 0)
10 Spin_Retry_Count		0x0013   100   100   097	Pre-fail  Always	   -	   0
12 Power_Cycle_Count	   0x0032   100   100   020	Old_age   Always	   -	   1
184 End-to-End_Error		0x0032   100   100   099	Old_age   Always	   -	   0
187 Reported_Uncorrect	  0x0032   100   100   000	Old_age   Always	   -	   0
188 Command_Timeout		 0x0032   100   100   000	Old_age   Always	   -	   0
189 High_Fly_Writes		 0x003a   100   100   000	Old_age   Always	   -	   0
190 Airflow_Temperature_Cel 0x0022   074   073   040	Old_age   Always	   -	   26 (Min/Max 23/27)
191 G-Sense_Error_Rate	  0x0032   100   100   000	Old_age   Always	   -	   0
192 Power-Off_Retract_Count 0x0032   100   100   000	Old_age   Always	   -	   0
193 Load_Cycle_Count		0x0032   100   100   000	Old_age   Always	   -	   3
194 Temperature_Celsius	 0x0022   026   040   000	Old_age   Always	   -	   26 (0 23 0 0 0)
197 Current_Pending_Sector  0x0012   100   100   000	Old_age   Always	   -	   0
198 Offline_Uncorrectable   0x0010   100   100   000	Old_age   Offline	  -	   0
199 UDMA_CRC_Error_Count	0x003e   200   200   000	Old_age   Always	   -	   0
240 Head_Flying_Hours	   0x0000   100   253   000	Old_age   Offline	  -	   2 (125 107 0)
241 Total_LBAs_Written	  0x0000   100   253   000	Old_age   Offline	  -	   212435048
242 Total_LBAs_Read		 0x0000   100   253   000	Old_age   Offline	  -	   11210
254 Free_Fall_Sensor		0x0032   100   100   000	Old_age   Always	   -	   0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description	Status				  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline	   Self-test routine in progress 90%		 2		 -

SMART Selective self-test log data structure revision number 1
SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
	1		0		0  Not_testing
	2		0		0  Not_testing
	3		0		0  Not_testing
	4		0		0  Not_testing
	5		0		0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

[root@iscsi] ~#


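smartctl reports no errors: overall health is PASSED, and attribute 5 (Reallocated_Sector_Ct) shows a raw value of 0 and a normalized value of 100 against a threshold of 36. Do sector errors only pop up after a long or offline test?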
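
To cross-check the alert against the attribute table, here's a rough sketch that parses a few rows pasted from the output above and lists any attribute whose normalized VALUE has dropped to or below its THRESH. The rule itself is my assumption about how the "Failed SMART usage Attribute" check works (a nonzero threshold counts as tripped when VALUE <= THRESH). The names parse_attributes and failing_now are made up for this example, and the sample rows have their whitespace normalized.

```python
import re

# A few attribute rows copied from the smartctl -a output above
# (whitespace normalized; the values are unchanged).
SAMPLE = """\
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   083   077   006    Pre-fail  Always       -       212446258
  5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       0
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
"""

# ID, name, flag, VALUE, WORST, THRESH, TYPE, UPDATED, WHEN_FAILED, RAW_VALUE.
ROW = re.compile(
    r"^\s*(\d+)\s+(\S+)\s+0x[0-9a-fA-F]+\s+(\d+)\s+(\d+)\s+(\d+)"
    r"\s+\S+\s+\S+\s+(\S+)\s+(.*)$"
)


def parse_attributes(text):
    """Return one dict per attribute row; lines that don't match are skipped."""
    rows = []
    for line in text.splitlines():
        m = ROW.match(line)
        if m is None:
            continue  # header or unrelated line
        rows.append({
            "id": int(m.group(1)),
            "name": m.group(2),
            "value": int(m.group(3)),
            "worst": int(m.group(4)),
            "thresh": int(m.group(5)),
            "when_failed": m.group(6),
            "raw": m.group(7).strip(),
        })
    return rows


def failing_now(rows):
    """Assumed rule: a nonzero threshold is tripped when VALUE <= THRESH."""
    return [r for r in rows if r["thresh"] > 0 and r["value"] <= r["thresh"]]


rows = parse_attributes(SAMPLE)
attr5 = next(r for r in rows if r["id"] == 5)
print(attr5["value"], attr5["thresh"], attr5["raw"])  # normalized value, threshold, raw count
print([r["name"] for r in failing_now(rows)])         # attributes at or below threshold
```

By that rule nothing in the pasted table is at or below its threshold, attribute 5 included, which is why the alert confuses me. To check the live drive instead of the pasted sample, the text from smartctl -A /dev/ada3 could be passed to parse_attributes in place of SAMPLE.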
 