Unrecoverable Error

Status
Not open for further replies.

Blackout

Cadet
Joined
Jun 29, 2014
Messages
6
This week I got the following error
Code:
Device: /dev/ada4, 8 Currently unreadable (pending) sectors
Device: /dev/ada4, 8 Offline uncorrectable sectors

I assumed this drive was on its way out (its a Seagate and I have had 3 others die this year) so I replaced it.

During the resilvering process, I got another notification for what looks like another drive:
Code:
The volume Backup state is ONLINE: One or more devices has experienced an unrecoverable error. An attempt was made to correct the error. Applications are unaffected.

Any ideas if this drive is failing too?
*Note: I noticed the temp was quite high so I am addressing this already.
Code:
root@freenas:~ # zpool status
  pool: Backup
state: ONLINE
status: One or more devices has experienced an unrecoverable error.  An
		attempt was made to correct the error.  Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
		using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://illumos.org/msg/ZFS-8000-9P
  scan: resilvered 1.19T in 3h4m with 0 errors on Thu Jul 27 02:59:12 2017
config:

		NAME											STATE	 READ WRITE CKSUM
		Backup										  ONLINE	   0	 0	 0
		  raidz1-0									  ONLINE	   0	 0	 0
			gptid/d7ad98e8-0c70-11e4-ac96-6805ca245f8e  ONLINE	   0	 0	 0
			gptid/d7f8f707-0c70-11e4-ac96-6805ca245f8e  ONLINE	   0	 0	 0
			gptid/14179716-727f-11e7-8a5a-6805ca245f8e  ONLINE	   0	 0	 0
		  raidz1-1									  ONLINE	   0	 0	 0
			gptid/d896e8ad-0c70-11e4-ac96-6805ca245f8e  ONLINE	   0	 0	 0
			gptid/e3db0d69-3ce2-11e7-a471-6805ca245f8e  ONLINE	   0	 0	 0
			gptid/e3b82856-f217-11e6-a148-6805ca245f8e  ONLINE	   0	 0	 0
			gptid/d431db24-a38a-11e6-aeac-6805ca245f8e  ONLINE	   0	 0	 0
			gptid/da30fc34-0c70-11e4-ac96-6805ca245f8e  ONLINE	   0	 0	28

errors: No known data errors

  pool: freenas-boot
state: ONLINE
  scan: scrub repaired 0 in 0h5m with 0 errors on Thu Jun 22 03:50:12 2017
config:

		NAME		STATE	 READ WRITE CKSUM
		freenas-boot  ONLINE	   0	 0	 0
		  da0p2	 ONLINE	   0	 0	 0

errors: No known data errors

Confirmed what drive was the issue:
Code:
glabel status | grep gptid/da30fc34-0c70-11e4-ac96-6805ca245f8e
gptid/da30fc34-0c70-11e4-ac96-6805ca245f8e	 N/A  ada7p2

Then grabbed the SMART details:
Code:
smartctl -a /dev/ada7
smartctl 6.5 2016-05-07 r4318 [FreeBSD 11.0-STABLE amd64] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:	 Seagate Barracuda 7200.14 (AF)
Device Model:	 ST3000DM001-1E6166
Serial Number:	W1F3Z6K9
LU WWN Device Id: 5 000c50 06e1d6bda
Firmware Version: SC48
User Capacity:	3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:	 512 bytes logical, 4096 bytes physical
Rotation Rate:	7200 rpm
Form Factor:	  3.5 inches
Device is:		In smartctl database [for details use: -P show]
ATA Version is:   ATA8-ACS T13/1699-D revision 4
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:	Thu Jul 27 11:44:12 2017 EDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
										was completed without error.
										Auto Offline Data Collection: Enabled.
Self-test execution status:	  (   0) The previous self-test routine completed
										without error or no self-test has ever
										been run.
Total time to complete Offline
data collection:				(  575) seconds.
Offline data collection
capabilities:					(0x7b) SMART execute Offline immediate.
										Auto Offline data collection on/off support.
										Suspend Offline collection upon new
										command.
										Offline surface scan supported.
										Self-test supported.
										Conveyance Self-test supported.
										Selective Self-test supported.
SMART capabilities:			(0x0003) Saves SMART data before entering
										power-saving mode.
										Supports SMART auto save timer.
Error logging capability:		(0x01) Error logging supported.
										General Purpose Logging supported.
Short self-test routine
recommended polling time:		(   1) minutes.
Extended self-test routine
recommended polling time:		( 331) minutes.
Conveyance self-test routine
recommended polling time:		(   2) minutes.
SCT capabilities:			  (0x3081) SCT Status supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME		  FLAG	 VALUE WORST THRESH TYPE	  UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate	 0x000f   119   099   006	Pre-fail  Always	   -	   206489376
  3 Spin_Up_Time			0x0003   091   090   000	Pre-fail  Always	   -	   0
  4 Start_Stop_Count		0x0032   100   100   020	Old_age   Always	   -	   67
  5 Reallocated_Sector_Ct   0x0033   100   100   010	Pre-fail  Always	   -	   0
  7 Seek_Error_Rate		 0x000f   055   051   030	Pre-fail  Always	   -	   962168643140
  9 Power_On_Hours		  0x0032   074   074   000	Old_age   Always	   -	   23342
10 Spin_Retry_Count		0x0013   100   100   097	Pre-fail  Always	   -	   0
12 Power_Cycle_Count	   0x0032   100   100   020	Old_age   Always	   -	   67
183 Runtime_Bad_Block	   0x0032   100   100   000	Old_age   Always	   -	   0
184 End-to-End_Error		0x0032   100   100   099	Old_age   Always	   -	   0
187 Reported_Uncorrect	  0x0032   100   100   000	Old_age   Always	   -	   0
188 Command_Timeout		 0x0032   100   099   000	Old_age   Always	   -	   0 0 8
189 High_Fly_Writes		 0x003a   090   090   000	Old_age   Always	   -	   10
190 Airflow_Temperature_Cel 0x0022   057   050   045	Old_age   Always	   -	   43 (Min/Max 38/45)
191 G-Sense_Error_Rate	  0x0032   100   100   000	Old_age   Always	   -	   0
192 Power-Off_Retract_Count 0x0032   100   100   000	Old_age   Always	   -	   34
193 Load_Cycle_Count		0x0032   100   100   000	Old_age   Always	   -	   77
194 Temperature_Celsius	 0x0022   043   050   000	Old_age   Always	   -	   43 (0 18 0 0 0)
197 Current_Pending_Sector  0x0012   100   100   000	Old_age   Always	   -	   0
198 Offline_Uncorrectable   0x0010   100   100   000	Old_age   Offline	  -	   0
199 UDMA_CRC_Error_Count	0x003e   200   116   000	Old_age   Always	   -	   1249
240 Head_Flying_Hours	   0x0000   100   253   000	Old_age   Offline	  -	   23342h+56m+40.973s
241 Total_LBAs_Written	  0x0000   100   253   000	Old_age   Offline	  -	   3877893327
242 Total_LBAs_Read		 0x0000   100   253   000	Old_age   Offline	  -	   57573628944

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description	Status				  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline	   Completed without error	   00%	 23340		 -
# 2  Short offline	   Completed without error	   00%	 23309		 -
# 3  Short offline	   Completed without error	   00%	 23261		 -
# 4  Extended offline	Completed without error	   00%	 23222		 -
# 5  Short offline	   Completed without error	   00%	 23189		 -
# 6  Short offline	   Completed without error	   00%	 23141		 -
# 7  Short offline	   Completed without error	   00%	 23093		 -
# 8  Short offline	   Completed without error	   00%	 22997		 -
# 9  Short offline	   Completed without error	   00%	 22949		 -
#10  Short offline	   Completed without error	   00%	 22901		 -
#11  Extended offline	Completed without error	   00%	 22884		 -
#12  Short offline	   Completed without error	   00%	 22805		 -
#13  Short offline	   Completed without error	   00%	 22744		 -
#14  Short offline	   Completed without error	   00%	 22648		 -
#15  Short offline	   Completed without error	   00%	 22600		 -
#16  Short offline	   Completed without error	   00%	 22552		 -
#17  Extended offline	Completed without error	   00%	 22511		 -
#18  Short offline	   Completed without error	   00%	 22480		 -
#19  Short offline	   Completed without error	   00%	 22432		 -
#20  Short offline	   Completed without error	   00%	 22384		 -
#21  Short offline	   Completed without error	   00%	 22288		 -

SMART Selective self-test log data structure revision number 1
SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
	1		0		0  Not_testing
	2		0		0  Not_testing
	3		0		0  Not_testing
	4		0		0  Not_testing
	5		0		0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
 

m0nkey_

MVP
Joined
Oct 27, 2015
Messages
2,739
The pool looks clean, you can run zpool clear Backup and then re-run the scrub using zpool scrub Backup.
 

Blackout

Cadet
Joined
Jun 29, 2014
Messages
6
results:
Code:
root@freenas:~ # zpool status
  pool: Backup
 state: ONLINE
status: One or more devices has experienced an unrecoverable error.  An
		attempt was made to correct the error.  Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
		using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://illumos.org/msg/ZFS-8000-9P
  scan: scrub repaired 56K in 3h17m with 0 errors on Fri Jul 28 01:42:33 2017
config:

		NAME											STATE	 READ WRITE CKSUM
		Backup										  ONLINE	   0	 0	 0
		  raidz1-0									  ONLINE	   0	 0	 0
			gptid/d7ad98e8-0c70-11e4-ac96-6805ca245f8e  ONLINE	   0	 0	 0
			gptid/d7f8f707-0c70-11e4-ac96-6805ca245f8e  ONLINE	   0	 0	 0
			gptid/14179716-727f-11e7-8a5a-6805ca245f8e  ONLINE	   0	 0	 0
		  raidz1-1									  ONLINE	   0	 0	 0
			gptid/d896e8ad-0c70-11e4-ac96-6805ca245f8e  ONLINE	   0	 0	 0
			gptid/e3db0d69-3ce2-11e7-a471-6805ca245f8e  ONLINE	   0	 0	 0
			gptid/e3b82856-f217-11e6-a148-6805ca245f8e  ONLINE	   0	 0	 0
			gptid/d431db24-a38a-11e6-aeac-6805ca245f8e  ONLINE	   0	 0	 0
			gptid/da30fc34-0c70-11e4-ac96-6805ca245f8e  ONLINE	   0	 0	14

errors: No known data errors

  pool: freenas-boot
 state: ONLINE
  scan: scrub repaired 0 in 0h5m with 0 errors on Thu Jun 22 03:50:12 2017
config:

		NAME		STATE	 READ WRITE CKSUM
		freenas-boot  ONLINE	   0	 0	 0
		  da0p2	 ONLINE	   0	 0	 0

errors: No known data errors
 
Status
Not open for further replies.
Top