The volume pool state is DEGRADED: One or more devices has experienced an unrecoverable error

Status
Not open for further replies.

lnix

Dabbler
Joined
Aug 16, 2014
Messages
29
Hello together,

my zpool has a big issue I think.

Code:
[root@freenas ~]# zpool status -v pool

  pool: pool

 state: DEGRADED

status: One or more devices has experienced an unrecoverable error.  An

	attempt was made to correct the error.  Applications are unaffected.

action: Determine if the device needs to be replaced, and clear the errors

	using 'zpool clear' or replace the device with 'zpool replace'.

   see: http://illumos.org/msg/ZFS-8000-9P

  scan: scrub repaired 6.96M in 0 days 09:33:44 with 0 errors on Sun Aug 12 00:18:58 2018

config:


	NAME											STATE	 READ WRITE CKSUM

	pool											DEGRADED	 0	 0	 2

	  raidz1-0									  DEGRADED	 0	 0	 4

		gptid/1d131aa1-261c-11e4-a0a2-d050992ba901  DEGRADED	 0	 0   121  too many errors

		gptid/1d7cb2d8-261c-11e4-a0a2-d050992ba901  DEGRADED	 0	 0   123  too many errors

		gptid/1de5f709-261c-11e4-a0a2-d050992ba901  DEGRADED	 0	 0   131  too many errors

		gptid/1e542192-261c-11e4-a0a2-d050992ba901  DEGRADED	 0	 0   129  too many errors


errors: No known data errors



I tried a scrub last Sunday but I get the message "too many errors" for my pool.

What can I do ?

My Data:

Version FreeNAS-11.1-U5
Speicher 7856MB ECC RAM

Regards
 

garm

Wizard
Joined
Aug 19, 2017
Messages
1,556
It’s unlikely all four drives start vomiting checksum errors all together. But to understand what is going on it would be useful to see S.M.A.R.T reports. Any comment based on your post would be wild speculation based on personal experience and very to that degree. Read the rules to get an understanding of the kind of questions you will generate with a vague post like this. They are especially tailored to that.
 

BigDave

FreeNAS Enthusiast
Joined
Oct 6, 2013
Messages
2,479

lnix

Dabbler
Joined
Aug 16, 2014
Messages
29
Yes, I have a backup with rsync to my linux server.

Here are my SMART results:

HD1

Code:
[root@freenas ~]# smartctl -a /dev/ada0

smartctl 6.6 2017-11-05 r4594 [FreeBSD 11.1-STABLE amd64] (local build)

Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org


=== START OF INFORMATION SECTION ===

Model Family:	 Western Digital Red

Device Model:	 WDC WD40EFRX-68WT0N0

Serial Number:	WD-WCC4EDYVY00K

LU WWN Device Id: 5 0014ee 25ff16fe1

Firmware Version: 80.00A80

User Capacity:	4,000,787,030,016 bytes [4.00 TB]

Sector Sizes:	 512 bytes logical, 4096 bytes physical

Rotation Rate:	5400 rpm

Device is:		In smartctl database [for details use: -P show]

ATA Version is:   ACS-2 (minor revision not indicated)

SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 3.0 Gb/s)

Local Time is:	Mon Aug 13 17:55:25 2018 CEST

SMART support is: Available - device has SMART capability.

SMART support is: Enabled


=== START OF READ SMART DATA SECTION ===

SMART overall-health self-assessment test result: PASSED


General SMART Values:

Offline data collection status:  (0x00)	Offline data collection activity

					was never started.

					Auto Offline Data Collection: Disabled.

Self-test execution status:	  (   0)	The previous self-test routine completed

					without error or no self-test has ever

					been run.

Total time to complete Offline

data collection:		 (50880) seconds.

Offline data collection

capabilities:			 (0x7b) SMART execute Offline immediate.

					Auto Offline data collection on/off support.

					Suspend Offline collection upon new

					command.

					Offline surface scan supported.

					Self-test supported.

					Conveyance Self-test supported.

					Selective Self-test supported.

SMART capabilities:			(0x0003)	Saves SMART data before entering

					power-saving mode.

					Supports SMART auto save timer.

Error logging capability:		(0x01)	Error logging supported.

					General Purpose Logging supported.

Short self-test routine

recommended polling time:	 (   2) minutes.

Extended self-test routine

recommended polling time:	 ( 509) minutes.

Conveyance self-test routine

recommended polling time:	 (   5) minutes.

SCT capabilities:		   (0x703d)	SCT Status supported.

					SCT Error Recovery Control supported.

					SCT Feature Control supported.

					SCT Data Table supported.


SMART Attributes Data Structure revision number: 16

Vendor Specific SMART Attributes with Thresholds:

ID# ATTRIBUTE_NAME		  FLAG	 VALUE WORST THRESH TYPE	  UPDATED  WHEN_FAILED RAW_VALUE

  1 Raw_Read_Error_Rate	 0x002f   200   200   051	Pre-fail  Always	   -	   0

  3 Spin_Up_Time			0x0027   202   176   021	Pre-fail  Always	   -	   6900

  4 Start_Stop_Count		0x0032   100   100   000	Old_age   Always	   -	   711

  5 Reallocated_Sector_Ct   0x0033   200   200   140	Pre-fail  Always	   -	   0

  7 Seek_Error_Rate		 0x002e   200   200   000	Old_age   Always	   -	   0

  9 Power_On_Hours		  0x0032   055   055   000	Old_age   Always	   -	   33556

 10 Spin_Retry_Count		0x0032   100   100   000	Old_age   Always	   -	   0

 11 Calibration_Retry_Count 0x0032   100   253   000	Old_age   Always	   -	   0

 12 Power_Cycle_Count	   0x0032   100   100   000	Old_age   Always	   -	   81

192 Power-Off_Retract_Count 0x0032   200   200   000	Old_age   Always	   -	   41

193 Load_Cycle_Count		0x0032   197   197   000	Old_age   Always	   -	   11841

194 Temperature_Celsius	 0x0022   114   107   000	Old_age   Always	   -	   38

196 Reallocated_Event_Count 0x0032   200   200   000	Old_age   Always	   -	   0

197 Current_Pending_Sector  0x0032   200   200   000	Old_age   Always	   -	   0

198 Offline_Uncorrectable   0x0030   100   253   000	Old_age   Offline	  -	   0

199 UDMA_CRC_Error_Count	0x0032   200   200   000	Old_age   Always	   -	   0

200 Multi_Zone_Error_Rate   0x0008   200   200   000	Old_age   Offline	  -	   0


SMART Error Log Version: 1

No Errors Logged


SMART Self-test log structure revision number 1

Num  Test_Description	Status				  Remaining  LifeTime(hours)  LBA_of_first_error

# 1  Short offline	   Completed without error	   00%	 33555		 -

# 2  Short offline	   Completed without error	   00%	 33499		 -

# 3  Extended offline	Completed without error	   00%	 33265		 -

# 4  Short offline	   Completed without error	   00%	 33254		 -

# 5  Short offline	   Completed without error	   00%	 33230		 -

# 6  Short offline	   Completed without error	   00%	 32991		 -

# 7  Short offline	   Completed without error	   00%	 32751		 -

# 8  Short offline	   Completed without error	   00%	 32543		 -

# 9  Extended offline	Completed without error	   00%	 32286		 -

#10  Short offline	   Completed without error	   00%	 32275		 -

#11  Short offline	   Completed without error	   00%	 32253		 -

#12  Short offline	   Completed without error	   00%	 32012		 -

#13  Short offline	   Completed without error	   00%	 31772		 -

#14  Extended offline	Interrupted (host reset)	  70%	 31536		 -

#15  Short offline	   Completed without error	   00%	 31532		 -

#16  Short offline	   Completed without error	   00%	 31292		 -

#17  Short offline	   Completed without error	   00%	 31053		 -

#18  Extended offline	Completed without error	   00%	 30824		 -

#19  Short offline	   Completed without error	   00%	 30813		 -

#20  Short offline	   Completed without error	   00%	 30789		 -

#21  Short offline	   Completed without error	   00%	 30550		 -


SMART Selective self-test log data structure revision number 1

 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS

	1		0		0  Not_testing

	2		0		0  Not_testing

	3		0		0  Not_testing

	4		0		0  Not_testing

	5		0		0  Not_testing

Selective self-test flags (0x0):

  After scanning selected spans, do NOT read-scan remainder of disk.

If Selective self-test is pending on power-up, resume after 0 minute delay.



HD2
Code:
[root@freenas ~]# smartctl -a /dev/ada1

smartctl 6.6 2017-11-05 r4594 [FreeBSD 11.1-STABLE amd64] (local build)

Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org


=== START OF INFORMATION SECTION ===

Model Family:	 Western Digital Red

Device Model:	 WDC WD40EFRX-68WT0N0

Serial Number:	WD-WCC4ER84E2KP

LU WWN Device Id: 5 0014ee 20a9c1bda

Firmware Version: 80.00A80

User Capacity:	4,000,787,030,016 bytes [4.00 TB]

Sector Sizes:	 512 bytes logical, 4096 bytes physical

Rotation Rate:	5400 rpm

Device is:		In smartctl database [for details use: -P show]

ATA Version is:   ACS-2 (minor revision not indicated)

SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 3.0 Gb/s)

Local Time is:	Mon Aug 13 17:57:31 2018 CEST

SMART support is: Available - device has SMART capability.

SMART support is: Enabled


=== START OF READ SMART DATA SECTION ===

SMART overall-health self-assessment test result: PASSED


General SMART Values:

Offline data collection status:  (0x00)	Offline data collection activity

					was never started.

					Auto Offline Data Collection: Disabled.

Self-test execution status:	  (   0)	The previous self-test routine completed

					without error or no self-test has ever

					been run.

Total time to complete Offline

data collection:		 (52080) seconds.

Offline data collection

capabilities:			 (0x7b) SMART execute Offline immediate.

					Auto Offline data collection on/off support.

					Suspend Offline collection upon new

					command.

					Offline surface scan supported.

					Self-test supported.

					Conveyance Self-test supported.

					Selective Self-test supported.

SMART capabilities:			(0x0003)	Saves SMART data before entering

					power-saving mode.

					Supports SMART auto save timer.

Error logging capability:		(0x01)	Error logging supported.

					General Purpose Logging supported.

Short self-test routine

recommended polling time:	 (   2) minutes.

Extended self-test routine

recommended polling time:	 ( 521) minutes.

Conveyance self-test routine

recommended polling time:	 (   5) minutes.

SCT capabilities:		   (0x703d)	SCT Status supported.

					SCT Error Recovery Control supported.

					SCT Feature Control supported.

					SCT Data Table supported.


SMART Attributes Data Structure revision number: 16

Vendor Specific SMART Attributes with Thresholds:

ID# ATTRIBUTE_NAME		  FLAG	 VALUE WORST THRESH TYPE	  UPDATED  WHEN_FAILED RAW_VALUE

  1 Raw_Read_Error_Rate	 0x002f   200   200   051	Pre-fail  Always	   -	   0

  3 Spin_Up_Time			0x0027   205   180   021	Pre-fail  Always	   -	   6741

  4 Start_Stop_Count		0x0032   100   100   000	Old_age   Always	   -	   713

  5 Reallocated_Sector_Ct   0x0033   200   200   140	Pre-fail  Always	   -	   0

  7 Seek_Error_Rate		 0x002e   200   200   000	Old_age   Always	   -	   0

  9 Power_On_Hours		  0x0032   055   055   000	Old_age   Always	   -	   33555

 10 Spin_Retry_Count		0x0032   100   100   000	Old_age   Always	   -	   0

 11 Calibration_Retry_Count 0x0032   100   253   000	Old_age   Always	   -	   0

 12 Power_Cycle_Count	   0x0032   100   100   000	Old_age   Always	   -	   82

192 Power-Off_Retract_Count 0x0032   200   200   000	Old_age   Always	   -	   42

193 Load_Cycle_Count		0x0032   197   197   000	Old_age   Always	   -	   11817

194 Temperature_Celsius	 0x0022   114   107   000	Old_age   Always	   -	   38

196 Reallocated_Event_Count 0x0032   200   200   000	Old_age   Always	   -	   0

197 Current_Pending_Sector  0x0032   200   200   000	Old_age   Always	   -	   0

198 Offline_Uncorrectable   0x0030   100   253   000	Old_age   Offline	  -	   0

199 UDMA_CRC_Error_Count	0x0032   200   200   000	Old_age   Always	   -	   0

200 Multi_Zone_Error_Rate   0x0008   200   200   000	Old_age   Offline	  -	   0


SMART Error Log Version: 1

No Errors Logged


SMART Self-test log structure revision number 1

Num  Test_Description	Status				  Remaining  LifeTime(hours)  LBA_of_first_error

# 1  Short offline	   Completed without error	   00%	 33554		 -

# 2  Short offline	   Completed without error	   00%	 33499		 -

# 3  Extended offline	Completed without error	   00%	 33265		 -

# 4  Short offline	   Completed without error	   00%	 33254		 -

# 5  Short offline	   Completed without error	   00%	 33230		 -

# 6  Short offline	   Completed without error	   00%	 32990		 -

# 7  Short offline	   Completed without error	   00%	 32750		 -

# 8  Short offline	   Completed without error	   00%	 32542		 -

# 9  Extended offline	Completed without error	   00%	 32286		 -

#10  Short offline	   Completed without error	   00%	 32275		 -

#11  Short offline	   Completed without error	   00%	 32252		 -

#12  Short offline	   Completed without error	   00%	 32011		 -

#13  Short offline	   Completed without error	   00%	 31771		 -

#14  Extended offline	Completed without error	   00%	 31543		 -

#15  Short offline	   Completed without error	   00%	 31531		 -

#16  Short offline	   Completed without error	   00%	 31292		 -

#17  Short offline	   Completed without error	   00%	 31052		 -

#18  Extended offline	Completed without error	   00%	 30823		 -

#19  Short offline	   Completed without error	   00%	 30812		 -

#20  Short offline	   Completed without error	   00%	 30788		 -

#21  Short offline	   Completed without error	   00%	 30550		 -


SMART Selective self-test log data structure revision number 1

 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS

	1		0		0  Not_testing

	2		0		0  Not_testing

	3		0		0  Not_testing

	4		0		0  Not_testing

	5		0		0  Not_testing

Selective self-test flags (0x0):

  After scanning selected spans, do NOT read-scan remainder of disk.

If Selective self-test is pending on power-up, resume after 0 minute delay.



HD3
Code:
[root@freenas ~]# smartctl -a /dev/ada2

smartctl 6.6 2017-11-05 r4594 [FreeBSD 11.1-STABLE amd64] (local build)

Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org


=== START OF INFORMATION SECTION ===

Model Family:	 Western Digital Red

Device Model:	 WDC WD40EFRX-68WT0N0

Serial Number:	WD-WCC4E48Y5930

LU WWN Device Id: 5 0014ee 20a9c22ca

Firmware Version: 80.00A80

User Capacity:	4,000,787,030,016 bytes [4.00 TB]

Sector Sizes:	 512 bytes logical, 4096 bytes physical

Rotation Rate:	5400 rpm

Device is:		In smartctl database [for details use: -P show]

ATA Version is:   ACS-2 (minor revision not indicated)

SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)

Local Time is:	Mon Aug 13 17:58:01 2018 CEST

SMART support is: Available - device has SMART capability.

SMART support is: Enabled


=== START OF READ SMART DATA SECTION ===

SMART overall-health self-assessment test result: PASSED


General SMART Values:

Offline data collection status:  (0x00)	Offline data collection activity

					was never started.

					Auto Offline Data Collection: Disabled.

Self-test execution status:	  (   0)	The previous self-test routine completed

					without error or no self-test has ever

					been run.

Total time to complete Offline

data collection:		 (52020) seconds.

Offline data collection

capabilities:			 (0x7b) SMART execute Offline immediate.

					Auto Offline data collection on/off support.

					Suspend Offline collection upon new

					command.

					Offline surface scan supported.

					Self-test supported.

					Conveyance Self-test supported.

					Selective Self-test supported.

SMART capabilities:			(0x0003)	Saves SMART data before entering

					power-saving mode.

					Supports SMART auto save timer.

Error logging capability:		(0x01)	Error logging supported.

					General Purpose Logging supported.

Short self-test routine

recommended polling time:	 (   2) minutes.

Extended self-test routine

recommended polling time:	 ( 520) minutes.

Conveyance self-test routine

recommended polling time:	 (   5) minutes.

SCT capabilities:		   (0x703d)	SCT Status supported.

					SCT Error Recovery Control supported.

					SCT Feature Control supported.

					SCT Data Table supported.


SMART Attributes Data Structure revision number: 16

Vendor Specific SMART Attributes with Thresholds:

ID# ATTRIBUTE_NAME		  FLAG	 VALUE WORST THRESH TYPE	  UPDATED  WHEN_FAILED RAW_VALUE

  1 Raw_Read_Error_Rate	 0x002f   200   200   051	Pre-fail  Always	   -	   0

  3 Spin_Up_Time			0x0027   205   178   021	Pre-fail  Always	   -	   6750

  4 Start_Stop_Count		0x0032   100   100   000	Old_age   Always	   -	   713

  5 Reallocated_Sector_Ct   0x0033   200   200   140	Pre-fail  Always	   -	   0

  7 Seek_Error_Rate		 0x002e   200   200   000	Old_age   Always	   -	   0

  9 Power_On_Hours		  0x0032   055   055   000	Old_age   Always	   -	   33555

 10 Spin_Retry_Count		0x0032   100   100   000	Old_age   Always	   -	   0

 11 Calibration_Retry_Count 0x0032   100   253   000	Old_age   Always	   -	   0

 12 Power_Cycle_Count	   0x0032   100   100   000	Old_age   Always	   -	   81

192 Power-Off_Retract_Count 0x0032   200   200   000	Old_age   Always	   -	   41

193 Load_Cycle_Count		0x0032   197   197   000	Old_age   Always	   -	   11804

194 Temperature_Celsius	 0x0022   109   102   000	Old_age   Always	   -	   43

196 Reallocated_Event_Count 0x0032   200   200   000	Old_age   Always	   -	   0

197 Current_Pending_Sector  0x0032   200   200   000	Old_age   Always	   -	   0

198 Offline_Uncorrectable   0x0030   100   253   000	Old_age   Offline	  -	   0

199 UDMA_CRC_Error_Count	0x0032   200   200   000	Old_age   Always	   -	   0

200 Multi_Zone_Error_Rate   0x0008   200   200   000	Old_age   Offline	  -	   0


SMART Error Log Version: 1

No Errors Logged


SMART Self-test log structure revision number 1

Num  Test_Description	Status				  Remaining  LifeTime(hours)  LBA_of_first_error

# 1  Short offline	   Completed without error	   00%	 33555		 -

# 2  Short offline	   Completed without error	   00%	 33498		 -

# 3  Extended offline	Completed without error	   00%	 33265		 -

# 4  Short offline	   Completed without error	   00%	 33254		 -

# 5  Short offline	   Completed without error	   00%	 33230		 -

# 6  Short offline	   Completed without error	   00%	 32990		 -

# 7  Short offline	   Completed without error	   00%	 32750		 -

# 8  Short offline	   Completed without error	   00%	 32542		 -

# 9  Extended offline	Completed without error	   00%	 32286		 -

#10  Short offline	   Completed without error	   00%	 32275		 -

#11  Short offline	   Completed without error	   00%	 32252		 -

#12  Short offline	   Completed without error	   00%	 32011		 -

#13  Short offline	   Completed without error	   00%	 31771		 -

#14  Extended offline	Completed without error	   00%	 31543		 -

#15  Short offline	   Completed without error	   00%	 31531		 -

#16  Short offline	   Completed without error	   00%	 31292		 -

#17  Short offline	   Completed without error	   00%	 31052		 -

#18  Extended offline	Completed without error	   00%	 30823		 -

#19  Short offline	   Completed without error	   00%	 30813		 -

#20  Short offline	   Completed without error	   00%	 30788		 -

#21  Short offline	   Completed without error	   00%	 30550		 -


SMART Selective self-test log data structure revision number 1

 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS

	1		0		0  Not_testing

	2		0		0  Not_testing

	3		0		0  Not_testing

	4		0		0  Not_testing

	5		0		0  Not_testing

Selective self-test flags (0x0):

  After scanning selected spans, do NOT read-scan remainder of disk.

If Selective self-test is pending on power-up, resume after 0 minute delay.



HD4
Code:
[root@freenas ~]# smartctl -a /dev/ada3

smartctl 6.6 2017-11-05 r4594 [FreeBSD 11.1-STABLE amd64] (local build)

Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org


=== START OF INFORMATION SECTION ===

Model Family:	 Western Digital Red

Device Model:	 WDC WD40EFRX-68WT0N0

Serial Number:	WD-WCC4EF87VY7J

LU WWN Device Id: 5 0014ee 20a9c2f9c

Firmware Version: 80.00A80

User Capacity:	4,000,787,030,016 bytes [4.00 TB]

Sector Sizes:	 512 bytes logical, 4096 bytes physical

Rotation Rate:	5400 rpm

Device is:		In smartctl database [for details use: -P show]

ATA Version is:   ACS-2 (minor revision not indicated)

SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)

Local Time is:	Mon Aug 13 17:58:43 2018 CEST

SMART support is: Available - device has SMART capability.

SMART support is: Enabled


=== START OF READ SMART DATA SECTION ===

SMART overall-health self-assessment test result: PASSED


General SMART Values:

Offline data collection status:  (0x00)	Offline data collection activity

					was never started.

					Auto Offline Data Collection: Disabled.

Self-test execution status:	  (   0)	The previous self-test routine completed

					without error or no self-test has ever

					been run.

Total time to complete Offline

data collection:		 (51720) seconds.

Offline data collection

capabilities:			 (0x7b) SMART execute Offline immediate.

					Auto Offline data collection on/off support.

					Suspend Offline collection upon new

					command.

					Offline surface scan supported.

					Self-test supported.

					Conveyance Self-test supported.

					Selective Self-test supported.

SMART capabilities:			(0x0003)	Saves SMART data before entering

					power-saving mode.

					Supports SMART auto save timer.

Error logging capability:		(0x01)	Error logging supported.

					General Purpose Logging supported.

Short self-test routine

recommended polling time:	 (   2) minutes.

Extended self-test routine

recommended polling time:	 ( 517) minutes.

Conveyance self-test routine

recommended polling time:	 (   5) minutes.

SCT capabilities:		   (0x703d)	SCT Status supported.

					SCT Error Recovery Control supported.

					SCT Feature Control supported.

					SCT Data Table supported.


SMART Attributes Data Structure revision number: 16

Vendor Specific SMART Attributes with Thresholds:

ID# ATTRIBUTE_NAME		  FLAG	 VALUE WORST THRESH TYPE	  UPDATED  WHEN_FAILED RAW_VALUE

  1 Raw_Read_Error_Rate	 0x002f   200   200   051	Pre-fail  Always	   -	   0

  3 Spin_Up_Time			0x0027   203   177   021	Pre-fail  Always	   -	   6850

  4 Start_Stop_Count		0x0032   100   100   000	Old_age   Always	   -	   713

  5 Reallocated_Sector_Ct   0x0033   200   200   140	Pre-fail  Always	   -	   0

  7 Seek_Error_Rate		 0x002e   200   200   000	Old_age   Always	   -	   0

  9 Power_On_Hours		  0x0032   055   055   000	Old_age   Always	   -	   33557

 10 Spin_Retry_Count		0x0032   100   100   000	Old_age   Always	   -	   0

 11 Calibration_Retry_Count 0x0032   100   253   000	Old_age   Always	   -	   0

 12 Power_Cycle_Count	   0x0032   100   100   000	Old_age   Always	   -	   82

192 Power-Off_Retract_Count 0x0032   200   200   000	Old_age   Always	   -	   42

193 Load_Cycle_Count		0x0032   197   197   000	Old_age   Always	   -	   11832

194 Temperature_Celsius	 0x0022   114   107   000	Old_age   Always	   -	   38

196 Reallocated_Event_Count 0x0032   200   200   000	Old_age   Always	   -	   0

197 Current_Pending_Sector  0x0032   200   200   000	Old_age   Always	   -	   0

198 Offline_Uncorrectable   0x0030   100   253   000	Old_age   Offline	  -	   0

199 UDMA_CRC_Error_Count	0x0032   200   200   000	Old_age   Always	   -	   0

200 Multi_Zone_Error_Rate   0x0008   200   200   000	Old_age   Offline	  -	   0


SMART Error Log Version: 1

No Errors Logged


SMART Self-test log structure revision number 1

Num  Test_Description	Status				  Remaining  LifeTime(hours)  LBA_of_first_error

# 1  Short offline	   Completed without error	   00%	 33556		 -

# 2  Short offline	   Completed without error	   00%	 33500		 -

# 3  Extended offline	Completed without error	   00%	 33266		 -

# 4  Short offline	   Completed without error	   00%	 33255		 -

# 5  Short offline	   Completed without error	   00%	 33231		 -

# 6  Short offline	   Completed without error	   00%	 32991		 -

# 7  Short offline	   Completed without error	   00%	 32751		 -

# 8  Short offline	   Completed without error	   00%	 32544		 -

# 9  Extended offline	Completed without error	   00%	 32287		 -

#10  Short offline	   Completed without error	   00%	 32276		 -

#11  Short offline	   Completed without error	   00%	 32253		 -

#12  Short offline	   Completed without error	   00%	 32013		 -

#13  Short offline	   Completed without error	   00%	 31772		 -

#14  Extended offline	Completed without error	   00%	 31544		 -

#15  Short offline	   Completed without error	   00%	 31533		 -

#16  Short offline	   Completed without error	   00%	 31293		 -

#17  Short offline	   Completed without error	   00%	 31053		 -

#18  Extended offline	Completed without error	   00%	 30824		 -

#19  Short offline	   Completed without error	   00%	 30814		 -

#20  Short offline	   Completed without error	   00%	 30789		 -

#21  Short offline	   Completed without error	   00%	 30551		 -


SMART Selective self-test log data structure revision number 1

 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS

	1		0		0  Not_testing

	2		0		0  Not_testing

	3		0		0  Not_testing

	4		0		0  Not_testing

	5		0		0  Not_testing

Selective self-test flags (0x0):

  After scanning selected spans, do NOT read-scan remainder of disk.

If Selective self-test is pending on power-up, resume after 0 minute delay.



Thank you for your analyse.

Regards
 

garm

Wizard
Joined
Aug 19, 2017
Messages
1,556
30+ hours is not making these drives spring chickens exactly, but smart isn’t showing sector errors. You need to start looking at your controller, cabling and PSU. Also, how old is this issue? Did it all happen the same night or have you ignored the errors for a longer time?
 

lnix

Dabbler
Joined
Aug 16, 2014
Messages
29
I have this problem since 2 month but not many errors ( under 5). Too many errors came last Sunday after a scrub. At first I lost one datafile because of permanent errors. I delete it and started a scrub. I will shutdown my nas and control all cable. How can I check my PSU?
 

kdragon75

Wizard
Joined
Aug 7, 2016
Messages
2,457
I have this problem since 2 month but not many errors ( under 5). Too many errors came last Sunday after a scrub. At first I lost one datafile because of permanent errors. I delete it and started a scrub. I will shutdown my nas and control all cable. How can I check my PSU?
Tell us the models of the PSU and the disk controller along with the approximate age. My guess if you have a sff-8087 to sata breakout cable that's the issue. Otherwise, it's the controller.
 

lnix

Dabbler
Joined
Aug 16, 2014
Messages
29
I have a be quite PSU and the disk controller is on my board. The stuff is 4 years old.

I checked all cable and cleaned my nas.

After the reboot I have no errors. Is it normal?

Code:
[root@freenas ~]# zpool status -v pool

  pool: pool

 state: ONLINE

status: Some supported features are not enabled on the pool. The pool can

	still be used, but some features are unavailable.

action: Enable all features using 'zpool upgrade'. Once this is done,

	the pool may no longer be accessible by software that does not support

	the features. See zpool-features(7) for details.

  scan: scrub repaired 6.96M in 0 days 09:33:44 with 0 errors on Sun Aug 12 00:18:58 2018

config:


	NAME											STATE	 READ WRITE CKSUM

	pool											ONLINE	   0	 0	 0

	  raidz1-0									  ONLINE	   0	 0	 0

		gptid/1d131aa1-261c-11e4-a0a2-d050992ba901  ONLINE	   0	 0	 0

		gptid/1d7cb2d8-261c-11e4-a0a2-d050992ba901  ONLINE	   0	 0	 0

		gptid/1de5f709-261c-11e4-a0a2-d050992ba901  ONLINE	   0	 0	 0

		gptid/1e542192-261c-11e4-a0a2-d050992ba901  ONLINE	   0	 0	 0


errors: No known data errors

 

kdragon75

Wizard
Joined
Aug 7, 2016
Messages
2,457

lnix

Dabbler
Joined
Aug 16, 2014
Messages
29
Yes. I have the board Q1900-ITX from ASRock with Intel(R) Celeron(R) CPU J1900 @ 1.99GHz. And a be quite PSU. I don't know the model of my psu.
 
Status
Not open for further replies.
Top