Frage zu Smart Report

Status
Not open for further replies.

Fredi918

Explorer
Joined
Jan 18, 2015
Messages
61
Hallo zusammen,

habe seit ca. 3,5 Jahren erfolgreich ein Nas laufen mit 4 WD Red Platten mit FreeNAS-11.1-RC1 im RaidZ1 Verbund.
Seit ca. 1 Woche bekomme ich E-Mails das mit einer der Platten etwas nicht stimmt (siehe Anhang, Fehlermeldung).
Kennt sich jemand damit aus und kann mir sagen was ich tun kann bzw. ob die Platte zum tauschen ist.
Danke schon mal im voraus
Fredi
Code:
=== START OF INFORMATION SECTION ===
Model Family:	  Western Digital Red
Device Model:	  WDC WD30EFRX-68EUZN0
Serial Number:	  WD-WMC4N0--------
LU WWN Device Id: 5 0014ee 65a522921
Firmware Version: 82.00A82
User Capacity:	  3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:	  512 bytes logical, 4096 bytes physical
Rotation Rate:	  5400 rpm
Device is:			In smartctl database [for details use: -P show]
ATA Version is:	ACS-2 (minor revision not indicated)
SATA Version is:   SATA 3.0, 6.0 Gb/s (current: 3.0 Gb/s)
Local Time is:	  Mon Jan 15 15:28:05 2018 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is:	Unavailable
APM feature is:	Unavailable
Rd look-ahead is: Enabled
Write cache is:	Enabled
ATA Security is:   Disabled, frozen [SEC2]
Wt Cache Reorder: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:   (0x00) Offline data collection activity
was never started.
Auto Offline Data Collection: Disabled.
Self-test execution status:		 ( 113) The previous self-test completed having
the read element of the test failed.
Total time to complete Offline  
data collection: (39600) seconds.
Offline data collection
capabilities: (0x7b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities:				  (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability:			(0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine  
recommended polling time: (	2) minutes.
Extended self-test routine
recommended polling time: ( 398) minutes.
Conveyance self-test routine
recommended polling time: (	5) minutes.
SCT capabilities:		 (0x703d) SCT Status supported.
SCT Error Recovery Control supported.
SCT Feature Control supported.
SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME			   FLAGS	  VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate	  POSR-K	200	200   051	  -	  20
  3 Spin_Up_Time				  POS--K   179	177	021	  -	  6016
  4 Start_Stop_Count			-O--CK	096   096	000	  -	  4775
  5 Reallocated_Sector_Ct	PO--CK	200	200   140	  -	  0
  7 Seek_Error_Rate			 -OSR-K	200   200	000	  -	  0
  9 Power_On_Hours			   -O--CK   063	063	000	  -	  27214
10 Spin_Retry_Count			-O--CK	100   100	000	  -	  0
11 Calibration_Retry_Count -O--CK	100	100	000     -	  0
12 Power_Cycle_Count		 -O--CK	100   100	000	  -	  232
192 Power-Off_Retract_Count -O--CK	200	200	000     -	  137
193 Load_Cycle_Count			-O--CK	195   195	000	  -	  15900
194 Temperature_Celsius	  -O---K	116	108   000	  -	  34
196 Reallocated_Event_Count -O--CK	200	200	000     -	  0
197 Current_Pending_Sector   -O--CK	200	200   000	  -	  1
198 Offline_Uncorrectable	----CK	100	253   000	  -	  0
199 UDMA_CRC_Error_Count	  -O--CK	200	200   000	  -	  0
200 Multi_Zone_Error_Rate	---R--	200	200   000	  -	  1
  						   			||||||_ K auto-keep
  						   			|||||__ C event count
  						   			||||___ R error rate
  						   			|||____ S speed/performance
  						   			||_____ O updated online
  						   			|______ P prefailure warning

General Purpose Log Directory Version 1
SMART 			  Log Directory Version 1 [multi-sector log support]
Address 	 Access   R/W	Size   Description
0x00 		 GPL,SL   R/O		 1   Log Directory
0x01 			  SL   R/O	    1   Summary SMART error log
0x02 			  SL   R/O	    5   Comprehensive SMART error log
0x03 		 GPL	  R/O		 6   Ext. Comprehensive SMART error log
0x06 			  SL   R/O	    1   SMART self-test log
0x07 		 GPL	  R/O		 1   Extended self-test log
0x09 			  SL   R/W	    1   Selective self-test log
0x10 		 GPL	  R/O		 1   SATA NCQ Queued Error log
0x11 		 GPL	  R/O		 1   SATA Phy Event Counters log
0x21 		 GPL	  R/O		 1   Write stream error log
0x22 		 GPL	  R/O		 1   Read stream error log
0x80-0x9f   GPL,SL   R/W	  16   Host vendor specific log
0xa0-0xa7   GPL,SL   VS		 16   Device vendor specific log
0xa8-0xb7   GPL,SL   VS		 1   Device vendor specific log
0xbd 		 GPL,SL   VS		 1   Device vendor specific log
0xc0 		 GPL,SL   VS		 1   Device vendor specific log
0xc1 		 GPL	  VS		 93   Device vendor specific log
0xe0 		 GPL,SL   R/W		 1   SCT Command/Status
0xe1 		 GPL,SL   R/W		 1   SCT Data Transfer

SMART Extended Comprehensive Error Log Version: 1 (6 sectors)
Device Error Count: 20
CR 	 = Command Register
FEATR   = Features Register
COUNT   = Count (was: Sector Count) Register
LBA_48 = Upper bytes of LBA High/Mid/Low Registers ]   ATA-8
LH 	 = LBA High (was: Cylinder High) Register	  ]	LBA
LM 	 = LBA Mid (was: Cylinder Low) Register	    ] Register
LL 	 = LBA Low (was: Sector Number) Register	  ]
DV 	 = Device (was: Device/Head) Register
DC 	 = Device Control Register
ER 	 = Error register
ST 	 = Status register
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss  where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 20 [19] occurred at disk power-on lifetime: 27009 hours (1125 days + 9 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT   LBA_48   LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 01 4e 54 6f d8 40 00   Error: UNC at LBA = 0x14e546fd8 = 5609123800

  Commands leading to the command that caused the error were:
  CR FEATR COUNT   LBA_48   LH LM LL DV DC   Powered_Up_Time   Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --   ---------------   --------------------
  60 00 c0 00 68 00 01 4e 54 70 a8 40 08 26d+13:37:11.630   READ FPDMA QUEUED
  60 01 00 00 60 00 01 4e 54 6f a8 40 08 26d+13:37:11.629   READ FPDMA QUEUED
  2f 00 00 00 01 00 00 00 00 00 10 40 08 26d+13:37:11.627   READ LOG EXT
  60 00 c0 00 50 00 01 4e 54 70 a8 40 08 26d+13:37:07.810   READ FPDMA QUEUED
  60 01 00 00 48 00 01 4e 54 6f a8 40 08 26d+13:37:07.810   READ FPDMA QUEUED

Error 19 [18] occurred at disk power-on lifetime: 27009 hours (1125 days + 9 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT   LBA_48   LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 01 4e 54 6f d8 40 00   Error: UNC at LBA = 0x14e546fd8 = 5609123800

  Commands leading to the command that caused the error were:
  CR FEATR COUNT   LBA_48   LH LM LL DV DC   Powered_Up_Time   Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --   ---------------   --------------------
  60 00 c0 00 50 00 01 4e 54 70 a8 40 08 26d+13:37:07.810   READ FPDMA QUEUED
  60 01 00 00 48 00 01 4e 54 6f a8 40 08 26d+13:37:07.810   READ FPDMA QUEUED
  2f 00 00 00 01 00 00 00 00 00 10 40 08 26d+13:37:07.808   READ LOG EXT
  60 00 c0 00 38 00 01 4e 54 70 a8 40 08 26d+13:37:04.270   READ FPDMA QUEUED
  60 01 00 00 30 00 01 4e 54 6f a8 40 08 26d+13:37:04.270   READ FPDMA QUEUED

Error 18 [17] occurred at disk power-on lifetime: 27009 hours (1125 days + 9 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT   LBA_48   LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 01 4e 54 6f d8 40 00   Error: UNC at LBA = 0x14e546fd8 = 5609123800

  Commands leading to the command that caused the error were:
  CR FEATR COUNT   LBA_48   LH LM LL DV DC   Powered_Up_Time   Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --   ---------------   --------------------
  60 00 c0 00 38 00 01 4e 54 70 a8 40 08 26d+13:37:04.270   READ FPDMA QUEUED
  60 01 00 00 30 00 01 4e 54 6f a8 40 08 26d+13:37:04.270   READ FPDMA QUEUED
  2f 00 00 00 01 00 00 00 00 00 10 40 08 26d+13:37:04.268   READ LOG EXT
  60 00 c0 00 20 00 01 4e 54 70 a8 40 08 26d+13:37:00.452   READ FPDMA QUEUED
  60 01 00 00 18 00 01 4e 54 6f a8 40 08 26d+13:37:00.452   READ FPDMA QUEUED

Error 17 [16] occurred at disk power-on lifetime: 27009 hours (1125 days + 9 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT   LBA_48   LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 01 4e 54 6f d8 40 00   Error: UNC at LBA = 0x14e546fd8 = 5609123800

  Commands leading to the command that caused the error were:
  CR FEATR COUNT   LBA_48   LH LM LL DV DC   Powered_Up_Time   Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --   ---------------   --------------------
  60 00 c0 00 20 00 01 4e 54 70 a8 40 08 26d+13:37:00.452   READ FPDMA QUEUED
  60 01 00 00 18 00 01 4e 54 6f a8 40 08 26d+13:37:00.452   READ FPDMA QUEUED
  2f 00 00 00 01 00 00 00 00 00 10 40 08 26d+13:37:00.450   READ LOG EXT
  60 00 c0 00 08 00 01 4e 54 70 a8 40 08 26d+13:36:56.661   READ FPDMA QUEUED
  60 01 00 00 00 00 01 4e 54 6f a8 40 08 26d+13:36:56.661   READ FPDMA QUEUED

Error 16 [15] occurred at disk power-on lifetime: 27009 hours (1125 days + 9 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT   LBA_48   LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 01 4e 54 6f d8 40 00   Error: UNC at LBA = 0x14e546fd8 = 5609123800

  Commands leading to the command that caused the error were:
  CR FEATR COUNT   LBA_48   LH LM LL DV DC   Powered_Up_Time   Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --   ---------------   --------------------
  60 00 c0 00 08 00 01 4e 54 70 a8 40 08 26d+13:36:56.661   READ FPDMA QUEUED
  60 01 00 00 00 00 01 4e 54 6f a8 40 08 26d+13:36:56.661   READ FPDMA QUEUED
  60 00 c0 00 f8 00 01 4e 54 6e 68 40 08 26d+13:36:56.657   READ FPDMA QUEUED
  60 00 c0 00 f0 00 01 4e 54 6d 48 40 08 26d+13:36:56.657   READ FPDMA QUEUED
  60 00 40 00 e8 00 01 4e 54 6c a8 40 08 26d+13:36:56.647   READ FPDMA QUEUED

Error 15 [14] occurred at disk power-on lifetime: 27009 hours (1125 days + 9 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT   LBA_48   LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 01 4e 53 f5 90 40 00   Error: UNC at LBA = 0x14e53f590 = 5609092496

  Commands leading to the command that caused the error were:
  CR FEATR COUNT   LBA_48   LH LM LL DV DC   Powered_Up_Time   Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --   ---------------   --------------------
  60 00 d0 00 d8 00 01 4e 53 f5 08 40 08 26d+13:36:52.903   READ FPDMA QUEUED
  2f 00 00 00 01 00 00 00 00 00 10 40 08 26d+13:36:52.900   READ LOG EXT
  60 00 d0 00 c8 00 01 4e 53 f5 08 40 08 26d+13:36:49.337   READ FPDMA QUEUED
  2f 00 00 00 01 00 00 00 00 00 10 40 08 26d+13:36:49.293   READ LOG EXT
  60 00 d0 00 b8 00 01 4e 53 f5 08 40 08 26d+13:36:45.730   READ FPDMA QUEUED

Error 14 [13] occurred at disk power-on lifetime: 27009 hours (1125 days + 9 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT   LBA_48   LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 01 4e 53 f5 90 40 00   Error: UNC at LBA = 0x14e53f590 = 5609092496

  Commands leading to the command that caused the error were:
  CR FEATR COUNT   LBA_48   LH LM LL DV DC   Powered_Up_Time   Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --   ---------------   --------------------
  60 00 d0 00 c8 00 01 4e 53 f5 08 40 08 26d+13:36:49.337   READ FPDMA QUEUED
  2f 00 00 00 01 00 00 00 00 00 10 40 08 26d+13:36:49.293   READ LOG EXT
  60 00 d0 00 b8 00 01 4e 53 f5 08 40 08 26d+13:36:45.730   READ FPDMA QUEUED
  2f 00 00 00 01 00 00 00 00 00 10 40 08 26d+13:36:45.686   READ LOG EXT
  60 00 d0 00 a8 00 01 4e 53 f5 08 40 08 26d+13:36:42.122   READ FPDMA QUEUED

Error 13 [12] occurred at disk power-on lifetime: 27009 hours (1125 days + 9 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT   LBA_48   LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 00 00 01 4e 53 f5 90 40 00   Error: UNC at LBA = 0x14e53f590 = 5609092496

  Commands leading to the command that caused the error were:
  CR FEATR COUNT   LBA_48   LH LM LL DV DC   Powered_Up_Time   Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --   ---------------   --------------------
  60 00 d0 00 b8 00 01 4e 53 f5 08 40 08 26d+13:36:45.730   READ FPDMA QUEUED
  2f 00 00 00 01 00 00 00 00 00 10 40 08 26d+13:36:45.686   READ LOG EXT
  60 00 d0 00 a8 00 01 4e 53 f5 08 40 08 26d+13:36:42.122   READ FPDMA QUEUED
  2f 00 00 00 01 00 00 00 00 00 10 40 08 26d+13:36:42.078   READ LOG EXT
  60 00 d0 00 98 00 01 4e 53 f5 08 40 08 26d+13:36:38.307   READ FPDMA QUEUED

SMART Extended Self-test Log Version: 1 (1 sectors)
Num   Test_Description	  Status		 				 Remaining   LifeTime(hours)   LBA_of_first_error
# 1   Extended offline	  Completed: read failure   	  10%	  27212		   5609123800
# 2   Extended offline	  Completed: read failure   	  10%	  27034		   5609123800
# 3   Extended offline	  Completed: read failure   	  90%	  27016		   5609123800
# 4   Short offline		 Completed without error		 00%	  10919	  	  -
# 5   Short offline		 Completed without error		 00%	  10882	  	  -
# 6   Extended offline	  Completed without error   	  00%	  10827		   -
# 7   Short offline		 Completed without error		 00%	  10800	  	  -
# 8   Short offline		 Completed without error		 00%	  10750	  	  -
# 9   Extended offline	  Completed without error   	  00%	  10707		   -
#10   Short offline		 Completed without error 		 00%	  10654		 	-
#11   Short offline		 Completed without error 		 00%	  10606		 	-
#12   Short offline		 Completed without error 		 00%	  10560		 	-
#13   Extended offline	  Completed without error   	  00%	  10515		   -
#14   Short offline		 Completed without error 		 00%	  10464		 	-
#15   Short offline		 Completed without error 		 00%	  10416		 	-
#16   Short offline		 Completed without error 		 00%	  10368		 	-
#17   Extended offline	  Completed without error   	  00%	  10327		   -
#18   Short offline		 Completed without error 		 00%	  10273		 	-

SMART Selective self-test log data structure revision number 1
SPAN   MIN_LBA   MAX_LBA   CURRENT_TEST_STATUS
    1			0		   0   Not_testing
    2			0		   0   Not_testing
    3			0		   0   Not_testing
    4			0		   0   Not_testing
    5			0		   0   Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

SCT Status Version:					 	  3
SCT Version (vendor specific):		 258 (0x0102)
SCT Support Level:					 	  1
Device State:					   			Active (0)
Current Temperature:					 		34 Celsius
Power Cycle Min/Max Temperature:	  22/37 Celsius
Lifetime 	 Min/Max Temperature:		 2/42 Celsius
Under/Over Temperature Limit Count:	0/0
Vendor specific:
01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00

SCT Temperature History Version:	  2
Temperature Sampling Period:			 1 minute
Temperature Logging Interval:			1 minute
Min/Max recommended Temperature:		 0/60 Celsius
Min/Max Temperature Limit:				-41/85 Celsius
Temperature History Size (Index):	  478 (4)

Index 	 Estimated Time	Temperature Celsius
  5	  2018-01-15 07:31	  36   *****************
... 	 ..( 74 skipped).	  ..   *****************
  80	  2018-01-15 08:46	  36   *****************
  81	  2018-01-15 08:47	  35   ****************
... 	 ..( 20 skipped).	  ..   ****************
102 	 2018-01-15 09:08	  35   ****************
103 	 2018-01-15 09:09	  36   *****************
... 	 ..(113 skipped).	  ..   *****************
217 	 2018-01-15 11:03	  36   *****************
218 	 2018-01-15 11:04	  35   ****************
... 	 ..( 14 skipped).	  ..   ****************
233 	 2018-01-15 11:19	  35   ****************
234 	 2018-01-15 11:20	  34   ***************
... 	 ..( 63 skipped).	  ..   ***************
298 	 2018-01-15 12:24	  34   ***************
299 	 2018-01-15 12:25	  36   *****************
... 	 ..(182 skipped).	  ..   *****************
  4	  2018-01-15 15:28	  36   *****************

SCT Error Recovery Control:
  			 Read:	  70 (7.0 seconds)
  			Write:	  70 (7.0 seconds)

Device Statistics (GP/SMART Log 0x04) not supported

SATA Phy Event Counters (GP Log 0x11)
ID 		Size	  Value   Description
0x0001   2				  0   Command failed due to ICRC error
0x0002   2				  0   R_ERR response for data FIS
0x0003   2				  0   R_ERR response for device-to-host data FIS
0x0004   2				  0   R_ERR response for host-to-device data FIS
0x0005   2				  0   R_ERR response for non-data FIS
0x0006   2				  0   R_ERR response for device-to-host non-data FIS
0x0007   2				  0   R_ERR response for host-to-device non-data FIS
0x0008   2				  0   Device-to-host non-data FIS retries
0x0009   2				17   Transition from drive PhyRdy to drive PhyNRdy
0x000a   2				18   Device-to-host register FISes sent due to a COMRESET
0x000b   2				  0   CRC errors within host-to-device FIS
0x000f   2				  0   R_ERR response for host-to-device data FIS, CRC
0x0012   2				  0   R_ERR response for host-to-device non-data FIS, CRC
0x8000   4		 3032412   Vendor specific

smartctl 6.5 2016-05-07 r4318 [FreeBSD 11.1-STABLE amd64] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke,  www.smartmontools.org




 

Attachments

  • Warnung.JPG
    Warnung.JPG
    22 KB · Views: 354

MrToddsFriends

Documentation Browser
Joined
Jan 12, 2015
Messages
1,338
Ich würde die Platte wegen der berichteten Fehler zügig tauschen (Abschnitt ab "Device Error Count: 20" im smartctl Log sowie die letzten drei fehlgeschlagenen "Extended offline" Tests).
 

Fredi918

Explorer
Joined
Jan 18, 2015
Messages
61
Danke für deine Hilfe, hab eine neue Platte bestellt und werde diese demnächst einbauen.
 

MrToddsFriends

Documentation Browser
Joined
Jan 12, 2015
Messages
1,338
Grundsätzlich wäre es an dieser Stelle angebracht, noch auf die Vorteile eines Disk Burnin hinzuweisen. Ein Burnin einer 3TB WD Red wie im folgenden Posting angegeben dauert allerdings rund drei Tage. Drei weitere Tage, während der Dein Pool wegen der oben dargelegten Probleme einer höheren Ausfallfallgefahr ausgesetzt bliebe. Ohne Backup wären das gewiss nicht nur Vorteile.

https://forums.freenas.org/index.ph...for-freenas-scripts-including-disk-burnin.28/
 

Fredi918

Explorer
Joined
Jan 18, 2015
Messages
61
Kurze Frage noch, wie genau Tausche ich die Defekte Platte ?
System runterfahren, defekte Platte raus, neue Platte rein (ist identisch) und wieder hochfahren, danach warten bis resilver abgeschlossen ??
Danke schon mal
Fredi
 

MrToddsFriends

Documentation Browser
Joined
Jan 12, 2015
Messages
1,338
Status
Not open for further replies.
Top