Hi Guys
I'm hoping that someone can shed some light on this issue I am having.
My setup is:
FreeNAS 9.3 - Stable - 201506292130
Asus Sabertooth 990fx
AMD 8150 Processor
32 GB ECC Kingston Memory
2 TB Mirror WD Red
3 TB Mirror WD Red
4 TB Mirror WD Red
2 x 4 Port PCI Express SATA III RAID Card - StarTech
I installed FreeNAS a short while ago and left the machine off. I turned it on recently, and it's coming up with the above error on boot.
I have run long tests on all my drives, and a fair few short tests, all without problems. The error will not go away; the alert only goes green after I un-check it.
Please see below the SMART output from both drives in the RAID 1 mirror, the failed one and the non-failed one.
Failed Drive
Code:
~# smartctl -x /dev/ada0
smartctl 6.3 2014-07-26 r3976 [FreeBSD 9.3-RELEASE-p16 amd64] (local build)
Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Model Family: Western Digital Red (AF)
Device Model: WDC WD20EFRX-68EUZN0
Serial Number: WD-WCC4M0VHHX2E
LU WWN Device Id: 5 0014ee 2b65fdd2f
Firmware Version: 82.00A82
User Capacity: 2,000,398,934,016 bytes [2.00 TB]
Sector Sizes: 512 bytes logical, 4096 bytes physical
Rotation Rate: 5400 rpm
Device is: In smartctl database [for details use: -P show]
ATA Version is: ACS-2 (minor revision not indicated)
SATA Version is: SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is: Wed Jul 22 13:24:29 2015 BST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is: Unavailable
APM feature is: Unavailable
Rd look-ahead is: Enabled
Write cache is: Enabled
ATA Security is: Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Unknown
=== START OF READ SMART DATA SECTION ===
Error SMART Status command failed
Please get assistance from
http://smartmontools.sourceforge.net/
Register values returned from SMART Status command are:
CMD=0xb0
FR =0xda
NS =0x00
SC =0x00
CL =0x00
CH =0x00
RETURN =0x0000
SMART overall-health self-assessment test result: FAILED!
No failed Attributes found.
General SMART Values:
Offline data collection status: (0x00) Offline data collection activity
was never started.
Auto Offline Data Collection: Disabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: (26580) seconds.
Offline data collection
capabilities: (0x7b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 268) minutes.
Conveyance self-test routine
recommended polling time: ( 5) minutes.
SCT capabilities: (0x703d) SCT Status supported.
SCT Error Recovery Control supported.
SCT Feature Control supported.
SCT Data Table supported.
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAGS VALUE WORST THRESH FAIL RAW_VALUE
1 Raw_Read_Error_Rate POSR-K 100 253 051 - 0
3 Spin_Up_Time POS--K 100 253 021 - 0
4 Start_Stop_Count -O--CK 100 100 000 - 4
5 Reallocated_Sector_Ct PO--CK 200 200 140 - 0
7 Seek_Error_Rate -OSR-K 200 200 000 - 0
9 Power_On_Hours -O--CK 100 100 000 - 175
10 Spin_Retry_Count -O--CK 100 253 000 - 0
11 Calibration_Retry_Count -O--CK 100 253 000 - 0
12 Power_Cycle_Count -O--CK 100 100 000 - 4
192 Power-Off_Retract_Count -O--CK 200 200 000 - 0
193 Load_Cycle_Count -O--CK 200 200 000 - 12
194 Temperature_Celsius -O---K 116 115 000 - 31
196 Reallocated_Event_Count -O--CK 200 200 000 - 0
197 Current_Pending_Sector -O--CK 200 200 000 - 0
198 Offline_Uncorrectable ----CK 100 253 000 - 0
199 UDMA_CRC_Error_Count -O--CK 200 200 000 - 0
200 Multi_Zone_Error_Rate ---R-- 200 200 000 - 0
||||||_ K auto-keep
|||||__ C event count
||||___ R error rate
|||____ S speed/performance
||_____ O updated online
|______ P prefailure warning
General Purpose Log Directory Version 1
SMART Log Directory Version 1 [multi-sector log support]
Address Access R/W Size Description
0x00 GPL,SL R/O 1 Log Directory
0x01 SL R/O 1 Summary SMART error log
0x02 SL R/O 5 Comprehensive SMART error log
0x03 GPL R/O 6 Ext. Comprehensive SMART error log
0x06 SL R/O 1 SMART self-test log
0x07 GPL R/O 1 Extended self-test log
0x09 SL R/W 1 Selective self-test log
0x10 GPL R/O 1 NCQ Command Error log
0x11 GPL R/O 1 SATA Phy Event Counters
0x21 GPL R/O 1 Write stream error log
0x22 GPL R/O 1 Read stream error log
0x80-0x9f GPL,SL R/W 16 Host vendor specific log
0xa0-0xa7 GPL,SL VS 16 Device vendor specific log
0xa8-0xb7 GPL,SL VS 1 Device vendor specific log
0xbd GPL,SL VS 1 Device vendor specific log
0xc0 GPL,SL VS 1 Device vendor specific log
0xc1 GPL VS 93 Device vendor specific log
0xe0 GPL,SL R/W 1 SCT Command/Status
0xe1 GPL,SL R/W 1 SCT Data Transfer
SMART Extended Comprehensive Error Log Version: 1 (6 sectors)
No Errors Logged
SMART Extended Self-test Log Version: 1 (1 sectors)
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Short offline Completed without error 00% 168 -
# 2 Short offline Completed without error 00% 144 -
# 3 Short offline Completed without error 00% 123 -
# 4 Short offline Completed without error 00% 122 -
# 5 Short offline Completed without error 00% 121 -
# 6 Short offline Completed without error 00% 120 -
# 7 Short offline Completed without error 00% 119 -
# 8 Short offline Completed without error 00% 118 -
# 9 Short offline Completed without error 00% 117 -
#10 Short offline Completed without error 00% 116 -
#11 Short offline Completed without error 00% 115 -
#12 Short offline Completed without error 00% 114 -
#13 Short offline Completed without error 00% 113 -
#14 Short offline Completed without error 00% 112 -
#15 Short offline Completed without error 00% 111 -
#16 Short offline Completed without error 00% 110 -
#17 Short offline Completed without error 00% 109 -
#18 Short offline Completed without error 00% 108 -
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
SCT Status Version: 3
SCT Version (vendor specific): 258 (0x0102)
SCT Support Level: 1
Device State: Active (0)
Current Temperature: 31 Celsius
Power Cycle Min/Max Temperature: 25/32 Celsius
Lifetime Min/Max Temperature: 21/32 Celsius
Under/Over Temperature Limit Count: 0/0
SCT Temperature History Version: 2
Temperature Sampling Period: 1 minute
Temperature Logging Interval: 1 minute
Min/Max recommended Temperature: 0/60 Celsius
Min/Max Temperature Limit: -41/85 Celsius
Temperature History Size (Index): 478 (15)
Index Estimated Time Temperature Celsius
16 2015-07-22 05:27 29 **********
... ..(135 skipped). .. **********
152 2015-07-22 07:43 29 **********
153 2015-07-22 07:44 30 ***********
... ..(189 skipped). .. ***********
343 2015-07-22 10:54 30 ***********
344 2015-07-22 10:55 31 ************
... ..(148 skipped). .. ************
15 2015-07-22 13:24 31 ************
SCT Error Recovery Control:
Read: Disabled
Write: Disabled
Device Statistics (GP Log 0x04) not supported
SATA Phy Event Counters (GP Log 0x11)
ID Size Value Description
0x0001 2 0 Command failed due to ICRC error
0x0002 2 0 R_ERR response for data FIS
0x0003 2 0 R_ERR response for device-to-host data FIS
0x0004 2 0 R_ERR response for host-to-device data FIS
0x0005 2 0 R_ERR response for non-data FIS
0x0006 2 0 R_ERR response for device-to-host non-data FIS
0x0007 2 0 R_ERR response for host-to-device non-data FIS
0x0008 2 0 Device-to-host non-data FIS retries
0x0009 2 10 Transition from drive PhyRdy to drive PhyNRdy
0x000a 2 10 Device-to-host register FISes sent due to a COMRESET
0x000b 2 0 CRC errors within host-to-device FIS
0x000f 2 0 R_ERR response for host-to-device data FIS, CRC
0x0012 2 0 R_ERR response for host-to-device non-data FIS, CRC
0x8000 4 426880 Vendor specific
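A note on the register dump in the output above: CMD=0xb0 with FR=0xda is the ATA SMART RETURN STATUS command. Per the ATA spec, the drive answers through the CL/CH (LBA mid/high) registers: 0x4F/0xC2 when no threshold has been exceeded, 0xF4/0x2C when one has. The values shown here (CL=0x00, CH=0x00) match neither signature. That may point at the registers not being returned through the controller or driver rather than at the drive itself, but that reading is an assumption, not something this output proves. A minimal Python sketch of the check (the helper name `decode_smart_status` is made up for illustration):

```python
# Signature values defined for the ATA SMART RETURN STATUS command,
# given as (CL, CH), i.e. (LBA mid, LBA high).
PASS_SIGNATURE = (0x4F, 0xC2)  # no attribute threshold exceeded
FAIL_SIGNATURE = (0xF4, 0x2C)  # at least one threshold exceeded


def decode_smart_status(cl: int, ch: int) -> str:
    """Classify the CL/CH register pair returned by SMART RETURN STATUS."""
    if (cl, ch) == PASS_SIGNATURE:
        return "passed"
    if (cl, ch) == FAIL_SIGNATURE:
        return "threshold exceeded"
    # Anything else is not a valid answer from the drive.
    return "indeterminate"


# Register values copied from the ada0 output above (CL=0x00, CH=0x00).
print(decode_smart_status(0x00, 0x00))
```

If the result is indeterminate, the attribute table and self-test log are the more reliable things to look at.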
Counterpart drive in the mirror
Code:
smartctl -x /dev/ada2
smartctl 6.3 2014-07-26 r3976 [FreeBSD 9.3-RELEASE-p16 amd64] (local build)
Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Model Family: Western Digital Red (AF)
Device Model: WDC WD20EFRX-68EUZN0
Serial Number: WD-WCC4M1NTSTUK
LU WWN Device Id: 5 0014ee 2610a09f5
Firmware Version: 82.00A82
User Capacity: 2,000,398,934,016 bytes [2.00 TB]
Sector Sizes: 512 bytes logical, 4096 bytes physical
Rotation Rate: 5400 rpm
Device is: In smartctl database [for details use: -P show]
ATA Version is: ACS-2 (minor revision not indicated)
SATA Version is: SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is: Wed Jul 22 13:35:57 2015 BST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
AAM feature is: Unavailable
APM feature is: Unavailable
Rd look-ahead is: Enabled
Write cache is: Enabled
ATA Security is: Disabled, NOT FROZEN [SEC1]
Wt Cache Reorder: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x00) Offline data collection activity
was never started.
Auto Offline Data Collection: Disabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: (27720) seconds.
Offline data collection
capabilities: (0x7b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 280) minutes.
Conveyance self-test routine
recommended polling time: ( 5) minutes.
SCT capabilities: (0x703d) SCT Status supported.
SCT Error Recovery Control supported.
SCT Feature Control supported.
SCT Data Table supported.
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAGS VALUE WORST THRESH FAIL RAW_VALUE
1 Raw_Read_Error_Rate POSR-K 100 253 051 - 0
3 Spin_Up_Time POS--K 100 253 021 - 0
4 Start_Stop_Count -O--CK 100 100 000 - 4
5 Reallocated_Sector_Ct PO--CK 200 200 140 - 0
7 Seek_Error_Rate -OSR-K 200 200 000 - 0
9 Power_On_Hours -O--CK 100 100 000 - 175
10 Spin_Retry_Count -O--CK 100 253 000 - 0
11 Calibration_Retry_Count -O--CK 100 253 000 - 0
12 Power_Cycle_Count -O--CK 100 100 000 - 4
192 Power-Off_Retract_Count -O--CK 200 200 000 - 0
193 Load_Cycle_Count -O--CK 200 200 000 - 12
194 Temperature_Celsius -O---K 117 116 000 - 30
196 Reallocated_Event_Count -O--CK 200 200 000 - 0
197 Current_Pending_Sector -O--CK 200 200 000 - 0
198 Offline_Uncorrectable ----CK 100 253 000 - 0
199 UDMA_CRC_Error_Count -O--CK 200 200 000 - 0
200 Multi_Zone_Error_Rate ---R-- 200 200 000 - 0
||||||_ K auto-keep
|||||__ C event count
||||___ R error rate
|||____ S speed/performance
||_____ O updated online
|______ P prefailure warning
General Purpose Log Directory Version 1
SMART Log Directory Version 1 [multi-sector log support]
Address Access R/W Size Description
0x00 GPL,SL R/O 1 Log Directory
0x01 SL R/O 1 Summary SMART error log
0x02 SL R/O 5 Comprehensive SMART error log
0x03 GPL R/O 6 Ext. Comprehensive SMART error log
0x06 SL R/O 1 SMART self-test log
0x07 GPL R/O 1 Extended self-test log
0x09 SL R/W 1 Selective self-test log
0x10 GPL R/O 1 NCQ Command Error log
0x11 GPL R/O 1 SATA Phy Event Counters
0x21 GPL R/O 1 Write stream error log
0x22 GPL R/O 1 Read stream error log
0x80-0x9f GPL,SL R/W 16 Host vendor specific log
0xa0-0xa7 GPL,SL VS 16 Device vendor specific log
0xa8-0xb7 GPL,SL VS 1 Device vendor specific log
0xbd GPL,SL VS 1 Device vendor specific log
0xc0 GPL,SL VS 1 Device vendor specific log
0xc1 GPL VS 93 Device vendor specific log
0xe0 GPL,SL R/W 1 SCT Command/Status
0xe1 GPL,SL R/W 1 SCT Data Transfer
SMART Extended Comprehensive Error Log Version: 1 (6 sectors)
No Errors Logged
SMART Extended Self-test Log Version: 1 (1 sectors)
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Short offline Completed without error 00% 168 -
# 2 Short offline Completed without error 00% 144 -
# 3 Short offline Completed without error 00% 123 -
# 4 Short offline Completed without error 00% 122 -
# 5 Short offline Completed without error 00% 121 -
# 6 Short offline Completed without error 00% 120 -
# 7 Short offline Completed without error 00% 119 -
# 8 Short offline Completed without error 00% 118 -
# 9 Short offline Completed without error 00% 117 -
#10 Short offline Completed without error 00% 116 -
#11 Short offline Completed without error 00% 115 -
#12 Short offline Completed without error 00% 114 -
#13 Short offline Completed without error 00% 113 -
#14 Short offline Completed without error 00% 112 -
#15 Short offline Completed without error 00% 111 -
#16 Short offline Completed without error 00% 110 -
#17 Short offline Completed without error 00% 109 -
#18 Short offline Completed without error 00% 108 -
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
SCT Status Version: 3
SCT Version (vendor specific): 258 (0x0102)
SCT Support Level: 1
Device State: Active (0)
Current Temperature: 30 Celsius
Power Cycle Min/Max Temperature: 25/31 Celsius
Lifetime Min/Max Temperature: 21/31 Celsius
Under/Over Temperature Limit Count: 0/0
SCT Temperature History Version: 2
Temperature Sampling Period: 1 minute
Temperature Logging Interval: 1 minute
Min/Max recommended Temperature: 0/60 Celsius
Min/Max Temperature Limit: -41/85 Celsius
Temperature History Size (Index): 478 (26)
Index Estimated Time Temperature Celsius
27 2015-07-22 05:38 28 *********
... ..(115 skipped). .. *********
143 2015-07-22 07:34 28 *********
144 2015-07-22 07:35 29 **********
... ..( 64 skipped). .. **********
209 2015-07-22 08:40 29 **********
210 2015-07-22 08:41 30 ***********
... ..( 28 skipped). .. ***********
239 2015-07-22 09:10 30 ***********
240 2015-07-22 09:11 29 **********
... ..( 78 skipped). .. **********
319 2015-07-22 10:30 29 **********
320 2015-07-22 10:31 30 ***********
321 2015-07-22 10:32 30 ***********
322 2015-07-22 10:33 30 ***********
323 2015-07-22 10:34 29 **********
324 2015-07-22 10:35 30 ***********
... ..( 88 skipped). .. ***********
413 2015-07-22 12:04 30 ***********
414 2015-07-22 12:05 31 ************
... ..( 42 skipped). .. ************
457 2015-07-22 12:48 31 ************
458 2015-07-22 12:49 30 ***********
... ..( 45 skipped). .. ***********
26 2015-07-22 13:35 30 ***********
SCT Error Recovery Control:
Read: 70 (7.0 seconds)
Write: 70 (7.0 seconds)
Device Statistics (GP Log 0x04) not supported
SATA Phy Event Counters (GP Log 0x11)
ID Size Value Description
0x0001 2 0 Command failed due to ICRC error
0x0002 2 0 R_ERR response for data FIS
0x0003 2 0 R_ERR response for device-to-host data FIS
0x0004 2 0 R_ERR response for host-to-device data FIS
0x0005 2 0 R_ERR response for non-data FIS
0x0006 2 0 R_ERR response for device-to-host non-data FIS
0x0007 2 0 R_ERR response for host-to-device non-data FIS
0x0008 2 0 Device-to-host non-data FIS retries
0x0009 2 11 Transition from drive PhyRdy to drive PhyNRdy
0x000a 2 11 Device-to-host register FISes sent due to a COMRESET
0x000b 2 0 CRC errors within host-to-device FIS
0x000f 2 0 R_ERR response for host-to-device data FIS, CRC
0x0012 2 0 R_ERR response for host-to-device non-data FIS, CRC
0x8000 4 427570 Vendor specific
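The two dumps are long and nearly identical, so the few lines that differ are easy to miss. Apart from the health result, ada0 shows "Wt Cache Reorder: Unknown" and SCT Error Recovery Control disabled, while ada2 shows "Enabled" and 7.0 seconds. Below is a rough Python sketch for pulling out such differences; the function names are hypothetical and the sample strings are short excerpts of the output above, not the full dumps.

```python
def parse_fields(text: str) -> dict:
    """Collect 'Key: value' lines from smartctl output into a dict."""
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        # Skip lines without a colon and section headers with no value.
        if sep and value.strip():
            fields[key.strip()] = value.strip()
    return fields


def differing_fields(a: str, b: str, keys) -> dict:
    """Return {key: (value_in_a, value_in_b)} for the keys that differ."""
    fa, fb = parse_fields(a), parse_fields(b)
    return {k: (fa.get(k), fb.get(k)) for k in keys if fa.get(k) != fb.get(k)}


# Excerpts copied from the two smartctl -x outputs above.
ada0 = """Firmware Version: 82.00A82
Wt Cache Reorder: Unknown
SCT Error Recovery Control:
Read: Disabled
Write: Disabled"""

ada2 = """Firmware Version: 82.00A82
Wt Cache Reorder: Enabled
SCT Error Recovery Control:
Read: 70 (7.0 seconds)
Write: 70 (7.0 seconds)"""

keys = ["Firmware Version", "Wt Cache Reorder", "Read", "Write"]
diff = differing_fields(ada0, ada2, keys)
for key, (left, right) in diff.items():
    print(f"{key}: ada0={left!r} ada2={right!r}")
```

Keys that repeat in the real output (for example "SMART support is") would overwrite each other in this simple dict, so it is only suitable for fields that appear once.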
zpool status
Code:
zpool status
pool: freenas-boot
state: ONLINE
scan: scrub repaired 0 in 0h0m with 0 errors on Sat Jul 18 03:45:10 2015
config:
NAME STATE READ WRITE CKSUM
freenas-boot ONLINE 0 0 0
ada6p2 ONLINE 0 0 0
errors: No known data errors
pool: raid2tb
state: ONLINE
scan: scrub repaired 0 in 0h0m with 0 errors on Sun Jul 19 00:00:03 2015
config:
NAME STATE READ WRITE CKSUM
raid2tb ONLINE 0 0 0
mirror-0 ONLINE 0 0 0
gptid/806c2811-0244-11e5-9039-10bf488604b9 ONLINE 0 0 0
gptid/81244bfc-0244-11e5-9039-10bf488604b9 ONLINE 0 0 0
errors: No known data errors
pool: raid4tb
state: ONLINE
scan: scrub repaired 0 in 0h0m with 0 errors on Sun Jul 19 00:00:37 2015
config:
NAME STATE READ WRITE CKSUM
raid4tb ONLINE 0 0 0
mirror-0 ONLINE 0 0 0
gptid/c5048f01-0244-11e5-9039-10bf488604b9 ONLINE 0 0 0
gptid/c5755250-0244-11e5-9039-10bf488604b9 ONLINE 0 0 0
errors: No known data errors
pool: riad3tb
state: ONLINE
scan: scrub repaired 0 in 0h0m with 0 errors on Sun Jul 19 00:00:03 2015
config:
NAME STATE READ WRITE CKSUM
riad3tb ONLINE 0 0 0
mirror-0 ONLINE 0 0 0
gptid/a2b11d77-0244-11e5-9039-10bf488604b9 ONLINE 0 0 0
gptid/a320c619-0244-11e5-9039-10bf488604b9 ONLINE 0 0 0
errors: No known data errors
The server is not in production yet, so there is no rush, but I would like to get rid of this annoying error. I just wanted to double-check the problem before I do anything.
Should I run WD Tools on the drive, or will WD accept smartctl output for returns?
Kind Regards
Lee