sypher
Cadet
- Joined
- May 7, 2019
- Messages
- 5
Hi guys,
bit of a problem here.
A drive failed, other drives have subsequently shown signs of bad health.
I however don't understand the current resiliency of the tank, i'd like some help in understanding that.
Problem is the following: resilver takes *a long* time, you can see 5 hours in the timer, but that's just cuz i just reset the system, otherwise it would not show anything.
I have problems with 2 other disks, as can be seen from here:
Smart data of the devices is as follows:
DA10 (this takes a few seconds to even come up, the times it does come up) - the disk is new
DA8
Resilvering drive is DA10.
Any help, recommendations, suggestions, even hugs - everything's accepted.
Thanks!
bit of a problem here.
A drive failed, other drives have subsequently shown signs of bad health.
I however don't understand the current resiliency of the tank, i'd like some help in understanding that.
pool: tank
state: ONLINE
status: One or more devices is currently being resilvered. The pool will
continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
scan: resilver in progress since Sat Dec 4 09:20:33 2021
11.7T scanned at 9.28G/s, 7.19T issued at 1.25G/s, 32.9T total
994G resilvered, 21.88% done, 05:51:26 to go
config:
NAME STATE READ WRITE CKSUM
tank ONLINE 0 0 0
raidz1-0 ONLINE 0 0 0
gptid/5c023dac-7ed1-11e7-b44d-645106d8d754 ONLINE 0 0 0
gptid/0c404deb-54db-11ec-af93-000c2918b177 ONLINE 0 0 0 (resilvering)
gptid/5d9af434-7ed1-11e7-b44d-645106d8d754 ONLINE 0 0 0
gptid/5e46a3dd-7ed1-11e7-b44d-645106d8d754 ONLINE 0 0 0
raidz1-1 ONLINE 0 0 0
gptid/8ada6537-8031-11e7-a999-000c2907ef12 ONLINE 0 0 0
gptid/8c05a1b5-8031-11e7-a999-000c2907ef12 ONLINE 0 0 0
gptid/8d23771a-8031-11e7-a999-000c2907ef12 ONLINE 0 0 0
gptid/e8fd3923-5159-11ec-92fa-000c2918b177 ONLINE 0 0 0
logs
gptid/d58f7ae3-cf25-11e8-9dd2-000c2918b177 ONLINE 0 0 0
cache
gptid/0d898ed7-51e2-11ec-82e4-000c2918b177 ONLINE 0 0 0
errors: No known data errors
Problem is the following: resilver takes *a long* time, you can see 5 hours in the timer, but that's just cuz i just reset the system, otherwise it would not show anything.
I have problems with 2 other disks, as can be seen from here:
Device: /dev/da10 [SAT], failed to read SMART Attribute Data.
Device: /dev/da8 [SAT], Self-Test Log error count increased from 5 to 6.
Smart data of the devices is as follows:
DA10 (this takes a few seconds to even come up, the times it does come up) - the disk is new
=== START OF INFORMATION SECTION ===
Device Model: WDC WD60EDAZ-11U78B0
Serial Number: WD-
LU WWN Device Id: 5 0014ee 214059afd
Firmware Version: 80.00A80
User Capacity: 6,001,175,126,016 bytes [6.00 TB]
Sector Sizes: 512 bytes logical, 4096 bytes physical
Rotation Rate: 5400 rpm
Form Factor: 3.5 inches
TRIM Command: Available, deterministic, zeroed
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: ACS-3 T13/2161-D revision 5
SATA Version is: SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is: Sun Dec 12 23:57:42 2021 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 100 253 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 220 220 021 Pre-fail Always - 4000
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 9
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0
9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 206
10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 8
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 2
193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 8
194 Temperature_Celsius 0x0022 113 112 000 Old_age Always - 37
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 100 253 000 Old_age Offline - 0
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Interrupted (host reset) 90% 182 -
# 2 Short offline Completed without error 00% 158 -
# 3 Short offline Completed without error 00% 134 -
# 4 Short offline Completed without error 00% 110 -
# 5 Short offline Completed without error 00% 87 -
# 6 Short offline Completed without error 00% 63 -
# 7 Short offline Completed without error 00% 39 -
# 8 Extended offline Interrupted (host reset) 10% 29 -
Device Model: WDC WD60EDAZ-11U78B0
Serial Number: WD-
LU WWN Device Id: 5 0014ee 214059afd
Firmware Version: 80.00A80
User Capacity: 6,001,175,126,016 bytes [6.00 TB]
Sector Sizes: 512 bytes logical, 4096 bytes physical
Rotation Rate: 5400 rpm
Form Factor: 3.5 inches
TRIM Command: Available, deterministic, zeroed
Device is: Not in smartctl database [for details use: -P showall]
ATA Version is: ACS-3 T13/2161-D revision 5
SATA Version is: SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is: Sun Dec 12 23:57:42 2021 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 100 253 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 220 220 021 Pre-fail Always - 4000
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 9
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 0
9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 206
10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 8
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 2
193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 8
194 Temperature_Celsius 0x0022 113 112 000 Old_age Always - 37
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0008 100 253 000 Old_age Offline - 0
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Interrupted (host reset) 90% 182 -
# 2 Short offline Completed without error 00% 158 -
# 3 Short offline Completed without error 00% 134 -
# 4 Short offline Completed without error 00% 110 -
# 5 Short offline Completed without error 00% 87 -
# 6 Short offline Completed without error 00% 63 -
# 7 Short offline Completed without error 00% 39 -
# 8 Extended offline Interrupted (host reset) 10% 29 -
DA8
=== START OF INFORMATION SECTION ===
Model Family: Western Digital Blue
Device Model: WDC WD60EZRZ-00RWYB1
Serial Number: WD-
LU WWN Device Id: 5 0014ee 262d0d748
Firmware Version: 80.00A80
User Capacity: 6,001,175,126,016 bytes [6.00 TB]
Sector Sizes: 512 bytes logical, 4096 bytes physical
Rotation Rate: 5700 rpm
Device is: In smartctl database [for details use: -P show]
ATA Version is: ACS-2, ACS-3 T13/2161-D revision 3b
SATA Version is: SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is: Mon Dec 13 00:01:16 2021 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 190 184 021 Pre-fail Always - 9466
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 109
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 100 253 000 Old_age Always - 0
9 Power_On_Hours 0x0032 050 050 000 Old_age Always - 36834
10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 108
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 100
193 Load_Cycle_Count 0x0032 001 001 000 Old_age Always - 1412871
194 Temperature_Celsius 0x0022 114 102 000 Old_age Always - 38
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 1
198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 1
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 252
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 1
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed: read failure 90% 36810 620535024
# 2 Short offline Completed: read failure 90% 36786 620535024
# 3 Short offline Completed without error 00% 36762 -
# 4 Short offline Completed: read failure 90% 36738 620535024
# 5 Short offline Completed without error 00% 36714 -
# 6 Short offline Completed: read failure 90% 36690 620535024
# 7 Short offline Completed: read failure 90% 36666 620535024
# 8 Extended offline Completed: read failure 10% 36652 620535024
# 9 Short offline Completed without error 00% 36619 -
#10 Short offline Completed without error 00% 36595 -
Model Family: Western Digital Blue
Device Model: WDC WD60EZRZ-00RWYB1
Serial Number: WD-
LU WWN Device Id: 5 0014ee 262d0d748
Firmware Version: 80.00A80
User Capacity: 6,001,175,126,016 bytes [6.00 TB]
Sector Sizes: 512 bytes logical, 4096 bytes physical
Rotation Rate: 5700 rpm
Device is: In smartctl database [for details use: -P show]
ATA Version is: ACS-2, ACS-3 T13/2161-D revision 3b
SATA Version is: SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is: Mon Dec 13 00:01:16 2021 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 0
3 Spin_Up_Time 0x0027 190 184 021 Pre-fail Always - 9466
4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 109
5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 0
7 Seek_Error_Rate 0x002e 100 253 000 Old_age Always - 0
9 Power_On_Hours 0x0032 050 050 000 Old_age Always - 36834
10 Spin_Retry_Count 0x0032 100 100 000 Old_age Always - 0
11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 108
192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 100
193 Load_Cycle_Count 0x0032 001 001 000 Old_age Always - 1412871
194 Temperature_Celsius 0x0022 114 102 000 Old_age Always - 38
196 Reallocated_Event_Count 0x0032 200 200 000 Old_age Always - 0
197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 1
198 Offline_Uncorrectable 0x0030 200 200 000 Old_age Offline - 1
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 252
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 1
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed: read failure 90% 36810 620535024
# 2 Short offline Completed: read failure 90% 36786 620535024
# 3 Short offline Completed without error 00% 36762 -
# 4 Short offline Completed: read failure 90% 36738 620535024
# 5 Short offline Completed without error 00% 36714 -
# 6 Short offline Completed: read failure 90% 36690 620535024
# 7 Short offline Completed: read failure 90% 36666 620535024
# 8 Extended offline Completed: read failure 10% 36652 620535024
# 9 Short offline Completed without error 00% 36619 -
#10 Short offline Completed without error 00% 36595 -
Resilvering drive is DA10.
Any help, recommendations, suggestions, even hugs - everything's accepted.
Thanks!