Hi all,
Had an accident with the NAS today, a power cable got knocked out of the back of the UPS while I was fixing another device and the NAS lost power.
Came back online straight away with no errors, but then about 2 hours later I got an alert:
CRITICAL
Device: /dev/da0 [SAT], 8 Currently unreadable (pending) sectors.
da0 is a member of my 'primary_array' pool - but this is showing no Zpool errors:
SMART shows just those 8 'Pending' sectors.
Is it safe to continue with this disk and just keep an eye on this? I have to assume the two incidents (power loss and this alert) are in some way related.
Had an accident with the NAS today, a power cable got knocked out of the back of the UPS while I was fixing another device and the NAS lost power.
Came back online straight away with no errors, but then about 2 hours later I got an alert:
CRITICAL
Device: /dev/da0 [SAT], 8 Currently unreadable (pending) sectors.
da0 is a member of my 'primary_array' pool - but this is showing no Zpool errors:
Code:
Geom name: da0p2 Providers: 1. Name: gptid/6fcfd1c3-39c4-11ea-a333-0cc47aab393c
Code:
pool: primary_array state: ONLINE scan: resilvered 1.42T in 0 days 04:16:34 with 0 errors on Sat Jan 18 17:59:23 2020 config: NAME STATE READ WRITE CKSUM primary_array ONLINE 0 0 0 mirror-0 ONLINE 0 0 0 gptid/65bed2e9-39f8-11ea-a333-0cc47aab393c ONLINE 0 0 0 gptid/6fcfd1c3-39c4-11ea-a333-0cc47aab393c ONLINE 0 0 0 errors: No known data errors
SMART shows just those 8 'Pending' sectors.
Code:
root@nas[/]# smartctl -A /dev/da0 smartctl 7.0 2018-12-30 r4883 [FreeBSD 11.3-RELEASE-p5 amd64] (local build) Copyright (C) 2002-18, Bruce Allen, Christian Franke, www.smartmontools.org === START OF READ SMART DATA SECTION === SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x000b 100 100 016 Pre-fail Always - 0 2 Throughput_Performance 0x0005 138 138 054 Pre-fail Offline - 100 3 Spin_Up_Time 0x0007 134 134 024 Pre-fail Always - 481 (Average 497) 4 Start_Stop_Count 0x0012 100 100 000 Old_age Always - 66 5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail Always - 0 7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail Always - 0 8 Seek_Time_Performance 0x0005 128 128 020 Pre-fail Offline - 18 9 Power_On_Hours 0x0012 100 100 000 Old_age Always - 2662 10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 66 192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 139 193 Load_Cycle_Count 0x0012 100 100 000 Old_age Always - 139 194 Temperature_Celsius 0x0002 166 166 000 Old_age Always - 36 (Min/Max 11/53) 196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 0 197 Current_Pending_Sector 0x0022 100 100 000 Old_age Always - 8 198 Offline_Uncorrectable 0x0008 100 100 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age Always - 0
Is it safe to continue with this disk and just keep an eye on this? I have to assume the two incidents (power loss and this alert) are in some way related.