HeloJunkie
Patron
- Joined
- Oct 15, 2014
- Messages
- 286
SYSTEM:
Supermicro Superserver 5028R-E1CR12L
Supermicro X10SRH-CLN4F Motherboard
1 x Intel Xeon E5-2640 V3 8 Core 2.66GHz
4 x 16GB PC4-17000 DDR4 2133Mhz Registered ECC
12 x 4TB HGST HDN724040AL 7200RPM NAS SATA Hard Drives
2 x 6 Drive RAIDZ2 VDEVs
LSI3008 SAS Controller - Flashed to IT Mode (Firmware Version 12.00.02.00)
LSI SAS3x28 SAS Expander
LSI9211-8i SAS Controller - Flashed to IT Mode (Firmware Version 20.00.02.00)
(connects to external JBOD enclosure)
Dual 920 Watt Platinum Power Supplies
16GB USB Thumb Drive for booting
Chelsio T580-SO-CR Dual 40Gbe NIC (Replication Connection to backup FreeNAS server)
Chelsio T520-SO-CR Dual 10Gbe NIC (Data connection to Plex server & media management server)
FreeNAS-11.0-U2 (e417d8aa5)
So this morning at 0120 hours, I received an email:
followed by another email:
Since I am running RAIDZ2, I made a note of it and went back to sleep. This morning when I came into work, I took a look at the pool:
And I took a look at /dev/da14:
So I am kind of at a loss as to where to look next for the unrecoverable error and unreadable pending sectors. Everything seems to be running well, no alerts on the system itself.
One weird thing was that I had no webui when I came in this morning. The system was working jsut fine (just nfs mounts), all the mounts were working, but I could not access the
web interface itself. I am using the old interface. It looked like django was the cause of the problem as nothing I could do would get it to stop and restart and eventually I had to
roll to my backup NAS and reboot the primary. As soon as I did, I got the web interface back again.
I am not sure if it is related, but thought I would throw it in as well, just in case.
Supermicro Superserver 5028R-E1CR12L
Supermicro X10SRH-CLN4F Motherboard
1 x Intel Xeon E5-2640 V3 8 Core 2.66GHz
4 x 16GB PC4-17000 DDR4 2133Mhz Registered ECC
12 x 4TB HGST HDN724040AL 7200RPM NAS SATA Hard Drives
2 x 6 Drive RAIDZ2 VDEVs
LSI3008 SAS Controller - Flashed to IT Mode (Firmware Version 12.00.02.00)
LSI SAS3x28 SAS Expander
LSI9211-8i SAS Controller - Flashed to IT Mode (Firmware Version 20.00.02.00)
(connects to external JBOD enclosure)
Dual 920 Watt Platinum Power Supplies
16GB USB Thumb Drive for booting
Chelsio T580-SO-CR Dual 40Gbe NIC (Replication Connection to backup FreeNAS server)
Chelsio T520-SO-CR Dual 10Gbe NIC (Data connection to Plex server & media management server)
FreeNAS-11.0-U2 (e417d8aa5)
So this morning at 0120 hours, I received an email:
Code:
The volume vol1 state is ONLINE: One or more devices has experienced an unrecoverable error. An attempt was made to correct the error. Applications are unaffected.
followed by another email:
Code:
Device: /dev/da14 [SAT], 7 Currently unreadable (pending) sectors
Since I am running RAIDZ2, I made a note of it and went back to sleep. This morning when I came into work, I took a look at the pool:
Code:
root@plexnas:~ # zpool status vol1 pool: vol1 state: ONLINE scan: scrub repaired 0 in 7h40m with 0 errors on Tue Aug 15 09:40:27 2017 config: NAME STATE READ WRITE CKSUM vol1 ONLINE 0 0 0 raidz2-0 ONLINE 0 0 0 gptid/f46fb4ec-ed62-11e4-a956-0cc47a31abcc ONLINE 0 0 0 gptid/f69f4e21-ed62-11e4-a956-0cc47a31abcc ONLINE 0 0 0 gptid/f8cde372-ed62-11e4-a956-0cc47a31abcc ONLINE 0 0 0 gptid/faeb3d6d-ed62-11e4-a956-0cc47a31abcc ONLINE 0 0 0 gptid/fd087ff0-ed62-11e4-a956-0cc47a31abcc ONLINE 0 0 0 gptid/ff28300a-ed62-11e4-a956-0cc47a31abcc ONLINE 0 0 0 raidz2-1 ONLINE 0 0 0 gptid/013d5491-ed63-11e4-a956-0cc47a31abcc ONLINE 0 0 0 gptid/0357b342-ed63-11e4-a956-0cc47a31abcc ONLINE 0 0 0 gptid/05811f51-ed63-11e4-a956-0cc47a31abcc ONLINE 0 0 0 gptid/079f5f22-ed63-11e4-a956-0cc47a31abcc ONLINE 0 0 0 gptid/09b81318-ed63-11e4-a956-0cc47a31abcc ONLINE 0 0 0 gptid/a82dda8c-ef5f-11e4-bb0a-0cc47a31abcc ONLINE 0 0 0 raidz2-2 ONLINE 0 0 0 gptid/120bb7bc-89a2-11e6-9e64-0cc47a31abcc ONLINE 0 0 0 gptid/12ce80a0-89a2-11e6-9e64-0cc47a31abcc ONLINE 0 0 0 gptid/13867ead-89a2-11e6-9e64-0cc47a31abcc ONLINE 0 0 0 gptid/14413602-89a2-11e6-9e64-0cc47a31abcc ONLINE 0 0 0 gptid/14f95eb4-89a2-11e6-9e64-0cc47a31abcc ONLINE 0 0 0 gptid/15af6956-89a2-11e6-9e64-0cc47a31abcc ONLINE 0 0 0 raidz2-3 ONLINE 0 0 0 gptid/d69a5dad-0ab4-11e7-9f3c-0cc47a31abcc ONLINE 0 0 0 gptid/d7b9b84e-0ab4-11e7-9f3c-0cc47a31abcc ONLINE 0 0 0 gptid/d8d769fb-0ab4-11e7-9f3c-0cc47a31abcc ONLINE 0 0 0 gptid/d9f58a88-0ab4-11e7-9f3c-0cc47a31abcc ONLINE 0 0 0 gptid/db11810d-0ab4-11e7-9f3c-0cc47a31abcc ONLINE 0 0 0 gptid/dc3427cd-0ab4-11e7-9f3c-0cc47a31abcc ONLINE 0 0 0 errors: No known data errors
And I took a look at /dev/da14:
Code:
root@plexnas:~ # smartctl -A /dev/da14 smartctl 6.5 2016-05-07 r4318 [FreeBSD 11.0-STABLE amd64] (local build) Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org === START OF READ SMART DATA SECTION === SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 613 3 Spin_Up_Time 0x0027 183 181 021 Pre-fail Always - 7808 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 18 5 Reallocated_Sector_Ct 0x0033 200 200 140 Pre-fail Always - 5 7 Seek_Error_Rate 0x002e 200 200 000 Old_age Always - 37 9 Power_On_Hours 0x0032 090 090 000 Old_age Always - 7762 10 Spin_Retry_Count 0x0032 100 253 000 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 253 000 Old_age Always - 0 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 18 192 Power-Off_Retract_Count 0x0032 200 200 000 Old_age Always - 15 193 Load_Cycle_Count 0x0032 200 200 000 Old_age Always - 98 194 Temperature_Celsius 0x0022 129 119 000 Old_age Always - 23 196 Reallocated_Event_Count 0x0032 196 196 000 Old_age Always - 4 197 Current_Pending_Sector 0x0032 200 200 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 100 253 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 31
So I am kind of at a loss as to where to look next for the unrecoverable error and unreadable pending sectors. Everything seems to be running well, no alerts on the system itself.
One weird thing was that I had no webui when I came in this morning. The system was working jsut fine (just nfs mounts), all the mounts were working, but I could not access the
web interface itself. I am using the old interface. It looked like django was the cause of the problem as nothing I could do would get it to stop and restart and eventually I had to
roll to my backup NAS and reboot the primary. As soon as I did, I got the web interface back again.
I am not sure if it is related, but thought I would throw it in as well, just in case.