Hi All,
Got back in last Saturday to find TrueNAS warning me that the pool was in a degraded state. On closer inspection, two of the four disks had been removed. Cue panic mode whilst I frantically made sure I had a backup of the data.
Whilst that was ongoing I got on with ordering some replacement drives. I've ended up with 2x 10TB WD Gold. They came with a 5-year warranty and were cheaper than any of the Red or Red Pro drives.
After backing up the data I swapped out the failed drives and hit replace. Pool status seems OK.
Issues/Questions:
- I haven't burnt the drives in or checked them beyond running a quick SMART test from the GUI. I've now started a "Manual S.M.A.R.T. Test" of type "long" on each of the drives. Will this be enough, or should I do a more comprehensive test? If so, how should I approach it: offline the drives individually and then test?
- I noticed the WD Gold uses significantly more power than the WD Red it replaced (spec says 10W). Would four of these drives be too much for the Microserver Gen 8 PSU (specs in sig)?
- smartctl -P show /dev/ada2 gives the following, which I've never seen before. Is it just that the model isn't in the database?
Code:
root@freenas[~]# smartctl -P show /dev/ada2
smartctl 7.1 2019-12-30 r5022 [FreeBSD 12.1-STABLE amd64] (local build)
Copyright (C) 2002-19, Bruce Allen, Christian Franke, www.smartmontools.org

No presets are defined for this drive. Its identity strings:
MODEL: WDC WD102KRYZ-01A5AB0
FIRMWARE: 01.01H01
do not match any of the known regular expressions.
Use -P showall to list all known regular expressions.
- Of the two drives that were booted out of the array one has the following:
Code:
Error 1 occurred at disk power-on lifetime: 25303 hours (1054 days + 7 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  10 51 08 90 01 40 e0  Error: IDNF at LBA = 0x00400190 = 4194704

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  ca 00 08 90 01 40 e0 00      09:09:41.315  WRITE DMA
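As a sanity check on the SMART error entry above, here is a minimal Python sketch of how the register dump maps to the reported LBA. It assumes the usual 28-bit ATA layout (SN = bits 0-7, CL = bits 8-15, CH = bits 16-23, low nibble of DH = bits 24-27); the function name is made up for illustration.

```python
def lba28_from_registers(sn, cl, ch, dh):
    """Combine ATA task-file registers into a 28-bit LBA.

    Assumed layout: SN holds bits 0-7, CL bits 8-15, CH bits 16-23,
    and only the low nibble of DH carries LBA bits 24-27.
    """
    return ((dh & 0x0F) << 24) | (ch << 16) | (cl << 8) | sn


# Register values copied from the error log: SN CL CH DH = 90 01 40 e0
lba = lba28_from_registers(0x90, 0x01, 0x40, 0xE0)
print(hex(lba), lba)  # 0x400190 4194704
```

The result agrees with the "IDNF at LBA = 0x00400190 = 4194704" line, so the log entry is at least internally consistent.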
From the Syslog I see the following:
Code:
Sep 12 15:43:59 freenas (ada3:ata3:0:1:0): RES: 51 10 c0 d2 97 20 20 01 00 08 00
Sep 12 15:43:59 freenas (ada3:ata3:0:1:0): Retrying command, 3 more tries remain
Sep 12 15:46:17 freenas (ada3:ata3:0:1:0): WRITE_DMA48. ACB: 35 00 f8 0f 43 40 78 00 00 00 08 00
Sep 12 15:46:17 freenas (ada3:ata3:0:1:0): CAM status: ATA Status Error
Sep 12 15:46:17 freenas (ada3:ata3:0:1:0): ATA status: 51 (DRDY SERV ERR), error: 10 (IDNF )
Sep 12 15:46:17 freenas (ada3:ata3:0:1:0): RES: 51 10 f8 0f 43 78 78 00 00 08 00
Sep 12 15:46:17 freenas (ada3:ata3:0:1:0): Retrying command, 3 more tries remain
Sep 12 15:47:18 freenas (ada3:ata3:0:1:0): FLUSHCACHE48. ACB: ea 00 00 00 00 40 00 00 00 00 00 00
Sep 12 15:47:18 freenas (ada3:ata3:0:1:0): CAM status: Command timeout
Sep 12 15:47:18 freenas (ada3:ata3:0:1:0): Retrying command, 0 more tries remain
Sep 12 15:47:18 freenas ada2 at ata3 bus 0 scbus1 target 0 lun 0
Sep 12 15:47:18 freenas ada2: <WDC WD40EFRX-68WT0N0 82.00A82> s/n WD-WCC4E3LHE7R4 detached
Sep 12 15:47:18 freenas ada3 at ata3 bus 0 scbus1 target 1 lun 0
Sep 12 15:47:18 freenas ada3: <WDC WD40EFRX-68WT0N0 82.00A82> s/n WD-WCC4E7TZH262 detached
Sep 12 15:47:18 freenas GEOM_MIRROR: Device swap0: provider ada2p1 disconnected.
Sep 12 15:47:18 freenas GEOM_MIRROR: Device swap0: provider ada3p1 disconnected.
Sep 12 15:47:18 freenas g_access(961): provider gptid/f0df517c-65c7-11e9-89d2-000f53160620 has error 6 set
Sep 12 15:47:18 freenas g_access(961): provider ada3 has error 6 set
Sep 12 15:47:18 freenas g_access(961): provider ada2 has error 6 set
Sep 12 15:47:18 freenas (ada3:ata3:0:1:0): Periph destroyed
Sep 12 15:47:18 freenas (ada2:ata3:0:0:0): Periph destroyed
Sep 12 15:47:23 freenas 1 2020-09-12T15:47:23.872590+00:00 freenas.ransome-pearce savecore 9116 - - /dev/ada1p1: Operation not permitted
Sep 12 15:47:25 freenas syslog-ng[9221]: syslog-ng starting up; version='3.25.1'
Sep 12 15:47:25 freenas GEOM_ELI: Device mirror/swap0.eli destroyed.
Sep 12 15:47:25 freenas GEOM_MIRROR: Device swap0: provider destroyed.
Sep 12 15:47:25 freenas GEOM_MIRROR: Device swap0 destroyed.
Sep 12 15:47:25 freenas kernel: pid 1200 (syslog-ng), jid 0, uid 0: exited on signal 6 (core dumped)
Sep 12 15:47:25 freenas GEOM_ELI: Device ada1p1.eli created.
Sep 12 15:47:25 freenas GEOM_ELI: Encryption: AES-XTS 128
Sep 12 15:47:25 freenas GEOM_ELI: Crypto: hardware
Sep 12 15:47:27 freenas 1 2020-09-12T15:47:27.842730+00:00 freenas.ransome-pearce savecore 9264 - - /dev/ada1p1: Operation not permitted
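To help read the syslog, here is a rough Python sketch that does two things: it decodes the ATA status and error bytes, and it groups the "adaN at ataM bus B ... target T" lines by channel. The bit-name tables are my assumption based on the standard ATA register layout (the log itself only confirms 0x51 = DRDY SERV ERR and 0x10 = IDNF), and the regex and function names are invented for this example.

```python
import re
from collections import defaultdict

# Assumed bit names for the ATA status and error registers.
STATUS_BITS = {0x80: "BSY", 0x40: "DRDY", 0x20: "DF", 0x10: "SERV",
               0x08: "DRQ", 0x04: "CORR", 0x02: "IDX", 0x01: "ERR"}
ERROR_BITS = {0x80: "ICRC", 0x40: "UNC", 0x20: "MC", 0x10: "IDNF",
              0x08: "MCR", 0x04: "ABRT", 0x02: "NM", 0x01: "ILI"}

# Matches lines such as "ada2 at ata3 bus 0 scbus1 target 0 lun 0".
ATTACH_RE = re.compile(r"(ada\d+) at (ata\d+) bus (\d+) scbus\d+ target (\d+)")


def decode(value, table):
    """Return the names of all bits set in value, highest bit first."""
    return [name for bit, name in table.items() if value & bit]


def drives_by_channel(lines):
    """Group (device, target) pairs by (controller, bus)."""
    channels = defaultdict(list)
    for line in lines:
        match = ATTACH_RE.search(line)
        if match:
            dev, ctrl, bus, target = match.groups()
            channels[(ctrl, int(bus))].append((dev, int(target)))
    return dict(channels)


# Two lines copied from the syslog above.
log = [
    "Sep 12 15:47:18 freenas ada2 at ata3 bus 0 scbus1 target 0 lun 0",
    "Sep 12 15:47:18 freenas ada3 at ata3 bus 0 scbus1 target 1 lun 0",
]

print(decode(0x51, STATUS_BITS))  # ['DRDY', 'SERV', 'ERR']
print(decode(0x10, ERROR_BITS))   # ['IDNF']
print(drives_by_channel(log))     # {('ata3', 0): [('ada2', 0), ('ada3', 1)]}
```

Going by the log, ada2 and ada3 are targets 0 and 1 on the same ata3 channel and detach in the same second, right after the FLUSHCACHE48 timeout on ada3. That might mean ada2 went down with the channel rather than failing on its own, but that is a guess on my part.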
The other shows no such error. Is it possible that it was incorrectly kicked out? The drives were bought back in 2015, so I'm thinking replacement is probably a good idea anyway. I'm just unsure whether I should trust any data to that drive.
In good news, it looks like resilver works just fine in TrueNAS-12.0-BETA2.1!
System is as follows currently:
OS: TrueNAS-12.0-BETA2.1
HP Microserver Gen 8
CPU: Intel(R) Xeon(R) CPU E3-1230 V2 @ 3.30GHz
MEM: 16GB (2x8GB ECC)
OS Drive: 1x Intel® SSD 530 120GB
Storage: 2x 4TB WD Red, 2x 10TB WD Gold
RAID: ZFS RaidZ2
NET: Chelsio T320, 2 ports (10GB single DAC link)