Hello everyone,
Hopefully someone has some additional insight for me - I have spent a couple of hours going through the forums,
but none of the similar issues I found matches my problem exactly, or the proposed solution just isn't it...
Thank you for reading -- hopefully I listed all the information you need, below.
Issue summary: TrueNAS does not recognize new inserted disk after disk failure.
System:
ProLiant ML350 Gen10
2x TrueNAS OS drives, SSD, Mirrored
10x Data drives, ATA, 14TB Seagate Ironwolf Pro
Raid: HPE Smart Array P816i-a SR Gen10
Intel(R) Xeon(R) Gold 5218R CPU @ 2.10GHz
32GB RAM, 1 DIMM
HPE Ethernet 10Gb 2-port 562SFP+ Adapter
HP ILO shows all 10 data drives listed as being in good working order
HP ILO shows the 10 data drives as Unconfigured - thi sis done by Truenas
TrueNAS:
TrueNAS-13.0-RELEASE
Data store: 1 vdev; RAIDZ2, 10 disks
Issue detail:
Disk da5 died after power outage.
Now, after replacing the disk and wiring:
- /Storage/Pools status shows degraded, da5 is Faulted and is listed as /dev/gptid/###
- /Storage/Disks shows da11 with serial number, disk size 0K, pool N/A
- (New) Disk specs: Disk Type: UNKNOWN Model: Generic- SD/MMC CRW
This setup has been working properly for months
After replacing the faulted drive with a new drive, the UI does not update/respond.
The same serial number and /dev/gptid hook are showing after refresh, off/onlining, replacement.
HP ILO has the replacement drive listed as being in good working order (the faulted was showing as Smart Failure Imminent)
TShooting steps:
- tried multiple replacement disks
- replaced cable
- took disk offline -- no change in console UI
- replaced disk -- no change; new disk not recognized
- wiped new disk - no change
- put old disk back, onlined disk and scrubbed pool -- no change
- took disk offline, replaced with (other) new disk and Onlined -- no change
- I tried the 'replace option' as well; it shows me the options:
- da5 twice and d11 once.
'Alerts' notes:
CRITICAL -- Pool DATA state is DEGRADED: One or more devices are faulted in response to persistent errors.
Sufficient replicas exist for the pool to continue functioning in a degraded state.
The following devices are not healthy: Disk ATA ST14000NT001-3LW ZR9088QL is FAULTED
My box 'o tricks is empty - what am I missing?
The other 9 disks contain production data; changing/endangering the pool is not an quick option (Without offloading all data first, which I hope to avoid if at all possible...)
Hopefully someone has some additional insight for me - I have spent a couple of hours going through the forums,
but none of the similar issues I found matches my problem exactly, or the proposed solution just isn't it...
Thank you for reading -- hopefully I listed all the information you need, below.
Issue summary: TrueNAS does not recognize new inserted disk after disk failure.
System:
ProLiant ML350 Gen10
2x TrueNAS OS drives, SSD, Mirrored
10x Data drives, ATA, 14TB Seagate Ironwolf Pro
Raid: HPE Smart Array P816i-a SR Gen10
Intel(R) Xeon(R) Gold 5218R CPU @ 2.10GHz
32GB RAM, 1 DIMM
HPE Ethernet 10Gb 2-port 562SFP+ Adapter
HP ILO shows all 10 data drives listed as being in good working order
HP ILO shows the 10 data drives as Unconfigured - thi sis done by Truenas
TrueNAS:
TrueNAS-13.0-RELEASE
Data store: 1 vdev; RAIDZ2, 10 disks
Issue detail:
Disk da5 died after power outage.
Now, after replacing the disk and wiring:
- /Storage/Pools status shows degraded, da5 is Faulted and is listed as /dev/gptid/###
- /Storage/Disks shows da11 with serial number, disk size 0K, pool N/A
- (New) Disk specs: Disk Type: UNKNOWN Model: Generic- SD/MMC CRW
This setup has been working properly for months
After replacing the faulted drive with a new drive, the UI does not update/respond.
The same serial number and /dev/gptid hook are showing after refresh, off/onlining, replacement.
HP ILO has the replacement drive listed as being in good working order (the faulted was showing as Smart Failure Imminent)
TShooting steps:
- tried multiple replacement disks
- replaced cable
- took disk offline -- no change in console UI
- replaced disk -- no change; new disk not recognized
- wiped new disk - no change
- put old disk back, onlined disk and scrubbed pool -- no change
- took disk offline, replaced with (other) new disk and Onlined -- no change
- I tried the 'replace option' as well; it shows me the options:
- da5 twice and d11 once.
'Alerts' notes:
CRITICAL -- Pool DATA state is DEGRADED: One or more devices are faulted in response to persistent errors.
Sufficient replicas exist for the pool to continue functioning in a degraded state.
The following devices are not healthy: Disk ATA ST14000NT001-3LW ZR9088QL is FAULTED
My box 'o tricks is empty - what am I missing?
The other 9 disks contain production data; changing/endangering the pool is not an quick option (Without offloading all data first, which I hope to avoid if at all possible...)