I've had this as a semi-recurring issue for a time, and am only posting now since I've tried about everything I've read from searching this forum, but could use a more thorough approach that I'll need help for. In particular, I'm pretty sure the drive is fine.
First, the system:
Version: TrueNAS core 13.0-U6
CPU: i7-7700
Motherboard: B250
RAM: 24GB DDR4
Drives: 10x 4TB Ironwolf (3 are ironwolf pro), 1 nvme boot drive and 1 sata SSD
HBA: SAS9207-8i
NIC: intel dual 10g (I dont remember the exact name, I used the recommended hardware list)
The problem: IO errors causing a drive to fault. At least this is what the TrueNas Alert tells me. checking the SMART data for the drive tells me the drive is working fine. And shuffling cables and a reset usually calm it down for a little while. But then a day or a week or a month later, it'll do it again.
Observations: This seems to happen almost exclusively to drives plugged into the HBA. It may be actually exclusively, as I cant remember it ever happening to a drive plugged into the mobo. But it may have a while back.
Things I've tried: Different cables, thought I havent tried *every* cable combination, so this may still be the solution.
Different HBA. I have another HBA, a SAS9220-8i.
Upgrading the PSU.
All ove the above solutions work for a time. But then the problem comes back.
I'd like some help to identify what the problem actually is. Maybe it is just the cables this whole time. Maybe it's the SAS card, maybe theres something with my combination of hardware (10 drives off this hardware may be more than it should be able to handle). But I'd like your help figuring out what the problem actually is, so I'm not flailing in the dark.
One more note. I'm not particularly good with console commands, so if you're asking me to provide you with info, best to assume I don't know the command and provide that as well.
First, the system:
Version: TrueNAS core 13.0-U6
CPU: i7-7700
Motherboard: B250
RAM: 24GB DDR4
Drives: 10x 4TB Ironwolf (3 are ironwolf pro), 1 nvme boot drive and 1 sata SSD
HBA: SAS9207-8i
NIC: intel dual 10g (I dont remember the exact name, I used the recommended hardware list)
The problem: IO errors causing a drive to fault. At least this is what the TrueNas Alert tells me. checking the SMART data for the drive tells me the drive is working fine. And shuffling cables and a reset usually calm it down for a little while. But then a day or a week or a month later, it'll do it again.
Observations: This seems to happen almost exclusively to drives plugged into the HBA. It may be actually exclusively, as I cant remember it ever happening to a drive plugged into the mobo. But it may have a while back.
Things I've tried: Different cables, thought I havent tried *every* cable combination, so this may still be the solution.
Different HBA. I have another HBA, a SAS9220-8i.
Upgrading the PSU.
All ove the above solutions work for a time. But then the problem comes back.
I'd like some help to identify what the problem actually is. Maybe it is just the cables this whole time. Maybe it's the SAS card, maybe theres something with my combination of hardware (10 drives off this hardware may be more than it should be able to handle). But I'd like your help figuring out what the problem actually is, so I'm not flailing in the dark.
One more note. I'm not particularly good with console commands, so if you're asking me to provide you with info, best to assume I don't know the command and provide that as well.