c32767a
Patron
- Joined
- Dec 13, 2012
- Messages
- 371
I have a system that's been in service for quite some time with no issues. It's never been mistreated (eg over temp, power issues, etc).
It serves as a lab test/bench box.
Hardware is X9SRL-F , Intel(R) Xeon(R) CPU E5-1620 v2 @ 3.70GHz
Memory is all Micron 36KSF2G72PZ-1G6.
After upgrading to 12.0-U5 from U2, I'm seeing memory errors in the logs:
The Address is consistent across all error messages. This address maps to DIMMA1 in the dmidecode data and I've replaced that DIMM, yet the errors continue.
Before I start replacing the guts of the entire system, these errors correlate precisely with the upgrade to -U5. There is no chance there's a defect or change on the TrueNas side that could cause this? Even if it's just something cosmetic like previously unreported memory errors are now being reported?
It serves as a lab test/bench box.
Hardware is X9SRL-F , Intel(R) Xeon(R) CPU E5-1620 v2 @ 3.70GHz
Memory is all Micron 36KSF2G72PZ-1G6.
After upgrading to 12.0-U5 from U2, I'm seeing memory errors in the logs:
Code:
Aug 21 12:21:22 nastest MCA: Bank 7, Status 0x8c00004000010093 Aug 21 12:21:22 nastest MCA: Global Cap 0x0000000001000c15, Status 0x0000000000000000 Aug 21 12:21:22 nastest MCA: Vendor "GenuineIntel", ID 0x306e4, APIC ID 0 Aug 21 12:21:22 nastest MCA: CPU 0 COR (1) RD channel 3 memory error Aug 21 12:21:22 nastest MCA: Address 0x18a9215c0 Aug 21 12:21:22 nastest MCA: Misc 0x214042c286 Aug 21 12:23:21 nastest MCA: Bank 7, Status 0x8c00004000010093 Aug 21 12:23:21 nastest MCA: Global Cap 0x0000000001000c15, Status 0x0000000000000000 Aug 21 12:23:21 nastest MCA: Vendor "GenuineIntel", ID 0x306e4, APIC ID 0 Aug 21 12:23:21 nastest MCA: CPU 0 COR (1) RD channel 3 memory error Aug 21 12:23:21 nastest MCA: Address 0x18a9215c0 Aug 21 12:23:21 nastest MCA: Misc 0x2140545486 Aug 21 12:27:04 nastest MCA: Bank 7, Status 0x8c00004000010093
The Address is consistent across all error messages. This address maps to DIMMA1 in the dmidecode data and I've replaced that DIMM, yet the errors continue.
Before I start replacing the guts of the entire system, these errors correlate precisely with the upgrade to -U5. There is no chance there's a defect or change on the TrueNas side that could cause this? Even if it's just something cosmetic like previously unreported memory errors are now being reported?