Hi! I'm in a bit of a pickle, I can't install TrueNAS Scale due to "Uncorrectable ECC" errors (reported as kernel panics during the installation), but I don't understand what's to blame, it might've been my poor choice of RAM.
Hardware:
System info:
Problem:
I'm installing TrueNAS scale off of a USB stick, I get to where it starts extracting stuff and then it fails with kernel panics:
I see it reported in the "Health event log" as "Uncorrectable ECC @ DIMMB2 - Assertion".
What I tried:
1. Ran memtest86 for 8 hours with both sticks, no errors, all good. Not even in the event log.
2. Tried to install TrueNAS with one stick only, in both different slots, both sticks - no difference, still fails.
3. Tried it with a spare consumer (non-ECC) DDR4 2666MHz RAM, worked pretty well, was able to install TrueNAS and boot into it.
4. Updated the BIOS to the latest available (I had 2.0c, but I saw that the support for 2667MHz RAM was added later), no difference
5. Updated the firmware to the latest just in case, no difference.
Memory is reported correctly is BIOS - 64GB, running at 2400MHz.
Assumptions:
Not sure what to try next, so any help would be much appreciated, thanks!
Hardware:
CPU: Intel Xeon E3-1220 V6 (used, bundled with the motherboard)
Motherboard: Supermicro X11SSM-F (used, bundled with the CPU)
Memory: Samsung 2x32GB ECC UDIMM DDR4 3200 (M391A4G43BB1-CWE) (new, exact product that I bought)
Boot drive: x2 2.5" Kingston 120GB SSD (new)
Power supply: Seasonic PRIME PX-650 650 Watt Platinum (new, for 10 HDDs)
System info:
Firmware Revision: 01.63 Firmware Build Time: 09/04/2020BIOS Version: 2.7 BIOS Build Time: 12/06/2021 Redfish Version: 1.0.1
Problem:
I'm installing TrueNAS scale off of a USB stick, I get to where it starts extracting stuff and then it fails with kernel panics:
I see it reported in the "Health event log" as "Uncorrectable ECC @ DIMMB2 - Assertion".
What I tried:
1. Ran memtest86 for 8 hours with both sticks, no errors, all good. Not even in the event log.
2. Tried to install TrueNAS with one stick only, in both different slots, both sticks - no difference, still fails.
3. Tried it with a spare consumer (non-ECC) DDR4 2666MHz RAM, worked pretty well, was able to install TrueNAS and boot into it.
4. Updated the BIOS to the latest available (I had 2.0c, but I saw that the support for 2667MHz RAM was added later), no difference
5. Updated the firmware to the latest just in case, no difference.
Memory is reported correctly is BIOS - 64GB, running at 2400MHz.
Assumptions:
- I did get both sticks from one place, shipped together. Could they be faulty? How can I rule it out if I don't have any spare ECC memory? I assume the fact that it booted with non-ECC memory doesn't change much since there could be unreported errors?
- I did get two 32GB sticks of 3200MHz because I thought I'd get clocked down anyway and I'd be able to re-use it in the future whenever I upgrade, and it was much cheaper than the alternatives anyway. I know that supermicro doesn't recommend using 2x32GB sticks, but I thought it was just an outdated instruction. I also looked for this specific model and some users reported that it worked fine. Could I have chosen bad memory?
Not sure what to try next, so any help would be much appreciated, thanks!