Hi All
I've just had a report back from SuperMicro advising that there is a known Intel CPU issue, affecting all hardware vendors, which is related to the Caterr_IERR error we are seeing in the BMC health logs and which is triggering the host reboot.
Their fix is to upgrade to the latest version of the BIOS, which includes some microcode fixes to patch the CPU.
Current BIOS version is 3.2
Server SUPERMICRO SYS-2029U-E1CRTP
Upgrade BIOS Version - 3.3
Link to our board update -
https://www.supermicro.com.tw/en/products/motherboard/X11DPU
See the attachment from Intel.
We will be flashing tonight and will update after a few days of use.
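For anyone wanting to confirm the flash took, here is a rough Python sketch of how the reported BIOS version could be checked against the 3.3 target. It assumes dmidecode is installed and run as root, and that the version string starts with plain dotted numbers like "3.3"; the function names are just made up for this example.

```python
import re
import subprocess

TARGET_BIOS = "3.3"  # the version SuperMicro advised upgrading to


def parse_version(text):
    """Turn a string like '3.3' into (3, 3) so versions compare numerically.

    Assumption: the BIOS version is dotted numbers; any trailing letters
    are ignored.
    """
    return tuple(int(part) for part in re.findall(r"\d+", text))


def bios_is_current(reported, target=TARGET_BIOS):
    """Return True if the reported BIOS version is at or above the target."""
    return parse_version(reported) >= parse_version(target)


def read_bios_version():
    """Ask dmidecode for the BIOS version (needs root and dmidecode present)."""
    result = subprocess.run(
        ["dmidecode", "-s", "bios-version"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout.strip()


# Example use on the server itself:
#   print(bios_is_current(read_bios_version()))
```

Comparing as number tuples rather than plain strings means a later "3.10" would still sort above "3.9".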
At present our system has been up for over 25 days using MTU 9000. The only change we have made so far is updating FreeNAS to the latest version, and the crashes have stopped.
That may well just be a coincidence.
Cheers
G