sylaan
Cadet
- Joined
- Mar 9, 2024
- Messages
- 9
Hello all,
I have a problem upgrading my TrueNAS SCALE system to the latest 23.10 corbia version. I had an older version of bluefin so after reading on the upgrade path, I upgraded the system without issues via the GUI to TrueNAS-SCALE-22.12.4.2. Then I attempted to upgrade to TrueNAS-23.10.2 and that's when things went wrong. The system hang hard after boot, at a very early stage. I could watch this via the IPMI of my Supermicro board, could not see all the messages but the latest ones were:
It obviously has something to with some PCI devices (I think), something in the new version (or kernel) is maybe not ok with some of my hardware, even though it works just fine on TrueNAS-SCALE-22.12.4.2. I had to power cycle the server and tried several times but it hangs at the exact same spot.
If I choose the latest bluefin version (TrueNAS-SCALE-22.12.4.2) from the boot menu, then it boots ok. For comparison, this are the boot messages for a working boot (https://pastebin.com/Qu6rCLmt), one can see there what the boot should look like, compared to when it hangs.
One of the first few lines after the one where it hangs above are:
I am not sure what that is, something to do with Intel. No idea why that fails on corbia. This is my hardware:
OS Version: TrueNAS-SCALE-22.12.4.2
Mainboard: X10SLM+-LN4F
CPU: Intel(R) Xeon(R) CPU E3-1231 v3 @ 3.40GHz
Memory: 32 GB (non-ECC)
Disks:
1x Crucial 256GB SSD: boot pool
3x 8TB Seagate Exos 7E10 (ST8000NM017B-2TJ): data pool
NICs:
4x Intel i210 Gb ports (built into the mainboard, not connected/used)
2x Intel 82571EB/82571GB ports (not connected/used)
1x ConnectX-3 Mellanox 10Gbps NIC (connected, used).
Anyone has any idea what happens here ? Or any other info that I can provide ?
Thank you in advance, any help is much appreciated.
--
Sylaan
I have a problem upgrading my TrueNAS SCALE system to the latest 23.10 corbia version. I had an older version of bluefin so after reading on the upgrade path, I upgraded the system without issues via the GUI to TrueNAS-SCALE-22.12.4.2. Then I attempted to upgrade to TrueNAS-23.10.2 and that's when things went wrong. The system hang hard after boot, at a very early stage. I could watch this via the IPMI of my Supermicro board, could not see all the messages but the latest ones were:
Code:
...... [Thu Mar 7 23:06:56 2024] PCI host bridge to bus 0000:00 [Thu Mar 7 23:06:56 2024] pci_bus 0000:00: root bus resource [io 0x0000-0x0cf7 window] [Thu Mar 7 23:06:56 2024] pci_bus 0000:00: root bus resource [io 0x0d00-0xffff window] [Thu Mar 7 23:06:56 2024] pci_bus 0000:00: root bus resource [mem 0x000a0000-0x000bffff window] [Thu Mar 7 23:06:56 2024] pci_bus 0000:00: root bus resource [mem 0x000cc000-0x000cffff window] [Thu Mar 7 23:06:56 2024] pci_bus 0000:00: root bus resource [mem 0x000d0000-0x000d3fff window] [Thu Mar 7 23:06:56 2024] pci_bus 0000:00: root bus resource [mem 0x000d4000-0x000d7fff window] [Thu Mar 7 23:06:56 2024] pci_bus 0000:00: root bus resource [mem 0x000d8000-0x000dbfff window] [Thu Mar 7 23:06:56 2024] pci_bus 0000:00: root bus resource [mem 0x000dc000-0x000dffff window] [Thu Mar 7 23:06:56 2024] pci_bus 0000:00: root bus resource [mem 0x000e0000-0x000e3fff window] [Thu Mar 7 23:06:56 2024] pci_bus 0000:00: root bus resource [mem 0x000e4000-0x000e7fff window] [Thu Mar 7 23:06:56 2024] pci_bus 0000:00: root bus resource [mem 0xe0000000-0xfeafffff window] [Thu Mar 7 23:06:56 2024] pci_bus 0000:00: root bus resource [mem 0xc00000000-0xfbfffffff window] [Thu Mar 7 23:06:56 2024] pci_bus 0000:00: root bus resource [bus 00-3e] [Thu Mar 7 23:06:56 2024] pci 0000:00:00.0: [8086:0c08] type 00 class 0x060000 [Thu Mar 7 23:06:56 2024] pci 0000:00:01.0: [8086:0c01] type 01 class 0x060400 [Thu Mar 7 23:06:56 2024] pci 0000:00:01.0: PME# supported from D0 D3hot D3cold [Thu Mar 7 23:06:56 2024] pci 0000:00:14.0: [8086:8c31] type 00 class 0x0c0330 [Thu Mar 7 23:06:56 2024] pci 0000:00:14.0: reg 0x10: [mem 0xf7800000-0xf780ffff 64bit] [Thu Mar 7 23:06:56 2024] pci 0000:00:14.0: PME# supported from D3hot D3cold [Thu Mar 7 23:06:56 2024] pci 0000:00:16.0: [8086:8c3a] type 00 class 0x078000 [Thu Mar 7 23:06:56 2024] pci 0000:00:16.0: reg 0x10: [mem 0xf7816000-0xf781600f 64bit] [Thu Mar 7 23:06:56 2024] pci 0000:00:16.0: PME# supported from D0 D3hot D3cold [Thu Mar 7 23:06:56 2024] pci 0000:00:16.1: [8086:8c3b] type 00 class 0x078000 [Thu Mar 7 23:06:56 2024] pci 0000:00:16.1: reg 0x10: [mem 0xf7815000-0xf781500f 64bit] [Thu Mar 7 23:06:56 2024] pci 0000:00:16.1: PME# supported from D0 D3hot D3cold [Thu Mar 7 23:06:56 2024] pci 0000:00:1a.0: [8086:8c2d] type 00 class 0x0c0320 [Thu Mar 7 23:06:56 2024] pci 0000:00:1a.0: reg 0x10: [mem 0xf7813000-0xf78133ff] [Thu Mar 7 23:06:56 2024] pci 0000:00:1a.0: PME# supported from D0 D3hot D3cold [Thu Mar 7 23:06:56 2024] pci 0000:00:1c.0: [8086:8c10] type 01 class 0x060400 [Thu Mar 7 23:06:56 2024] pci 0000:00:1c.0: PME# supported from D0 D3hot D3cold
It obviously has something to with some PCI devices (I think), something in the new version (or kernel) is maybe not ok with some of my hardware, even though it works just fine on TrueNAS-SCALE-22.12.4.2. I had to power cycle the server and tried several times but it hangs at the exact same spot.
If I choose the latest bluefin version (TrueNAS-SCALE-22.12.4.2) from the boot menu, then it boots ok. For comparison, this are the boot messages for a working boot (https://pastebin.com/Qu6rCLmt), one can see there what the boot should look like, compared to when it hangs.
One of the first few lines after the one where it hangs above are:
Code:
[Thu Mar 7 23:06:56 2024] pci 0000:00:1c.0: Enabling MPC IRBNCE [Thu Mar 7 23:06:56 2024] pci 0000:00:1c.0: Intel PCH root port ACS workaround enabled [Thu Mar 7 23:06:56 2024] pci 0000:00:1c.2: [8086:8c14] type 01 class 0x060400 [Thu Mar 7 23:06:56 2024] pci 0000:00:1c.2: PME# supported from D0 D3hot D3cold
I am not sure what that is, something to do with Intel. No idea why that fails on corbia. This is my hardware:
OS Version: TrueNAS-SCALE-22.12.4.2
Mainboard: X10SLM+-LN4F
CPU: Intel(R) Xeon(R) CPU E3-1231 v3 @ 3.40GHz
Memory: 32 GB (non-ECC)
Disks:
1x Crucial 256GB SSD: boot pool
3x 8TB Seagate Exos 7E10 (ST8000NM017B-2TJ): data pool
NICs:
4x Intel i210 Gb ports (built into the mainboard, not connected/used)
2x Intel 82571EB/82571GB ports (not connected/used)
1x ConnectX-3 Mellanox 10Gbps NIC (connected, used).
Anyone has any idea what happens here ? Or any other info that I can provide ?
Thank you in advance, any help is much appreciated.
--
Sylaan