se7zer
Cadet
- Joined
- Feb 14, 2024
- Messages
- 3
Thank you everyone in advance for any insight or help you may be able to offer!
I'm using TrueNAS-SCALE-22.12.4.2 bare metal and I have an AMD RX 5500 GPU which have the following settings:
- Applications/Settings/Advanced Settings/'Enable GPU support' selected
- System Settings/Advanced/Isolated GPU Device(s) was empty
lspci -k
It shows as Capacity and Allocatable on k3s kubectl describe nodes
I don't have any apps besides Immich, that I'm trying to install.
When trying to install Immich v1.94.1_3.0.9 via ix-apps (truenas community) and selecting Allocate 1 amd.com/gpu GPU,
it gives me the following error on the last (5/5) deployment:
All other 4 deployments are scheduled correctly and running:
It seems that k3s is trying to allocate the entire GPU in some pod and other pods can´t allocate it.
How can I make this GPU allocatable to every pod that needs it, or even every app, if ever install another app ?
If I select Allocate 0 amd.com/gpu GPU, it would seem to me that it will disable HWA, and I want to have it enabled.
Please tell me if you need more logs or information, I am happy to provide.
I'm using TrueNAS-SCALE-22.12.4.2 bare metal and I have an AMD RX 5500 GPU which have the following settings:
- Applications/Settings/Advanced Settings/'Enable GPU support' selected
- System Settings/Advanced/Isolated GPU Device(s) was empty
lspci -k
Code:
0a:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Navi 14 [Radeon RX 5500/5500M / Pro 5500M] (rev c5) Subsystem: Sapphire Technology Limited Navi 14 [Radeon RX 5500/5500M / Pro 5500M] Kernel driver in use: amdgpu Kernel modules: amdgpu
It shows as Capacity and Allocatable on k3s kubectl describe nodes
Code:
Capacity: amd.com/gpu: 1 cpu: 12 ephemeral-storage: 3607177856Ki hugepages-1Gi: 0 hugepages-2Mi: 0 memory: 16284308Ki pods: 250 Allocatable: amd.com/gpu: 1 cpu: 12 ephemeral-storage: 3509062615565 hugepages-1Gi: 0 hugepages-2Mi: 0 memory: 16284308Ki pods: 250
I don't have any apps besides Immich, that I'm trying to install.
When trying to install Immich v1.94.1_3.0.9 via ix-apps (truenas community) and selecting Allocate 1 amd.com/gpu GPU,
it gives me the following error on the last (5/5) deployment:
0/1 nodes are available: 1 Insufficient amd.com/gpu. preemption: 0/1 nodes are available: 1 No preemption victims found for incoming pod.
All other 4 deployments are scheduled correctly and running:
Code:
root@truenas[/home/admin]# k3s kubectl get deployment -n ix-immich NAME READY UP-TO-DATE AVAILABLE AGE immich-redis 1/1 1 1 151m immich-postgres 1/1 1 1 151m immich 1/1 1 1 151m immich-machinelearning 1/1 1 1 151m immich-microservices 0/1 1 0 151m
It seems that k3s is trying to allocate the entire GPU in some pod and other pods can´t allocate it.
Code:
Allocated resources: (Total limits may be over 100 percent, i.e., overcommitted.) Resource Requests Limits -------- -------- ------ cpu 140m (1%) 16 (133%) memory 270Mi (1%) 32938Mi (207%) ephemeral-storage 0 (0%) 0 (0%) hugepages-1Gi 0 (0%) 0 (0%) hugepages-2Mi 0 (0%) 0 (0%) amd.com/gpu 1 1 Events: <none>
How can I make this GPU allocatable to every pod that needs it, or even every app, if ever install another app ?
If I select Allocate 0 amd.com/gpu GPU, it would seem to me that it will disable HWA, and I want to have it enabled.
Please tell me if you need more logs or information, I am happy to provide.