Skip to content

Instantly share code, notes, and snippets.

@matt22207
Last active January 2, 2025 04:23
Show Gist options
  • Save matt22207/bb1ba1811a08a715e32f106450b0418a to your computer and use it in GitHub Desktop.
Save matt22207/bb1ba1811a08a715e32f106450b0418a to your computer and use it in GitHub Desktop.
Proxmox 5700G APU GPU Passthrough Notes

Some random notes on trying (and failing) to get Proxmox as host with 5700G APU GPU PCI Passthrough to Ubuntu guest VM working:

References:

Made some progress, but i am thinking this may not possible (for now without more support from AMD?) due to the shared memory of the GPU and system. Various AMD features (like PSP and TMZ) are meant for security and ensuring the programs/cpu can't read your GPU memory. I am totally unknowledgeable here, so I am speculating based off various threads. I suspect the stability issues various people mentioned may be due to issues with reserving memory for the GPU. Not sure, but it seems the memory sharing happens inside the VM at the driver level (rather than on the host at a higher level of hardware), as when i give the VM 10GB, i can see the amdgpu driver grabs 2GB and only 8GB are left to the OS on the host. So maybe this is all self-contained cleanly in the VM, and not really an issue. Not sure.

Anyway, I'm trying Proxmox and I was able to get an external display working (only on first boot??) inside an Ubuntu guest VM, and it seems like various 3d apps run at high fps (vkcube, glxgears) although i haven't installed any games yet to test, and not sure if this is really running on the GPU or CPU since the grpahic driver is shown as llvmpipe. Additionally, the amdgpu driver is still not loading and is giving some errors even though the monitor and those 3d apps work. Also, no audio yet either :( .

In case it helps others solve this, here's some things i've learned:

  • On the Host, modify /etc/default/grub and then run update-grub :
GRUB_DEFAULT=0
GRUB_TIMEOUT=0
GRUB_CMDLINE_LINUX_DEFAULT="amd_iommu=on iommu=pt video=efifb:off pcie_acs_override=downstream,multifunction amdgpu.exp_hw_support=1 modprobe.blacklist=amdgpu,snd_hda_intel,ccp textonly loglevel=0 silent text nomodeset"
GRUB_TERMINAL=serial
#GRUB_TERMINAL_OUTPUT -- comment this out along with GRUB_TERMINAL_INPUT if included, since GRUB_TERMINAL overrides
GRUB_FORCE_HIDDEN_MENU="true"
  • Not sure if all of these kernel params are needed, and some may be excessive, but most of this is random guesses trying to get the host not to touch any sort of display during boot, etc. Also, the pcie_acs_override is to split the PCI IOMMU groups since by default the GPU group included USB devices that i did not want to pass through. I am not sure if this is a problem, and possibly the full original group (including USB needs to be passed through.

  • On the host, blacklist these via /etc/modprobe.d/blacklist.conf (or similar) to avoid having your host initialize the AMD GPU and related devices (snd_hda_intel for audio and ccp for PSP security device i think) :

echo "blacklist amdgpu" >> /etc/modprobe.d/pve-blacklist.conf
echo "blacklist snd_hda_intel" >> /etc/modprobe.d/pve-blacklist.conf
echo "blacklist ccp" >> /etc/modprobe.d/pve-blacklist.conf
  • reboot

  • ensure these weren't activated by checking the boot logs such as sudo dmesg | grep amdgpu

    • note, ccp still gets loaded by kvm_amd no matter what i tried. not sure if this is a blocker
  • Confirm your IOMMU groups are split if needed so you can pass in the GPU+audio on its own by looking for different groups via this script: https://gist.github.com/flungo/428c374c040de1d0a30fd4a593d39040

  • Install/Setup your VM using q35 and OVMF (UEFI) without setting up PCI passthrough yet .. for me in ProxMox, this is just for initial setup in the noVNC console as hardware display may not work yet.. I had to switch to SeaBIOS after setup was complete to get the physical display output working. I would hope/assume we could get UEFI working, but it didn't see to work for me at all with this vbios.

  • After you have a working VM, install a backup way to get in such as regular VNC or openssh-server, then shut it down, and switch the VM to SeaBIOS .. i left the UEFI disk in place so i could switch back and forth as needed.

  • To add the PCI Passthrough you'll need to and get a vbios rom:

  • I found a copy of the 5700G VBIOS from a similar machine here: https://rog.asus.com/us/desktops/mid-tower/rog-strix-g10dk-series/helpdesk_bios ..

  • Extract using VBiosFinder : https://github.com/coderobe/VBiosFinder

    • Install all the dependencies. Especially UEFIExtract [ binaries ] and rom-parser
  • This will output a few rom's, and vbios_1002_1638_1.rom seemed to work the best for me. (Sorry, not sure of the legality of posting the actual rom file)

  • Copy this rom onto your Host in the appropriate dir. For me on proxmox, this was in /usr/share/kvm

  • Get your PCI id's via lspci on the host, which for me are:

05:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Cezanne [1002:1638] (rev c8)
05:00.1 Audio device [0403]: Advanced Micro Devices, Inc. [AMD/ATI] Device [1002:1637]
05:00.2 Encryption controller [1080]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 10h-1fh) Platform Security Processor [1022:15df]
05:00.3 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD] Renoir USB 3.1 [1022:1639]
05:00.4 USB controller [0c03]: Advanced Micro Devices, Inc. [AMD] Renoir USB 3.1 [1022:1639]
05:00.5 Multimedia controller [0480]: Advanced Micro Devices, Inc. [AMD] Raven/Raven2/FireFlight/Renoir Audio Processor [1022:15e2] (rev 01)
05:00.6 Audio device [0403]: Advanced Micro Devices, Inc. [AMD] Family 17h (Models 10h-1fh) HD Audio Controller [1022:15e3]
  • I think I only want #0 and #1 and #2 to passthrough (although again, i wonder if the others are required). Not sure about the #2 PSP (especially since i'm disabling PSP later) and 5/6 audio if they are duplicate of #1 audio. I did confirm that including the USB controllers will crash the host system, so if this is even possible you'd need the pcie_acs_override in grub to exclude those into different IOMMU groups. In your host's terminal, add your PCI id's to /etc/pve/qemu-server/100.conf (assuming this is your first VM #100) since you can't add the romfile setting via the proxmox web screens:
hostpci0: 0000:05:00.0;0000:05:00.1;0000:05:00.2,pcie=1,x-vga=1,romfile=vbios_1002_1638_1.rom
  • Note that x-vga=1 will force this to be the primary display, and proxmox's built in "noVNC" viewer will stop working.

    • Ensure you have another way to get in to your VM before this, such as passing through a usb keyboard/mouse, or openssh-server, regular VNC, etc.
  • Start your guest VM and you may get video output on your external display (i'm using the HDMI output), and various errors in the guest's dmesg if you grep for amdgpu ..

  • To fix error complaining about PSP unable to load rom.. Inside the Guest (NOT THE HOST!!!) , add these kernel boot params (for me via /etc/default/grub and then update-grub) : amgdpu.fw_load_type=0

    • This disables PSP which was failing to read the vbios rom, not sure if this has other side effects or removes needed functionality, but it gets some of the errors to go away.
GRUB_CMDLINE_LINUX_DEFAULT="amdgpu.fw_load_type=0"
  • Also, I played with disabling TMZ in the Guest's /etc/default/grub kernel params via amdgpu.tmz=0 .. not sure this did anything, so i left it out

  • reboot the guest and you should get rid of many errors. I am left with:

[    0.000000] Linux version 5.13.0-27-generic (buildd@lcy02-amd64-014) (gcc (Ubuntu 11.2.0-7ubuntu2) 11.2.0, GNU ld (GNU Binutils for Ubuntu) 2.37) #29-Ubuntu SMP Wed Jan 12 17:36:47 UTC 2022 (Ubuntu 5.13.0-27.29-generic 5.13.19)
[    0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz-5.13.0-27-generic root=/dev/mapper/vgubuntu-root ro amdgpu.fw_load_type=0 amdgpu.dc=1 radeon.cik_support=0 amdgpu.cik_support=1
[    0.017576] Kernel command line: BOOT_IMAGE=/boot/vmlinuz-5.13.0-27-generic root=/dev/mapper/vgubuntu-root ro amdgpu.fw_load_type=0 amdgpu.dc=1 radeon.cik_support=0 amdgpu.cik_support=1
[   11.189820] [drm] amdgpu kernel modesetting enabled.
[   11.189897] amdgpu: CRAT table not found
[   11.189899] amdgpu: Virtual CRAT table created for CPU
[   11.189906] amdgpu: Topology: Add CPU node
[   11.189947] fb0: switching to amdgpudrmfb from VESA VGA
[   11.190056] amdgpu 0000:01:00.0: vgaarb: deactivate vga console
[   11.190329] amdgpu 0000:01:00.0: amdgpu: Trusted Memory Zone (TMZ) feature enabled
[   11.197046] amdgpu 0000:01:00.0: amdgpu: Fetched VBIOS from ROM BAR
[   11.197048] amdgpu: ATOM BIOS: 113-CEZANNE-017
[   11.229736] amdgpu 0000:01:00.0: amdgpu: VRAM: 2048M 0x000000F400000000 - 0x000000F47FFFFFFF (2048M used)
[   11.229738] amdgpu 0000:01:00.0: amdgpu: GART: 1024M 0x0000000000000000 - 0x000000003FFFFFFF
[   11.229739] amdgpu 0000:01:00.0: amdgpu: AGP: 267419648M 0x000000F800000000 - 0x0000FFFFFFFFFFFF
[   11.229791] [drm] amdgpu: 2048M of VRAM memory ready
[   11.229793] [drm] amdgpu: 3072M of GTT memory ready.
[   12.708341] amdgpu 0000:01:00.0: amdgpu: SMU is initialized successfully!
[   12.872471] amdgpu 0000:01:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
[   12.872675] [drm:amdgpu_gfx_enable_kcq.cold [amdgpu]] *ERROR* KCQ enable failed
[   12.872825] [drm:amdgpu_device_ip_init [amdgpu]] *ERROR* hw_init of IP block <gfx_v9_0> failed -110
[   12.872939] amdgpu 0000:01:00.0: amdgpu: amdgpu_device_ip_init failed
[   12.872942] amdgpu 0000:01:00.0: amdgpu: Fatal error during GPU init
[   12.872993] amdgpu 0000:01:00.0: amdgpu: amdgpu: finishing device.

open issues:

  • amdgpu driver on guest still doesn't load as shown above
  • audio doesn't work, likely same issue
  • general stability. haven't tracked this thoroughly, but maybe only the first boot of the VM works.. after that, subsequent boots of the VM can freeze, and even once my proxmox host froze. again, i think the shared memory allocation may be a problem, but no clue.
  • On host. i think the USB controllers in the original IOMMU group really are tied to the APU, and there is an error trying to reset the extra audio devices after the VM shuts down. I had to remove these devices from the pci passthrough to get rid of this error.
Jan 23 17:54:55 minispve kernel: usb 1-2: reset low-speed USB device number 2 using xhci_hcd
Jan 23 17:59:30 minispve QEMU[7222]: kvm: vfio: Cannot reset device 0000:05:00.6, depends on group 17 which is not owned.
Jan 23 17:59:30 minispve QEMU[7222]: kvm: vfio: Cannot reset device 0000:05:00.5, depends on group 17 which is not owned.
  • I want to try a Windows10 VM as well to see if there's any difference
@vinphan
Copy link

vinphan commented Aug 8, 2024

For anyone else trying to do this, all the steps are outlined here and vbios + GOP driver files are available for download for various APUs: https://github.com/isc30/ryzen-7000-series-proxmox
Are you using Windows 10 or 11?
I followed the tutorial, but there was no network after passthrough the iGPU ):

Windows 11. I'm using UEFI as well, not CSM

It's been months since I messed with any of this, but I don't recall having any network problems. My Proxmox network device is set to VirtIO (paravirtualized), which requires the VirtIO drivers be installed for each broken device in Device Manager.

I don't have any useful information to help you fix your issue.

For the display, did you set it to None or Virtio?

@augiem
Copy link

augiem commented Aug 8, 2024

For anyone else trying to do this, all the steps are outlined here and vbios + GOP driver files are available for download for various APUs: https://github.com/isc30/ryzen-7000-series-proxmox
Are you using Windows 10 or 11?
I followed the tutorial, but there was no network after passthrough the iGPU ):

Windows 11. I'm using UEFI as well, not CSM
It's been months since I messed with any of this, but I don't recall having any network problems. My Proxmox network device is set to VirtIO (paravirtualized), which requires the VirtIO drivers be installed for each broken device in Device Manager.
I don't have any useful information to help you fix your issue.

For the display, did you set it to None or Virtio?

Display is set to Default in Proxmox.

agent: 1
bios: ovmf
boot: order=virtio0;net0;sata0
cores: 16
cpu: host
efidisk0: local-lvm:vm-103-disk-0,efitype=4m,pre-enrolled-keys=1,size=4M
hostpci0: 0000:05:00.0,pcie=1,romfile=vbios_Ryzen5700U.bin,x-vga=1
hostpci1: 0000:05:00.1,pcie=1,romfile=AMDGopDriver_5700U.rom
machine: pc-q35-8.1
memory: 16384
meta: creation-qemu=8.1.5,ctime=1709158056
name: Win11APUTest
net0: virtio=BC:24:11:5E:C7:D9,bridge=vmbr0,firewall=1
numa: 0
ostype: win11
sata0: local:iso/virtio-win-0.1.240.iso,media=cdrom,size=612812K
scsihw: virtio-scsi-single
smbios1: uuid=a76db531-de42-496c-83e1-4100e460ea1b
sockets: 1
tpmstate0: local-lvm:vm-103-disk-1,size=4M,version=v2.0
usb0: host=046d:c53f
usb1: host=046d:c31c
virtio0: local-lvm:vm-103-disk-2,iothread=1,size=64G
vmgenid: f734fc27-789e-4f18-88de-452e656940e6

@dmuiX
Copy link

dmuiX commented Sep 13, 2024

Wow. This really Solved my issue thanks a ton for this valuable information.
I have a Working Passthrough Windows 11 VM with my Ryzen 3 4350G.
Didnt Know that I need a Rom for my Audio Device as well.

@Naunter
Copy link

Naunter commented Oct 21, 2024

Finally I made it works! 5700G iGPU passthrough on PVE 7 is possible! Thanks for your collections and advice!
My motherboard is "B550 Phantom Gaming-ITX/ax". The PVE version is 7.2.
Few important points:

  • Motherboard set to CSM
  • Using correct vBIOS
  • Machine: Q35
  • BIOS: SeaBIOS

But there are still a issue is the AMD GPU passthrough reset bug. Every time you shut the VM off, you need to reboot the whole PVE machine to reset the GPU. Can't find a solution.
And I also made a video to show how to do 5700G iGPU passthrough on PVE 7. The video is in Chinese, but I have marked some key points in English as much as possible.
Here is the full video: https://www.bilibili.com/video/BV11d4y1G7Nk
And the final demo is at 29:33: https://www.bilibili.com/video/BV11d4y1G7Nk?share_source=copy_web&vd_source=37c57e6564de58a018f5b76ac5bfd5e2&t=1773
untitled-f014913

Thanks a lot, I followed your tutorial, but at the end, there is no IP for my Windows VM to work ): Do you know why?

Hi, because you don't have more information provided, can't analyze the issue for you. But first you should check your PVE network settings, bridge settings and VM network settings. Check your logs to see if any error or warning messages.

@DocMAX
Copy link

DocMAX commented Oct 22, 2024

AMD reset bug??

@philuxe
Copy link

philuxe commented Dec 6, 2024

Hi,
Was able to get mine working (igpu passthrough to ubuntu vm), thanks for your input.

I got a PVE crash once, did not happen again so far.

Link to my step by step notes: https://github.com/philuxe/pve/blob/main/5700g_PciPassthrough.md

@matt22207
Copy link
Author

Thanks to everyone for feedback, updates, etc. Apologies I haven't had time to review and incorporate all of this back into the original notes.

Adding one more hopefully relevant reference that came across today:

via ballerburg9005 : https://forums.unraid.net/topic/100729-help-requested-for-amd-ryzen-5-pro-4650g-passthrough/page/3/#comment-1494802

Check out this post from me, I made Ryzen 5 5600G work via UEFI VBIOS AND fixed reset bug with vendor-reset, so 100% usable.
https://forum.proxmox.com/threads/amd-ryzen-5600g-igpu-code-43-error.138665/post-726791
I think both are Vega 7. So by copying my method, maybe all you need is a different VBIOS, or maybe the supplied one also works.

@augiem
Copy link

augiem commented Dec 7, 2024

Adding one more hopefully relevant reference that came across today:

via ballerburg9005 : https://forums.unraid.net/topic/100729-help-requested-for-amd-ryzen-5-pro-4650g-passthrough/page/3/#comment-1494802

Check out this post from me, I made Ryzen 5 5600G work via UEFI VBIOS AND fixed reset bug with vendor-reset, so 100% usable.
https://forum.proxmox.com/threads/amd-ryzen-5600g-igpu-code-43-error.138665/post-726791
I think both are Vega 7. So by copying my method, maybe all you need is a different VBIOS, or maybe the supplied one also works.

Interesting. I had tried modifying vendor-reset in the same way he did before but it didn't make any difference. However, I think my criteria for what I consider working is different than his.

I have always done my testing on this issue by having a physical monitor hooked up. I didn't ever try it with only a virtual display over remote and no HDMI cable (or dummy display dongle) attached. (My understanding of what exactly the reset bug is was incorrect, I guess?)

I replicated the vendor-reset stuff for my iGPU, and at no time did a reset ever work with HDMI attached. However, I did notice that using the VM only virtually through Parsec, I was able to reset and power off/on to my heart's content. Curious as to whether this was even being enabled by vendor-reset, I completely removed vendor-reset and it still works. So it seems my setup has always worked without gpu reset bug issues, provided you ignore the fact that you cannot use a physical monitor (don't have an HDMI dongle attached either or it may fail). In my case, vendor-reset didn't do anything. However, I learned from this post that as long as I am content only connecting over remote, I don't have to worry about the reset bug, so thanks!

FWIW, I use the AMD Adrenaline drivers with no issues.

@DocMAX
Copy link

DocMAX commented Dec 7, 2024

TL;DR, Can someone summarize the steps i have to do to avoid the reset-bug on a Ryzen 5800U APU? I am running a proxmox server and i want to start stop a Win11 VM as many times wthout freeze or disappearing GPU. Thanks!

@augiem
Copy link

augiem commented Dec 13, 2024

However, I did notice that using the VM only virtually through Parsec, I was able to reset and power off/on to my heart's content.

After a lot more testing, even with no physical display or dongle attached, while I am able to reboot / shutdown and restart the VM (Windows only, KDE Neon fails), the GPU can and does still crash sometimes in games and every time when running 3DMark Time Spy. Once crashed, it will not work again until the host is rebooted. This is true with or without the extra steps of customizing and installing vendor-reset. So in my case, the GPU reset bug is not fixed, even without a physical display, and vendor-reset does nothing at all to address this on my system. So for me, while GPU passthrough does work, it's still unusable for anything I might want to use it for because the GPU can crash at any time, taking the whole system down.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment