Skip to content

Instantly share code, notes, and snippets.

@scyto
Last active October 12, 2024 21:54
Show Gist options
  • Save scyto/e4e3de35ee23fdb4ae5d5a3b85c16ed3 to your computer and use it in GitHub Desktop.
Save scyto/e4e3de35ee23fdb4ae5d5a3b85c16ed3 to your computer and use it in GitHub Desktop.

Enable & Using vGPU Passthrough

This gist is almost entirely not unlike Derek Seaman's awesome blog:

Proxmox VE 8: Windows 11 vGPU (VT-d) Passthrough with Intel Alder Lake

As such please refer to that for pictures, here i will capture the command lines I used as i sequence the commands a little differently so it makes more logic to me.

This gists assumes you are not running ZFS and are not passing any other PCIE devices (as both of these can require addtional steps - see Derek's blog for more info)

This gist assumes you are not running proxmox in UEFI Secure boot - if you are please refer entirely to dereks blog.

ALSO pleas refere to the comments section as folks have found workarounds and probably corrections (if the mistakes remain in my write up it is because i have't yet tested the corrections)

Note:i made no changes to the BIOS defaults on the Intel Nuc 13th Gen. This just worked as-is.

this gist is part of this series

Preparation

Install Build Requirements

apt update && apt install pve-headers-$(uname -r)
apt install git sysfsutils dkms build-* unzip -y

Install Other Drivers / Tools

This allow you to run vainfo, intel_gpu_top for testing and non-free versions of the encoding driver - without this you will not AFAIK be able to encoding with this GPU. This was missed in EVERY guide i saw for this vGPU, so not sure, but i had terrible issues until i did this.

edits the sources list with nano /etc/apt/sources.list

add the following lines:

#non-free firmwares
deb http://deb.debian.org/debian bookworm non-free-firmware

#non-free drivers and components
deb http://deb.debian.org/debian bookworm non-free

and save the file

apt update && apt install intel-media-va-driver-non-free intel-gpu-tools vainfo

This next step copies a driver missing on proxmox installs and will remove the -2 error for this file in dmesg.

wget -r -nd -e robots=no -A '*.bin' --accept-regex '/plain/' https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git/tree/i915/adlp_dmc.bin

cp adlp_dmc.bin /lib/firmware/i915/

Compile and Install the new driver

Clone github project

cd ~
git clone https://github.com/strongtz/i915-sriov-dkms.git

modify dkms.conf

cd i915-sriov-dkms
nano dkms.conf

change these two lines as follows:

PACKAGE_NAME="i915-sriov-dkms"
PACKAGE_VERSION="6.5"

save the file

Compile and Install the Driver

cd ~
mv i915-sriov-dkms/ /usr/src/i915-sriov-dkms-6.5
dkms install --force -m i915-sriov-dkms -v 6.5

and use dkms status to verify the module is now installed

Modify grub

edit the grub fle with nano /etc/default/grub

change this line in the file

GRUB_CMDLINE_LINUX_DEFAULT="quiet intel_iommu=on iommu=pt i915.enable_guc=3 i915.max_vfs=7"

note: if you have already made modifications to this line in your grub file for other purposes you should also still keep those items

finally run

update-grub
update-initramfs -u

Find PCIe Bus and update sysfs.conf

use lspci | grep VGA t find the bus number

you should see something like this:

root@pve2:~# lspci | grep VGA
00:02.0 VGA compatible controller: Intel Corporation Raptor Lake-P [Iris Xe Graphics] (rev 04)

take the number on the far left and add to the sysfs.conf as follows - note all the proceeding zeros on the bus path are needed

echo "devices/pci0000:00/0000:00:02.0/sriov_numvfs = 7" > /etc/sysfs.conf

REBOOT

Testing On Host

check devices

check devices with dmesg | grep i915

the last two lines should read as follows:

[    7.591662] [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.7 on minor 7
[    7.591818] i915 0000:00:02.0: Enabled 7 VFs

if they don't then check all steps carefully

Validate with VAInfo

validate with vainfo you should see no errors (note this needs the drivers and tool i said to install at the top) and vainfo --display drm --device /dev/dri/cardN where N is a number from 0 to 7 - this will show you the acceleration endpoints for each VF

Check you can monitor the VFs - if not you have issues

monitor any VF renderer in real time with intel_gpu_top -d drm:/dev/dri/renderD128 there is one per VF - to see them all use ls -l /dev/dri

Configure vGPU Pool in Proxmox

  1. navigate to Datacenter > Resource Mappings
  2. click add in PCI devices
  3. name the pool something like vGPU-Pool
  4. map all 7 VFs for pve 1 but NOT the root device i.e 0000:00:02.x not 0000:00:02
  5. click create
  6. on the created pool lcikc the plus button next to vGPU-Pool
  7. select mapping on node = pve 2, ad all devices and click create
  8. repeat for pve3

The pool should now look like this:

image

Note: machines with PCI pass through devices cannot be live migrated, they must be shutdown, migrated offline to the new node and then started.

EVERYTIME THE KERNEL IS UPDATED IN PROXMOX YOU SHOULD DO THE FOLLOWING

update the kernel using proxox ui
dkms install -m i915-sriov-dkms -v 6.5 --force
reboot

How to get working in a privileged container

wow this one is hard.... you can avoid the id mapping stuff by not using a privileged container...

Assumptions:

  1. you have a debian 12 container, you added the non-free deb and have installed the non-free drivers as per the host instructions
  2. you have run cat /etc/groups in the container and noted down the GID for render (lets call that CTRGID) and gid for video (lets call that CTVGID).
  3. you have run cat /etc/groups in the container and noted down the GID for render (lets call that HSTRGID) and gid for video (lets call that HSTVGID). 5 that you have va info fully working

Create Container

  1. create container privileged, with debian 12, starts it
  2. apt update, apt upgrade, install non free drivers, vainfo and intel_gpu_top tools
  3. add root to user and video groups (this will mean when we get to ID mapping you don't need to tart about with user mappings - only group ones)
usermod -a -G render root
usermod -a -G video root
  1. shutdown container

Edit container conf file

  1. These are stored in /etc/pve/lxc and have the VMID.conf anme
  2. nano /etc/pve/lxc/VMID.conf

Add lxc device mapping

Here you add a line for the card uyou want and the rendere. Note if you map a VF (card) to a container it means that is hard mapped, if you have that VF in a pool for VMs please remove it from the pool (this means also these containers cannot be HA)

In the example below i chose card6 - which is renderD134 These are mapped into the container as card0 and renderD128 Change your numbers as per your own VF / card mappings

lxc.cgroup2.devices.allow: c 226:6 rwm
lxc.mount.entry: /dev/dri/card6 dev/dri/card0 none bind,optional,create=file

lxc.cgroup2.devices.allow: c 226:134 rwm
lxc.mount.entry: /dev/dri/renderD134 dev/dri/renderD128 none bind,optional,create=file

Add ID mapping (only needed in unprivileged)

  1. add the following... and here it gets complex as it will vary based on the numbers you recorded earlier - let me try... the aim is to have a continguois block of mappings but the syntax is um difficult...
lxc.idmap: u 0 100000 65536
lxc.idmap: g 0 100000 CTVGID
lxc.idmap: g CTVGID HSTVGID 1
lxc.idmap: g CTVGID+1 1000{CTVGID+1} CTRGID-CTVGID-1
lxc.idmap: g CTRGID HSTVGID 1
lxc.idmap: g CTRGID+1 100{CTRGID+1} 65536-{CTRGID+1}

so as an example, these are my values:

        host > ct
video:    44 > 44
render:  104 > 106

this is what i added to my VMID.conf file (in my case /etc/pve/lxc/107.conf

lxc.idmap: u 0 100000 65536
lxc.idmap: g 0 100000 44
lxc.idmap: g 44 44 1
lxc.idmap: g 45 100045 61
lxc.idmap: g 106 104 1
lxc.idmap: g 107 100107 65429
  1. add your two CT values to nano /etc/subgid (only needed in unprivileged)

in my case:

root:106:1
root:44:1

after this you should be able to start up the container and run vainfo and perform transcoding.

check permissions with ls -la /dev/dri it should look like this:

root@vGPU-debian-test:~# ls -la /dev/dri
total 0
drwxr-xr-x 2 root   root         80 Oct  7 00:22 .
drwxr-xr-x 7 root   root        500 Oct  7 00:22 ..
crw-rw-rw- 1 nobody video  226,   0 Oct  4 21:42 card0
crw-rw-rw- 1 nobody render 226, 128 Oct  4 21:42 renderD128

if the group names do not say video and render then you did something wrong

**Note: YYMV **

For example plex HW transcoded just fine on my system.

Emby on the otherhand seems to interrogate the kernel driver directly and gets the wrong answers - this is IMHO an issue with their detection logic not supporting this scenario.

Another example is intel_gpu_top which doesn't seem to work in this mode either - this is because it only works with the PMUs not the VFs (so somoene said)

Or maybe i just have no clue what i am doing, lol.

---work in progress 2023.10.6---

add vGPU to a Windows 11 or Server 2022 VM

  1. create VM with CPU set to host DO NOT CHANGE THIS
  2. boot VM without vGPU and display set to default
  3. install windows 11
  4. install VirtIO drivers [as of 4.6.2024 do not install guest tools - this may cause repair loops]
  5. shutdown VM and change display to VirtIO-GPU
  6. Now add the vGPU pool as a PCI device
  7. when creating a VM add a PCI device and add the poool as follows:

image

  1. now boot into VM and install latest IrisXe drivers from intel
  2. you should now have graphics acceleration availble to apps wether you connect by webcolse VNC, SPICE or an RDP client

From @rinze24:

If you follow the guide successfully, in Device Manager you will see:

  • Microsoft Basic Display Adapter - If you use Display in VM Settings
  • Intel iGPU - passthrough

You have 2 options (or more) to use your iGPU. Because Windows 11 decide on its own which graphics to use.

  1. Setup Remote Desktop Connection in Windows 11 and set the display to none in VM Hardware settings.
  • Pro: No configuration per app, Responsive Connection.
  • Con: No proxmox console.
  1. Inside Windows Set which graphics preference to use per application in Display Settings -> Graphics Settings-
  • Pro: Have proxmox console.
  • Con: Need to configure per application / program.

If you hit automatic repair loop at any point shutdown the machine and edit its conf file in /etc/pve/qemu-server and add args: -cpu Cooperlake,hv_relaxed,hv_spinlocks=0x1fff,hv_vapic,hv_time,+vmx

@pcmike
Copy link

pcmike commented Sep 19, 2024

@mm2293 I was sticking to 6.5.13-3 because I read that the dkms module wont be working otherwise

that doesn't seem correct.... i mean what you read, i believe you you read it unless it is an issue that doesn't manifest until use? this is my dmesg | grep i915 - is there an issue in there you can see you think is an issue https://privatebin.net/?f65d649ff1c82f05#D98CvzJuXV2KarFcfe8wQBVqxRmZRK7sKBNgaNpZJxXY

This was true up until the linked dkms GitHub was updated to work with 6.8.

If I have time this weekend, I may try and update my nodes. I want to find a way to clone the boot disks on each node before I attempt this update though as I really don’t want to have to rebuild each node if this goes south. I’ll post back when I’ve completed the update.

@nickcharlesyt
Copy link

nickcharlesyt commented Sep 19, 2024

lunch was yummy and productive, this rough and ready so sorry if i made errors, i haven't had time to redo on another node

this assumes you are running kernel 6.8.12-2-pve and pulled the git repo on Sep 18th 2024 @3pm PDT

do dkms status if you see any old i915-dkms modules as installed you will need to first uninstall them and then remove them using dkms --uninstall -m <module name> -v <version> -k <kernel>

if you see any old i915-dkms modules but they are not installed you need to remove them using dkms --remove -m <module name> -v <version> -k <kernel>

you can drive the module name, version and kernel from dkms status

for example

root@pve1:/usr/src# dkms status
i915-sriov-dkms/2024.08.09, 6.8.12-2-pve, x86_64: installed
  • m = i915-sriov-dkms
  • v = 2024.08.09
  • k = 6.8.12.2-pve

Once clean remake and re-install dkms driver

cd ~
git clone https://github.com/strongtz/i915-sriov-dkms.git
mv i915-sriov-dkms/ /usr/src/i915-sriov-dkms-2024.08.09
cd /usr/src

You should have the folder you just moved and a folder called linux-headers-$(uname-r) (i.e. the same as your current)

For example:

root@pve1:/usr/src# ls -l
total 12
drwxr-xr-x  4 root root 4096 Sep 18 14:07 i915-sriov-dkms-2024.08.09
drwxr-xr-x 25 root root 4096 Sep 18 14:53 linux-headers-6.8.12-2-pve

If not try apt install pve-headers If that doesn't populate the linux-headers folder try apt --reinstall install proxmox-headers-$(uname -r) to epxlicitly pull the headers and force them, this shouldn't be needed but in my case it was (maybe because i had used pvekclean or i had manually cleared the src dir)

cd /usr/src/i915-sriov-dkms-2024.09.09
dkms add .
dkms install -m i915-sriov-dkms -v 2024.08.09 --force --kernelsourcedir /usr/src/linux-headers-6.8.12-2-pve/

Make sure /etc/sysfs.conf still contains "devices/pci0000:00/0000:00:02.0/sriov_numvfs = 7"

update-grub update-initramfs -u proxmox-boot-tool refresh

reboot

this should get you as far as having 7VFs - i have tested no further

This is, I believe, the steps I took to get mine working. I screwed something up the first time I tried and mixed up some of the instructions here and on the other blog post, but since I redid it it's been working just fine, I have a copy of blue iris and jellyfin running inside a w11 box using the vgpu pool and it is indeed making use of the hardware acceleration. I will say I had trouble getting it working with an Ubuntu or Debian based VM, but I think that's my own ineptitude at installing the Intel drivers on Linux in the first place. If I eventually get it working there I did want to separate jellyfin onto it's own VM, but I'll have to try again this weekend.

@scyto
Copy link
Author

scyto commented Sep 19, 2024 via email

@Ogglord
Copy link

Ogglord commented Sep 20, 2024

I'm failing the validation on the proxmox host, same error with kernel 6.5.13-3-pve and 6.8.12-2-pve (i5-12500 CPU).
The VFs are there at least.

root# ls /dev/dri
by-path  card0	card1  card2  card3  card4  card5  card6  card7  renderD128  renderD129  renderD130  renderD131  renderD132  renderD133  renderD134  renderD135

I cannot get intel_gpu_top to run for any except card1/renderD128, all other return the following.

root# intel_gpu_top -d drm:/dev/dri/renderD129
Failed to detect engines! (No such file or directory)
(Kernel 4.16 or newer is required for i915 PMU support.)
zsh: segmentation fault  intel_gpu_top -d drm:/dev/dri/renderD129

dmesg | grep i915 gives me

[    0.000000] Command line: BOOT_IMAGE=/vmlinuz-6.8.12-2-pve root=UUID=0c6df81e-6eee-430b-971f-0ae926ead243 ro consoleblank=0 systemd.show_status=true consoleblank=0 intel_iommu=on iommu=pt i915.enable_guc=3 i915.max_vfs=7
[    3.365872] i915: loading out-of-tree module taints kernel.
[    3.366101] i915: module verification failed: signature and/or required key missing - tainting kernel
[    3.552746] i915 0000:00:02.0: Running in SR-IOV PF mode
[    3.553670] i915 0000:00:02.0: [drm] VT-d active for gfx access
[    3.579608] i915 0000:00:02.0: vgaarb: deactivate vga console
[    3.579649] i915 0000:00:02.0: [drm] Using Transparent Hugepages
[    3.580242] i915 0000:00:02.0: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=io+mem:owns=io+mem
[    3.581727] i915 0000:00:02.0: [drm] Finished loading DMC firmware i915/adls_dmc_ver2_01.bin (v2.1)
[    3.589524] i915 0000:00:02.0: [drm] GT0: GuC firmware i915/tgl_guc_70.bin version 70.29.2
[    3.589530] i915 0000:00:02.0: [drm] GT0: HuC firmware i915/tgl_huc.bin version 7.9.3
[    3.592311] i915 0000:00:02.0: [drm] GT0: HuC: authenticated for all workloads!
[    3.592629] i915 0000:00:02.0: [drm] GT0: GUC: submission enabled
[    3.592630] i915 0000:00:02.0: [drm] GT0: GUC: SLPC enabled
[    3.592973] i915 0000:00:02.0: [drm] GuC RC: enabled
[    3.593472] i915 0000:00:02.0: [drm] Protected Xe Path (PXP) protected content support initialized
[    3.622805] [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.0 on minor 1
[    3.623788] i915 0000:00:02.0: 7 VFs could be associated with this PF
[    3.660126] fbcon: i915drmfb (fb0) is primary device
[    3.706604] i915 0000:00:02.0: [drm] fb0: i915drmfb frame buffer device
[    4.398538] i915 0000:00:02.0: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=none:owns=io+mem
               use xe.force_probe='4690' and i915.force_probe='!4690'
[    4.401267] i915 0000:00:02.1: enabling device (0000 -> 0002)
[    4.401732] i915 0000:00:02.1: Running in SR-IOV VF mode
[    4.403072] i915 0000:00:02.1: [drm] *ERROR* GT0: IOV: Unable to confirm version 1.13 (0000000000000000)
[    4.403632] i915 0000:00:02.1: [drm] *ERROR* GT0: IOV: Found interface version 0.1.13.4
[    4.404917] i915 0000:00:02.1: [drm] VT-d active for gfx access
[    4.405305] i915 0000:00:02.1: [drm] Using Transparent Hugepages
[    4.406155] i915 0000:00:02.1: [drm] *ERROR* GT0: IOV: Unable to confirm version 1.13 (0000000000000000)
[    4.406734] i915 0000:00:02.1: [drm] *ERROR* GT0: IOV: Found interface version 0.1.13.4
[    4.407445] i915 0000:00:02.1: GuC firmware PRELOADED version 0.0 submission:SR-IOV VF
[    4.407797] i915 0000:00:02.1: HuC firmware PRELOADED
[    4.409701] i915 0000:00:02.1: [drm] Protected Xe Path (PXP) protected content support initialized
[    4.410057] i915 0000:00:02.1: [drm] PMU not supported for this GPU.
[    4.410624] [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.1 on minor 0
[    4.413115] i915 0000:00:02.0: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=io+mem
[    4.413509] i915 0000:00:02.1: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=none:owns=none
               use xe.force_probe='4690' and i915.force_probe='!4690'
[    4.416126] i915 0000:00:02.2: enabling device (0000 -> 0002)
[    4.416543] i915 0000:00:02.2: Running in SR-IOV VF mode
[    4.417741] i915 0000:00:02.2: [drm] *ERROR* GT0: IOV: Unable to confirm version 1.13 (0000000000000000)
[    4.418255] i915 0000:00:02.2: [drm] *ERROR* GT0: IOV: Found interface version 0.1.13.4
[    4.419845] i915 0000:00:02.2: [drm] VT-d active for gfx access
[    4.420215] i915 0000:00:02.2: [drm] Using Transparent Hugepages
[    4.421002] i915 0000:00:02.2: [drm] *ERROR* GT0: IOV: Unable to confirm version 1.13 (0000000000000000)
[    4.421522] i915 0000:00:02.2: [drm] *ERROR* GT0: IOV: Found interface version 0.1.13.4
[    4.422591] i915 0000:00:02.2: GuC firmware PRELOADED version 0.0 submission:SR-IOV VF
[    4.422950] i915 0000:00:02.2: HuC firmware PRELOADED
[    4.424889] i915 0000:00:02.2: [drm] Protected Xe Path (PXP) protected content support initialized
[    4.425252] i915 0000:00:02.2: [drm] PMU not supported for this GPU.
[    4.425758] [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.2 on minor 2
[    4.428179] i915 0000:00:02.0: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=io+mem
[    4.428577] i915 0000:00:02.1: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=none
[    4.429005] i915 0000:00:02.2: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=none:owns=none
               use xe.force_probe='4690' and i915.force_probe='!4690'
[    4.431398] i915 0000:00:02.3: enabling device (0000 -> 0002)
[    4.431759] i915 0000:00:02.3: Running in SR-IOV VF mode
[    4.432524] i915 0000:00:02.3: [drm] *ERROR* GT0: IOV: Unable to confirm version 1.13 (0000000000000000)
[    4.432885] i915 0000:00:02.3: [drm] *ERROR* GT0: IOV: Found interface version 0.1.13.4
[    4.433470] i915 0000:00:02.3: [drm] VT-d active for gfx access
[    4.433825] i915 0000:00:02.3: [drm] Using Transparent Hugepages
[    4.434432] i915 0000:00:02.3: [drm] *ERROR* GT0: IOV: Unable to confirm version 1.13 (0000000000000000)
[    4.434789] i915 0000:00:02.3: [drm] *ERROR* GT0: IOV: Found interface version 0.1.13.4
[    4.435314] i915 0000:00:02.3: GuC firmware PRELOADED version 0.0 submission:SR-IOV VF
[    4.435724] i915 0000:00:02.3: HuC firmware PRELOADED
[    4.437523] i915 0000:00:02.3: [drm] Protected Xe Path (PXP) protected content support initialized
[    4.437869] i915 0000:00:02.3: [drm] PMU not supported for this GPU.
[    4.438264] [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.3 on minor 3
[    4.440656] i915 0000:00:02.0: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=io+mem
[    4.441001] i915 0000:00:02.1: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=none
[    4.441351] i915 0000:00:02.2: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=none
[    4.441762] i915 0000:00:02.3: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=none:owns=none
               use xe.force_probe='4690' and i915.force_probe='!4690'
[    4.444276] i915 0000:00:02.4: enabling device (0000 -> 0002)
[    4.444710] i915 0000:00:02.4: Running in SR-IOV VF mode
[    4.445221] i915 0000:00:02.4: [drm] *ERROR* GT0: IOV: Unable to confirm version 1.13 (0000000000000000)
[    4.445623] i915 0000:00:02.4: [drm] *ERROR* GT0: IOV: Found interface version 0.1.13.4
[    4.446227] i915 0000:00:02.4: [drm] VT-d active for gfx access
[    4.446623] i915 0000:00:02.4: [drm] Using Transparent Hugepages
[    4.447227] i915 0000:00:02.4: [drm] *ERROR* GT0: IOV: Unable to confirm version 1.13 (0000000000000000)
[    4.447629] i915 0000:00:02.4: [drm] *ERROR* GT0: IOV: Found interface version 0.1.13.4
[    4.448119] i915 0000:00:02.4: GuC firmware PRELOADED version 0.0 submission:SR-IOV VF
[    4.448478] i915 0000:00:02.4: HuC firmware PRELOADED
[    4.450391] i915 0000:00:02.4: [drm] Protected Xe Path (PXP) protected content support initialized
[    4.450740] i915 0000:00:02.4: [drm] PMU not supported for this GPU.
[    4.451144] [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.4 on minor 4
[    4.453426] i915 0000:00:02.0: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=io+mem
[    4.453772] i915 0000:00:02.1: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=none
[    4.454122] i915 0000:00:02.2: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=none
[    4.454486] i915 0000:00:02.3: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=none
[    4.454891] i915 0000:00:02.4: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=none:owns=none
               use xe.force_probe='4690' and i915.force_probe='!4690'
[    4.457162] i915 0000:00:02.5: enabling device (0000 -> 0002)
[    4.457544] i915 0000:00:02.5: Running in SR-IOV VF mode
[    4.458273] i915 0000:00:02.5: [drm] *ERROR* GT0: IOV: Unable to confirm version 1.13 (0000000000000000)
[    4.458674] i915 0000:00:02.5: [drm] *ERROR* GT0: IOV: Found interface version 0.1.13.4
[    4.459248] i915 0000:00:02.5: [drm] VT-d active for gfx access
[    4.459643] i915 0000:00:02.5: [drm] Using Transparent Hugepages
[    4.460253] i915 0000:00:02.5: [drm] *ERROR* GT0: IOV: Unable to confirm version 1.13 (0000000000000000)
[    4.460688] i915 0000:00:02.5: [drm] *ERROR* GT0: IOV: Found interface version 0.1.13.4
[    4.461290] i915 0000:00:02.5: GuC firmware PRELOADED version 0.0 submission:SR-IOV VF
[    4.461680] i915 0000:00:02.5: HuC firmware PRELOADED
[    4.463495] i915 0000:00:02.5: [drm] Protected Xe Path (PXP) protected content support initialized
[    4.463827] i915 0000:00:02.5: [drm] PMU not supported for this GPU.
[    4.464220] [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.5 on minor 5
[    4.466483] i915 0000:00:02.0: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=io+mem
[    4.466822] i915 0000:00:02.1: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=none
[    4.467160] i915 0000:00:02.2: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=none
[    4.467526] i915 0000:00:02.3: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=none
[    4.467923] i915 0000:00:02.4: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=none
[    4.468314] i915 0000:00:02.5: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=none:owns=none
               use xe.force_probe='4690' and i915.force_probe='!4690'
[    4.470571] i915 0000:00:02.6: enabling device (0000 -> 0002)
[    4.470903] i915 0000:00:02.6: Running in SR-IOV VF mode
[    4.471589] i915 0000:00:02.6: [drm] *ERROR* GT0: IOV: Unable to confirm version 1.13 (0000000000000000)
[    4.471990] i915 0000:00:02.6: [drm] *ERROR* GT0: IOV: Found interface version 0.1.13.4
[    4.472685] i915 0000:00:02.6: [drm] VT-d active for gfx access
[    4.473020] i915 0000:00:02.6: [drm] Using Transparent Hugepages
[    4.473571] i915 0000:00:02.6: [drm] *ERROR* GT0: IOV: Unable to confirm version 1.13 (0000000000000000)
[    4.473907] i915 0000:00:02.6: [drm] *ERROR* GT0: IOV: Found interface version 0.1.13.4
[    4.474361] i915 0000:00:02.6: GuC firmware PRELOADED version 0.0 submission:SR-IOV VF
[    4.474684] i915 0000:00:02.6: HuC firmware PRELOADED
[    4.476264] i915 0000:00:02.6: [drm] Protected Xe Path (PXP) protected content support initialized
[    4.476643] i915 0000:00:02.6: [drm] PMU not supported for this GPU.
[    4.477016] [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.6 on minor 6
[    4.479194] i915 0000:00:02.0: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=io+mem
[    4.479566] i915 0000:00:02.1: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=none
[    4.479961] i915 0000:00:02.2: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=none
[    4.480351] i915 0000:00:02.3: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=none
[    4.480739] i915 0000:00:02.4: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=none
[    4.481119] i915 0000:00:02.5: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=none
[    4.481489] i915 0000:00:02.6: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=none:owns=none
               use xe.force_probe='4690' and i915.force_probe='!4690'
[    4.483746] i915 0000:00:02.7: enabling device (0000 -> 0002)
[    4.484077] i915 0000:00:02.7: Running in SR-IOV VF mode
[    4.484778] i915 0000:00:02.7: [drm] *ERROR* GT0: IOV: Unable to confirm version 1.13 (0000000000000000)
[    4.485107] i915 0000:00:02.7: [drm] *ERROR* GT0: IOV: Found interface version 0.1.13.4
[    4.485816] i915 0000:00:02.7: [drm] VT-d active for gfx access
[    4.486150] i915 0000:00:02.7: [drm] Using Transparent Hugepages
[    4.486731] i915 0000:00:02.7: [drm] *ERROR* GT0: IOV: Unable to confirm version 1.13 (0000000000000000)
[    4.487056] i915 0000:00:02.7: [drm] *ERROR* GT0: IOV: Found interface version 0.1.13.4
[    4.487495] i915 0000:00:02.7: GuC firmware PRELOADED version 0.0 submission:SR-IOV VF
[    4.487815] i915 0000:00:02.7: HuC firmware PRELOADED
[    4.489388] i915 0000:00:02.7: [drm] Protected Xe Path (PXP) protected content support initialized
[    4.489710] i915 0000:00:02.7: [drm] PMU not supported for this GPU.
[    4.490091] [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.7 on minor 7
[    4.490548] i915 0000:00:02.0: Enabled 7 VFs

Any ideas how to proceed with eliminating the GUC_VERSION mismatch? I have tried to compile with #define GUC_VF_VERSION_LATEST_MINOR 13

@tristan-k
Copy link

For Meteor Lake support see my comment here.

@Ogglord
Copy link

Ogglord commented Sep 20, 2024

For Meteor Lake support see my comment here.

We have very similar problems, @tristan-k - except I'm on Alder Lake.. Lets crack this!

@Ogglord
Copy link

Ogglord commented Sep 20, 2024

I think I am one step closer (REF)
what I did was the following:

$ GUCFIRMWARE_MINOR=13 dkms install -m i915-sriov-dkms -v $(cat VERSION) -k $(uname -r) --force --kernelsourcedir /usr/src/linux-headers-$(uname -r)

and also copying all *.bin files from the official kernel tree, e.g.

$ mkdir firmware
$ cd firmware
$ wget -r -nd -e robots=no -A '*.bin' --accept-regex '/plain/' https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git/tree/i915/
$ mv *.bin /lib/firmware/i915/

This got rid of these rows

[    4.484778] i915 0000:00:02.7: [drm] *ERROR* GT0: IOV: Unable to confirm version 1.13 (0000000000000000)
[    4.485107] i915 0000:00:02.7: [drm] *ERROR* GT0: IOV: Found interface version 0.1.13.4

Finally, I read here that intel_gpu_top doesn't like to run on Virtual Functions. So I will continue on with trying passthrough to a VM.

@scyto
Copy link
Author

scyto commented Sep 20, 2024 via email

@rjblake
Copy link

rjblake commented Sep 20, 2024

I think I am one step closer (REF) what I did was the following:

$ GUCFIRMWARE_MINOR=13 dkms install -m i915-sriov-dkms -v $(cat VERSION) -k $(uname -r) --force --kernelsourcedir /usr/src/linux-headers-$(uname -r)

and also copying all *.bin files from the official kernel tree, e.g.

$ mkdir firmware
$ cd firmware
$ wget -r -nd -e robots=no -A '*.bin' --accept-regex '/plain/' https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git/tree/i915/
$ mv *.bin /lib/firmware/i915/

This got rid of these rows

[    4.484778] i915 0000:00:02.7: [drm] *ERROR* GT0: IOV: Unable to confirm version 1.13 (0000000000000000)
[    4.485107] i915 0000:00:02.7: [drm] *ERROR* GT0: IOV: Found interface version 0.1.13.4

Finally, I read here that intel_gpu_top doesn't like to run on Virtual Functions. So I will continue on with trying passthrough to a VM.

Will check on my system. Have the same errors, but other than the error appearing in the log, all VFs working without issue (Plex HW transcoding, Windows 11 VM with iGPU passthrough). Probably just start with updating i915 firmwares first.

Will update on findings.

@scyto
Copy link
Author

scyto commented Sep 20, 2024

did you try the GUCFIRMWARE without copying files (for example GUCFIRMWARE_MINOR=13.4 or 13 without copying files)
because the ERROR shows that the files are already there and being used. - it found the file and just didn't like the version number - maybe because the formats are different, not sure...

also so i can record it somewhere, this is a list of the difference in the files between the existing firmware directory and the one that comes with your download command

root@pve1:~/i915-latest-firmware# diff -qn . /lib/firmware/i915 | sort
Only in .: adlp_dmc_ver2_09.bin
Only in .: adlp_dmc_ver2_10.bin
Only in .: adlp_dmc_ver2_14.bin
Only in .: bmg_dmc.bin
Only in .: bxt_guc_32.0.3.bin
Only in .: bxt_guc_33.0.0.bin
Only in .: bxt_guc_49.0.1.bin
Only in .: cml_guc_33.0.0.bin
Only in .: cml_guc_69.0.3.bin
Only in .: cnl_dmc_ver1_06.bin
Only in .: dg1_guc_49.0.1.bin
Only in .: dg1_guc_70.1.1.bin
Only in .: dg1_huc_7.7.1.bin
Only in .: dg2_dmc_ver2_07.bin
Only in .: dg2_guc_70.4.1.bin
Only in .: ehl_guc_33.0.4.bin
Only in .: ehl_guc_69.0.3.bin
Only in .: glk_guc_32.0.3.bin
Only in .: glk_guc_33.0.0.bin
Only in .: glk_huc_ver03_01_2893.bin
Only in .: icl_guc_49.0.1.bin
Only in .: icl_guc_69.0.3.bin
Only in .: icl_huc_ver8_4_3238.bin
Only in .: kbl_dmc_ver1_01.bin
Only in .: kbl_guc_32.0.3.bin
Only in .: kbl_guc_49.0.1.bin
Only in .: kbl_guc_69.0.3.bin
Only in .: kbl_guc_ver9_14.bin
Only in .: kbl_huc_ver02_00_1810.bin
Only in /lib/firmware/i915: adlp_guc_70.19.2.bin
Only in /lib/firmware/i915: cml_guc_62.0.0.bin
Only in /lib/firmware/i915: cml_guc_70.1.1.bin
Only in /lib/firmware/i915: cml_huc_4.0.0.bin
Only in /lib/firmware/i915: dg1_dmc_ver2_02.bin
Only in /lib/firmware/i915: dg1_guc_70.bin
Only in /lib/firmware/i915: dg1_huc.bin
Only in /lib/firmware/i915: dg2_dmc_ver2_08.bin
Only in /lib/firmware/i915: dg2_guc_70.bin
Only in /lib/firmware/i915: dg2_huc_gsc.bin
Only in /lib/firmware/i915: glk_huc_4.0.0.bin
Only in /lib/firmware/i915: icl_huc_9.0.0.bin
Only in /lib/firmware/i915: kbl_dmc_ver1_04.bin
Only in /lib/firmware/i915: kbl_guc_62.0.0.bin
Only in /lib/firmware/i915: kbl_guc_70.1.1.bin
Only in /lib/firmware/i915: kbl_huc_4.0.0.bin
Only in /lib/firmware/i915: mtl_dmc.bin
Only in /lib/firmware/i915: mtl_gsc_1.bin
Only in /lib/firmware/i915: mtl_guc_70.bin
Only in /lib/firmware/i915: rkl_dmc_ver2_03.bin
Only in /lib/firmware/i915: skl_dmc_ver1_27.bin
Only in /lib/firmware/i915: skl_guc_62.0.0.bin
Only in /lib/firmware/i915: skl_guc_70.1.1.bin
Only in /lib/firmware/i915: tgl_dmc_ver2_12.bin
Only in /lib/firmware/i915: tgl_guc_69.0.3.bin
Only in /lib/firmware/i915: tgl_guc_70.bin
Only in .: mtl_dmc_ver2_10.bin
Only in .: rkl_dmc_ver2_02.bin
Only in .: skl_dmc_ver1_23.bin
Only in .: skl_guc_32.0.3.bin
Only in .: skl_guc_49.0.1.bin
Only in .: skl_guc_69.0.3.bin
Only in .: skl_guc_ver1.bin
Only in .: tgl_dmc_ver2_04.bin
Only in .: tgl_guc_35.2.0.bin
Only in .: tgl_guc_49.0.1.bin
Only in .: tgl_huc_7.0.12.bin
Only in .: tgl_huc_7.0.3.bin
Only in .: tgl_huc_7.5.0.bin
Only in .: xe2lpd_dmc.bin

i am not convinced copying files over would have made the difference unless you missed that step for the needed files in the gist?
( will try later when i get time)

@rjblake
Copy link

rjblake commented Sep 21, 2024

@scyto - you are correct. All I did was rebuild using the GUCFIRMWARE_MINOR=13 without copying files and the error is gone. I do see, however, that I have another error (apparently started with an earlier 6.8.x kernel) as follows:

[ 4.953984] xe 0000:00:02.5: Your graphics device 4680 is not officially supported
by xe driver in this kernel version. To force Xe probe,
use xe.force_probe='4680' and i915.force_probe='!4680'
module parameters or CONFIG_DRM_XE_FORCE_PROBE='4680' and
CONFIG_DRM_I915_FORCE_PROBE='!4680' configuration options.

Adding those to my grub cause the VFs to not be listed.

@Ogglord
Copy link

Ogglord commented Sep 21, 2024

Unfortunately I did the firmware step and GUCFIRMWARE_MINOR=13 simultaneously. But it sounds like the latter was the key?
If you add xe.force_probe='4680' and i915.force_probe='!4680' then you basically use xe instead of i915, which defeats the whole purpose, so ignore that message.

I have VFs working flawlessly in three simultaneous virtual machines with Win11. It feels super snappy as well. I did not have to add the GenuineIntel vendor stuff.

@Ogglord
Copy link

Ogglord commented Sep 21, 2024

I simply merged my existing firmware directory with the download command.

@scyto wrote:

Interesting, my error is for a different version, did the log show what interface version you had after copying all the firmware’s?

Here are the relevant lines from dmesg:

[    2.999219] i915 0000:00:02.0: [drm] Finished loading DMC firmware i915/adls_dmc_ver2_01.bin (v2.1)
[    3.009788] i915 0000:00:02.0: [drm] GT0: GuC firmware i915/tgl_guc_70.bin version 70.29.2
[    3.009793] i915 0000:00:02.0: [drm] GT0: HuC firmware i915/tgl_huc.bin version 7.9.3
...
    3.821415] i915 0000:00:02.1: [drm] GT0: GUC: interface version 0.1.13.4
[    3.821716] i915 0000:00:02.1: [drm] VT-d active for gfx access
[    3.821737] i915 0000:00:02.1: [drm] Using Transparent Hugepages
[    3.822091] i915 0000:00:02.1: [drm] GT0: GUC: interface version 0.1.13.4
[    3.822407] i915 0000:00:02.1: GuC firmware PRELOADED version 1.13 submission:SR-IOV VF
[    3.822409] i915 0000:00:02.1: HuC firmware PRELOADED
[    3.823905] i915 0000:00:02.1: [drm] Protected Xe Path (PXP) protected content support initialized
[    3.823911] i915 0000:00:02.1: [drm] PMU not supported for this GPU.
[    3.823961] [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.1 on minor 1
[    3.824151] i915 0000:00:02.0: vgaarb: VGA decodes changed: olddecodes=none,decodes=none:owns=io+mem
[    3.824155] i915 0000:00:02.1: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=none:owns=none
               use xe.force_probe='4690' and i915.force_probe='!4690'

From what I can tell, all versions are identical (to my dmesg from the earlier post).

@scyto
Copy link
Author

scyto commented Sep 22, 2024

Given none of the firmware version changed I am not clear why the copying would do anything, you can see this in the dmesg output before and after. Also when I did the directory compare there were no updated versions of the files our system uses. I will try the GUV minor version later today. Then we can validate if it is one or both changes.

@scyto
Copy link
Author

scyto commented Sep 22, 2024

This fixed it for me, no firmware copy needed. That said being on latest firmware is a good idea too.

GUCFIRMWARE_MINOR=4 dkms install -m i915-sriov-dkms -v 2024.08.09 --force --kernelsourcedir /usr/src/linux-headers-6.8.12-2-pve/

given this is a non breaking bug i wouldn't worry about it and i think it will be needed no matter what version drivers you have (replace the digit as needed)

@scyto
Copy link
Author

scyto commented Sep 22, 2024

and update, doing the file copy for updated fimrware didn't change version on mine at....

This was the versions of the firmwares.

[    4.330630] i915 0000:00:02.0: [drm] Finished loading DMC firmware i915/adlp_dmc.bin (v2.20)
[    4.340183] i915 0000:00:02.0: [drm] GT0: GuC firmware i915/adlp_guc_70.bin version 70.13.1
[    4.340188] i915 0000:00:02.0: [drm] GT0: HuC firmware i915/tgl_huc.bin version 7.9.3

And the error is the same....

[    4.942723] i915 0000:00:02.1: [drm] Using Transparent Hugepages
[    4.943225] i915 0000:00:02.1: [drm] *ERROR* GT0: IOV: Unable to confirm version 1.4 (0000000000000000)
[    4.943260] i915 0000:00:02.1: [drm] *ERROR* GT0: IOV: Found interface version 0.1.4.1
[    4.943611] i915 0000:00:02.1: GuC firmware PRELOADED version 0.0 submission:SR-IOV VF

as such copying firmwares seems to be utterly irrelevant

@scyto
Copy link
Author

scyto commented Sep 22, 2024

copying firmwares from intels repro made no diff either https://github.com/intel-gpu/intel-gpu-firmware/tree/main/firmware my machine uses the firmwaress above, i wonder if it is pulling them from some other location that /lib/firmware/i915/

--edit--
nope it isn't only one copy of each i think what the interface version is reported is hardware dependent (i.e. that decides what firmware is and isn't loaded)

your hardware loads i915/adls_dmc_ver2_01.bin (v2.1) mine loads i915/adlp_dmc.bin (v2.20) i think this is the root of the difference in 1.4 vs 1.9.

@supportinvis
Copy link

image

can you help me if i get error like this ?

@itachi737
Copy link

itachi737 commented Sep 30, 2024

Hi,

Did anybody manage to get it working on a linux VM? I followed the instructions right now, it works for Windows vms as well as unprivileged LXCs, but I can't get it to work for Linux VMs (I tried Mint and Manjaro). I keep on getting this error:

[ 4.729149] i915 0000:01:00.0: [drm] *ERROR* Device is non-operational; MMIO access returns 0xFFFFFFFF!
[ 4.729539] i915 0000:01:00.0: Device initialization failed (-5)
[ 4.729542] i915 0000:01:00.0: Please file a bug on drm/i915; see https://drm.pages.freedesktop.org/intel-docs/how-to-file-i915-bugs.html for details.
[ 4.729544] i915: probe of 0000:01:00.0 failed with error -5
[ 4.847984] xe 0000:01:00.0: Your graphics device 4680 is not officially supported by xe driver in this kernel version. To force Xe probe, use xe.force_probe='4680' and i915.force_probe='!4680' module parameters or CONFIG_DRM_XE_FORCE_PROBE='4680' and CONFIG_DRM_I915_FORCE_PROBE='!4680' configuration options.

The Linux VM see the Alder Lake iGPU, but the intel drivers won't load. I tried installing the drivers manually, but it says that they are already installed (as they should on kernel 6.8). So right now I'm at a loss. In my pve node, I also get the mismatched version 13.4 when it expects 13. I tried the GUCFIRMWARE_MINOR=13 fix, but it didn't change anything for me. I'm still seeing the error. Any recommendations on what to try next or if you know a fix would be greatly appreciated.

Thank you

@kamilllooo
Copy link

Hello everyone, has anyone of you encountered a similar problem and managed to solve it?
I tried different versions of the firmware by replacing the files with other versions, but it didn't solve my problem. I also installed different versions of the kernel, but it's still the same. Maybe someone has a suggestion what I should check/do. The whole situation happened after my last update of proxmox to the latest version. (I had a Pin made for the kernel, but it also stopped working)

#root@pve:~# dmesg | grep i915

[ 0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz-6.5.13-3-pve root=/dev/mapper/pve-root ro quiet intel_iommu=on iommu=pt i915.enable_guc=3 i915.max_vfs=7
[ 0.054742] Kernel command line: BOOT_IMAGE=/boot/vmlinuz-6.5.13-3-pve root=/dev/mapper/pve-root ro quiet intel_iommu=on iommu=pt i915.enable_guc=3 i915.max_vfs=7
[ 3.945953] i915 0000:00:02.0: Running in SR-IOV PF mode
[ 3.947527] i915 0000:00:02.0: [drm] VT-d active for gfx access
[ 3.947657] i915 0000:00:02.0: vgaarb: deactivate vga console
[ 3.947835] i915 0000:00:02.0: [drm] Using Transparent Hugepages
[ 3.948592] i915 0000:00:02.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
[ 3.950131] mei_hdcp 0000:00:16.0-b638ab7e-94e2-4ea2-a552-d1c54b627f04: bound 0000:00:02.0 (ops i915_hdcp_ops [i915])
[ 3.957507] i915 0000:00:02.0: [drm] Finished loading DMC firmware i915/adlp_dmc.bin (v2.20)
[ 3.963975] i915 0000:00:02.0: [drm] GT0: GuC firmware i915/adlp_guc_70.bin version 70.20.0
[ 3.963987] i915 0000:00:02.0: [drm] GT0: HuC firmware i915/tgl_huc.bin version 7.9.3
[ 3.991256] i915 0000:00:02.0: [drm] GT0: HuC: authenticated for all workloads!
[ 3.993557] i915 0000:00:02.0: [drm] GT0: GUC: submission enabled
[ 3.993565] i915 0000:00:02.0: [drm] GT0: GUC: SLPC enabled
[ 3.994172] i915 0000:00:02.0: [drm] GuC RC: enabled
[ 3.994273] i915 0000:00:02.0: [drm] ERROR GT0: GUC: mmio request 0x4100: failure 201/0
[ 3.994282] i915 0000:00:02.0: [drm] ERROR GT0: Failed to retrieve hwconfig table: -ENOENT
[ 3.996642] mei_pxp 0000:00:16.0-fbf6fcf1-96cf-4e2e-a6a6-1bab8cbe36b1: bound 0000:00:02.0 (ops i915_pxp_tee_component_ops [i915])
[ 3.996923] i915 0000:00:02.0: [drm] Protected Xe Path (PXP) protected content support initialized
[ 4.093060] [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.0 on minor 0
[ 4.095823] snd_hda_intel 0000:00:1f.3: bound 0000:00:02.0 (ops i915_audio_component_bind_ops [i915])
[ 4.096487] i915 0000:00:02.0: 7 VFs could be associated with this PF
[ 4.096943] i915 0000:00:02.0: [drm] Cannot find any crtc or sizes
[ 4.097410] i915 0000:00:02.0: [drm] Cannot find any crtc or sizes
[ 5.511500] i915 0000:00:02.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=io+mem
[ 5.511591] i915 0000:00:02.1: enabling device (0000 -> 0002)
[ 5.511622] i915 0000:00:02.1: Running in SR-IOV VF mode
[ 5.512192] i915 0000:00:02.1: [drm] GT0: GUC: interface version 0.1.9.0
[ 5.512823] i915 0000:00:02.1: [drm] VT-d active for gfx access
[ 5.512848] i915 0000:00:02.1: [drm] Using Transparent Hugepages
[ 5.513420] i915 0000:00:02.1: [drm] GT0: GUC: interface version 0.1.9.0
[ 5.514095] i915 0000:00:02.1: GuC firmware PRELOADED version 1.9 submission:SR-IOV VF
[ 5.514097] i915 0000:00:02.1: HuC firmware PRELOADED
[ 5.514294] i915 0000:00:02.1: [drm] ERROR GT0: GUC: mmio request 0x4100: failure 201/0
[ 5.514300] i915 0000:00:02.1: [drm] ERROR GT0: Failed to retrieve hwconfig table: -ENOENT
[ 5.516606] i915 0000:00:02.1: [drm] Protected Xe Path (PXP) protected content support initialized
[ 5.516613] i915 0000:00:02.1: [drm] PMU not supported for this GPU.
[ 5.516726] [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.1 on minor 1
[ 5.517069] i915 0000:00:02.0: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=io+mem
[ 5.517072] i915 0000:00:02.1: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=none
[ 5.517144] i915 0000:00:02.2: enabling device (0000 -> 0002)
[ 5.517168] i915 0000:00:02.2: Running in SR-IOV VF mode
[ 5.517558] i915 0000:00:02.2: [drm] GT0: GUC: interface version 0.1.9.0
[ 5.518188] i915 0000:00:02.2: [drm] VT-d active for gfx access
[ 5.518219] i915 0000:00:02.2: [drm] Using Transparent Hugepages
[ 5.518779] i915 0000:00:02.2: [drm] GT0: GUC: interface version 0.1.9.0
[ 5.519425] i915 0000:00:02.2: GuC firmware PRELOADED version 1.9 submission:SR-IOV VF
[ 5.519428] i915 0000:00:02.2: HuC firmware PRELOADED
[ 5.519719] i915 0000:00:02.2: [drm] ERROR GT0: GUC: mmio request 0x4100: failure 201/0
[ 5.519725] i915 0000:00:02.2: [drm] ERROR GT0: Failed to retrieve hwconfig table: -ENOENT
[ 5.521718] i915 0000:00:02.2: [drm] Protected Xe Path (PXP) protected content support initialized
[ 5.521723] i915 0000:00:02.2: [drm] PMU not supported for this GPU.
[ 5.521827] [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.2 on minor 2
[ 5.522168] i915 0000:00:02.0: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=io+mem
[ 5.522171] i915 0000:00:02.1: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=none
[ 5.522174] i915 0000:00:02.2: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=none
[ 5.522249] i915 0000:00:02.3: enabling device (0000 -> 0002)
[ 5.522274] i915 0000:00:02.3: Running in SR-IOV VF mode
[ 5.522556] i915 0000:00:02.3: [drm] GT0: GUC: interface version 0.1.9.0
[ 5.523003] i915 0000:00:02.3: [drm] VT-d active for gfx access
[ 5.523028] i915 0000:00:02.3: [drm] Using Transparent Hugepages
[ 5.523579] i915 0000:00:02.3: [drm] GT0: GUC: interface version 0.1.9.0
[ 5.524223] i915 0000:00:02.3: GuC firmware PRELOADED version 1.9 submission:SR-IOV VF
[ 5.524226] i915 0000:00:02.3: HuC firmware PRELOADED
[ 5.524472] i915 0000:00:02.3: [drm] ERROR GT0: GUC: mmio request 0x4100: failure 201/0
[ 5.524478] i915 0000:00:02.3: [drm] ERROR GT0: Failed to retrieve hwconfig table: -ENOENT
[ 5.526378] i915 0000:00:02.3: [drm] Protected Xe Path (PXP) protected content support initialized
[ 5.526384] i915 0000:00:02.3: [drm] PMU not supported for this GPU.
[ 5.526474] [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.3 on minor 3
[ 5.526777] i915 0000:00:02.0: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=io+mem
[ 5.526780] i915 0000:00:02.1: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=none
[ 5.526783] i915 0000:00:02.2: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=none
[ 5.526786] i915 0000:00:02.3: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=none
[ 5.526863] i915 0000:00:02.4: enabling device (0000 -> 0002)
[ 5.526886] i915 0000:00:02.4: Running in SR-IOV VF mode
[ 5.527198] i915 0000:00:02.4: [drm] GT0: GUC: interface version 0.1.9.0
[ 5.527629] i915 0000:00:02.4: [drm] VT-d active for gfx access
[ 5.527651] i915 0000:00:02.4: [drm] Using Transparent Hugepages
[ 5.528174] i915 0000:00:02.4: [drm] GT0: GUC: interface version 0.1.9.0
[ 5.528842] i915 0000:00:02.4: GuC firmware PRELOADED version 1.9 submission:SR-IOV VF
[ 5.528844] i915 0000:00:02.4: HuC firmware PRELOADED
[ 5.529131] i915 0000:00:02.4: [drm] ERROR GT0: GUC: mmio request 0x4100: failure 201/0
[ 5.529137] i915 0000:00:02.4: [drm] ERROR GT0: Failed to retrieve hwconfig table: -ENOENT
[ 5.530962] i915 0000:00:02.4: [drm] Protected Xe Path (PXP) protected content support initialized
[ 5.530968] i915 0000:00:02.4: [drm] PMU not supported for this GPU.
[ 5.531059] [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.4 on minor 4
[ 5.531388] i915 0000:00:02.0: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=io+mem
[ 5.531391] i915 0000:00:02.1: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=none
[ 5.531394] i915 0000:00:02.2: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=none
[ 5.531397] i915 0000:00:02.3: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=none
[ 5.531400] i915 0000:00:02.4: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=none
[ 5.531476] i915 0000:00:02.5: enabling device (0000 -> 0002)
[ 5.531500] i915 0000:00:02.5: Running in SR-IOV VF mode
[ 5.531723] i915 0000:00:02.5: [drm] GT0: GUC: interface version 0.1.9.0
[ 5.532229] i915 0000:00:02.5: [drm] VT-d active for gfx access
[ 5.532254] i915 0000:00:02.5: [drm] Using Transparent Hugepages
[ 5.532727] i915 0000:00:02.5: [drm] GT0: GUC: interface version 0.1.9.0
[ 5.533433] i915 0000:00:02.5: GuC firmware PRELOADED version 1.9 submission:SR-IOV VF
[ 5.533435] i915 0000:00:02.5: HuC firmware PRELOADED
[ 5.533649] i915 0000:00:02.5: [drm] ERROR GT0: GUC: mmio request 0x4100: failure 201/0
[ 5.533656] i915 0000:00:02.5: [drm] ERROR GT0: Failed to retrieve hwconfig table: -ENOENT
[ 5.535872] i915 0000:00:02.5: [drm] Protected Xe Path (PXP) protected content support initialized
[ 5.535878] i915 0000:00:02.5: [drm] PMU not supported for this GPU.
[ 5.535973] [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.5 on minor 5
[ 5.536279] i915 0000:00:02.0: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=io+mem
[ 5.536282] i915 0000:00:02.1: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=none
[ 5.536285] i915 0000:00:02.2: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=none
[ 5.536288] i915 0000:00:02.3: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=none
[ 5.536291] i915 0000:00:02.4: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=none
[ 5.536294] i915 0000:00:02.5: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=none
[ 5.536362] i915 0000:00:02.6: enabling device (0000 -> 0002)
[ 5.536386] i915 0000:00:02.6: Running in SR-IOV VF mode
[ 5.536629] i915 0000:00:02.6: [drm] GT0: GUC: interface version 0.1.9.0
[ 5.537129] i915 0000:00:02.6: [drm] VT-d active for gfx access
[ 5.537153] i915 0000:00:02.6: [drm] Using Transparent Hugepages
[ 5.537638] i915 0000:00:02.6: [drm] GT0: GUC: interface version 0.1.9.0
[ 5.538326] i915 0000:00:02.6: GuC firmware PRELOADED version 1.9 submission:SR-IOV VF
[ 5.538328] i915 0000:00:02.6: HuC firmware PRELOADED
[ 5.538525] i915 0000:00:02.6: [drm] ERROR GT0: GUC: mmio request 0x4100: failure 201/0
[ 5.538532] i915 0000:00:02.6: [drm] ERROR GT0: Failed to retrieve hwconfig table: -ENOENT
[ 5.540772] i915 0000:00:02.6: [drm] Protected Xe Path (PXP) protected content support initialized
[ 5.540779] i915 0000:00:02.6: [drm] PMU not supported for this GPU.
[ 5.540917] [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.6 on minor 6
[ 5.541250] i915 0000:00:02.0: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=io+mem
[ 5.541253] i915 0000:00:02.1: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=none
[ 5.541256] i915 0000:00:02.2: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=none
[ 5.541260] i915 0000:00:02.3: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=none
[ 5.541262] i915 0000:00:02.4: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=none
[ 5.541265] i915 0000:00:02.5: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=none
[ 5.541268] i915 0000:00:02.6: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=none
[ 5.541342] i915 0000:00:02.7: enabling device (0000 -> 0002)
[ 5.541367] i915 0000:00:02.7: Running in SR-IOV VF mode
[ 5.541613] i915 0000:00:02.7: [drm] GT0: GUC: interface version 0.1.9.0
[ 5.542121] i915 0000:00:02.7: [drm] VT-d active for gfx access
[ 5.542148] i915 0000:00:02.7: [drm] Using Transparent Hugepages
[ 5.542647] i915 0000:00:02.7: [drm] GT0: GUC: interface version 0.1.9.0
[ 5.543342] i915 0000:00:02.7: GuC firmware PRELOADED version 1.9 submission:SR-IOV VF
[ 5.543344] i915 0000:00:02.7: HuC firmware PRELOADED
[ 5.543543] i915 0000:00:02.7: [drm] ERROR GT0: GUC: mmio request 0x4100: failure 201/0
[ 5.543550] i915 0000:00:02.7: [drm] ERROR GT0: Failed to retrieve hwconfig table: -ENOENT
[ 5.545724] i915 0000:00:02.7: [drm] Protected Xe Path (PXP) protected content support initialized
[ 5.545731] i915 0000:00:02.7: [drm] PMU not supported for this GPU.
[ 5.545866] [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.7 on minor 7
[ 5.546073] i915 0000:00:02.0: Enabled 7 VFs

@azsde
Copy link

azsde commented Oct 7, 2024

Hi,

Did anybody manage to get it working on a linux VM? I followed the instructions right now, it works for Windows vms as well as unprivileged LXCs, but I can't get it to work for Linux VMs (I tried Mint and Manjaro). I keep on getting this error:

[ 4.729149] i915 0000:01:00.0: [drm] *ERROR* Device is non-operational; MMIO access returns 0xFFFFFFFF! [ 4.729539] i915 0000:01:00.0: Device initialization failed (-5) [ 4.729542] i915 0000:01:00.0: Please file a bug on drm/i915; see https://drm.pages.freedesktop.org/intel-docs/how-to-file-i915-bugs.html for details. [ 4.729544] i915: probe of 0000:01:00.0 failed with error -5 [ 4.847984] xe 0000:01:00.0: Your graphics device 4680 is not officially supported by xe driver in this kernel version. To force Xe probe, use xe.force_probe='4680' and i915.force_probe='!4680' module parameters or CONFIG_DRM_XE_FORCE_PROBE='4680' and CONFIG_DRM_I915_FORCE_PROBE='!4680' configuration options.

The Linux VM see the Alder Lake iGPU, but the intel drivers won't load. I tried installing the drivers manually, but it says that they are already installed (as they should on kernel 6.8). So right now I'm at a loss. In my pve node, I also get the mismatched version 13.4 when it expects 13. I tried the GUCFIRMWARE_MINOR=13 fix, but it didn't change anything for me. I'm still seeing the error. Any recommendations on what to try next or if you know a fix would be greatly appreciated.

Thank you

I am having the exact same issue and it is driving me absolutely crazy.

@zackbcom
Copy link

zackbcom commented Oct 11, 2024

Hi,

Did anybody manage to get it working on a linux VM? I followed the instructions right now, it works for Windows vms as well as unprivileged LXCs, but I can't get it to work for Linux VMs (I tried Mint and Manjaro). I keep on getting this error:

[ 4.729149] i915 0000:01:00.0: [drm] *ERROR* Device is non-operational; MMIO access returns 0xFFFFFFFF! [ 4.729539] i915 0000:01:00.0: Device initialization failed (-5) [ 4.729542] i915 0000:01:00.0: Please file a bug on drm/i915; see https://drm.pages.freedesktop.org/intel-docs/how-to-file-i915-bugs.html for details. [ 4.729544] i915: probe of 0000:01:00.0 failed with error -5 [ 4.847984] xe 0000:01:00.0: Your graphics device 4680 is not officially supported by xe driver in this kernel version. To force Xe probe, use xe.force_probe='4680' and i915.force_probe='!4680' module parameters or CONFIG_DRM_XE_FORCE_PROBE='4680' and CONFIG_DRM_I915_FORCE_PROBE='!4680' configuration options.

The Linux VM see the Alder Lake iGPU, but the intel drivers won't load. I tried installing the drivers manually, but it says that they are already installed (as they should on kernel 6.8). So right now I'm at a loss. In my pve node, I also get the mismatched version 13.4 when it expects 13. I tried the GUCFIRMWARE_MINOR=13 fix, but it didn't change anything for me. I'm still seeing the error. Any recommendations on what to try next or if you know a fix would be greatly appreciated.

Thank you

View this Issue for a MR/branch that works.

strongtz/i915-sriov-dkms#198 (comment)

Try PR #207 based on the 6.6 branch.

@itachi737
Copy link

@zackbcom Thank you. I'll look into it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment