* bug#55907: VFIO kernel module fails to capture PCI device
@ 2022-06-11 13:40 Nick Zalutskiy
  2023-09-08  8:43 ` Lars Rustand
  2024-07-04 21:37 ` Nikola Brković via Bug reports for GNU Guix
  0 siblings, 2 replies; 3+ messages in thread
From: Nick Zalutskiy @ 2022-06-11 13:40 UTC (permalink / raw)
  To: 55907

Hello all,

I am trying to capture my graphics card in the initrd, using vfio, so that I can later pass it through to a virtual machine. Judging by dmesg, the VFIO module does load early; however, the card is not captured at that point, and the amdgpu driver is later loaded instead.

This is what I have in my `operating-system` config:

> (kernel-arguments '("iommu=pt" "vfio-pci.ids=1002:73bf"))
> (initrd-modules (cons* "vfio_pci" "vfio" "vfio_iommu_type1" "vfio_virqfd" %base-initrd-modules))

There are two video cards in the system, both AMD, but different models. The video card of interest is in a separate IOMMU group and the <vendor id>:<device id> combination is correct for my machine.

As best I can tell, the `vfio-pci.ids` argument is not propagated to the module by the initramfs. See the following:

Searching online, I came across a GitHub issue for a different initramfs generator that exhibited the same symptoms: the VFIO module was loaded and the kernel arguments were correct, yet the card was not captured by the vfio driver. The maintainer there did a great job tracking down and fixing the issue, and came up with this insight: https://github.com/anatol/booster/issues/20#issuecomment-808956316

> After reading kmod code I found that kernel does not use cmdline params for loadable modules. It was surprising for me. Instead it is expected that userspace handles cmdline parsing and provides required module params explicitly.
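
To make the quoted point concrete, here is a minimal Guile sketch of what "userspace handles cmdline parsing" could look like: it picks the `module.param=value` entries for one module out of a command-line string. The procedure names `normalize-module-name` and `module-parameters` are made up for this illustration; this is not code from Guix's initrd or from booster.

```scheme
(use-modules (srfi srfi-1))

;; Module names treat "-" and "_" as equivalent, so compare them
;; in a normalized form.
(define (normalize-module-name name)
  (string-map (lambda (c) (if (char=? c #\-) #\_ c)) name))

;; Return the list of "param=value" strings that CMDLINE addresses
;; to MODULE (hypothetical helper, for illustration only).
(define (module-parameters cmdline module)
  (let ((prefix (string-append (normalize-module-name module) ".")))
    (filter-map
     (lambda (arg)
       (let ((dot (string-index arg #\.)))
         (and dot
              (string=? (normalize-module-name (string-take arg (+ dot 1)))
                        prefix)
              (string-drop arg (+ dot 1)))))
     ;; Split the command line on whitespace.
     (string-tokenize cmdline))))

(module-parameters "iommu=pt vfio-pci.ids=1002:73bf" "vfio_pci")
;; => ("ids=1002:73bf")
```

An initramfs that behaves the way the maintainer describes would then hand the resulting strings to the module loader as the module's options, instead of relying on the kernel to do it.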

Another way to attach the correct driver to the GPU is to run a script in the initrd, which I don't know how to accomplish with Guix. This approach has the advantage of working with two identical video cards (or disks, etc.). See https://wiki.archlinux.org/title/PCI_passthrough_via_OVMF#Using_identical_guest_and_host_GPUs

I tried following the kernel docs to rebind a different driver after boot, but I believe this doesn't work for video cards, and hasn't worked for me.
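
For reference, a post-boot rebind of this kind can be sketched in Guile roughly as follows. The sysfs files used (`driver/unbind`, `driver_override`, `drivers_probe`) are standard PCI sysfs entries; the PCI address in the commented-out call is a placeholder, and `rebind-steps` and `write-sysfs` are names invented for this sketch. It must run as root, and it is only an illustration of the sequence, not a confirmed fix for the problem described here.

```scheme
;; Return, in order, the (path . value) sysfs writes that move the
;; PCI device at ADDRESS over to DRIVER.
(define (rebind-steps address driver)
  (let ((device (string-append "/sys/bus/pci/devices/" address)))
    (list
     ;; 1. Detach the device from its current driver.  This file only
     ;;    exists while some driver is bound to the device.
     (cons (string-append device "/driver/unbind") address)
     ;; 2. Allow only DRIVER to bind to this device from now on.
     (cons (string-append device "/driver_override") driver)
     ;; 3. Ask the kernel to probe the device again.
     (cons "/sys/bus/pci/drivers_probe" address))))

;; Write VALUE to the sysfs file at PATH.
(define (write-sysfs path value)
  (call-with-output-file path
    (lambda (port)
      (display value port))))

;; Example call (placeholder address, replace with your own):
;; (for-each (lambda (step) (write-sysfs (car step) (cdr step)))
;;           (rebind-steps "0000:04:00.0" "vfio-pci"))
```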

Any help is greatly appreciated!

Links:
Kernel docs for vfio
https://www.kernel.org/doc/html/latest/driver-api/vfio.html

Arch guide for GPU passthrough
https://wiki.archlinux.org/title/PCI_passthrough_via_OVMF

Thank you!

-Nick

* bug#55907: VFIO kernel module fails to capture PCI device
  2022-06-11 13:40 bug#55907: VFIO kernel module fails to capture PCI device Nick Zalutskiy
@ 2023-09-08  8:43 ` Lars Rustand
  2024-07-04 21:37 ` Nikola Brković via Bug reports for GNU Guix
  1 sibling, 0 replies; 3+ messages in thread
From: Lars Rustand @ 2023-09-08  8:43 UTC (permalink / raw)
  To: 55907

Hello Nick,

Did you ever figure this out? I am struggling with the same problem.


Thank you,

- Lars

* bug#55907: VFIO kernel module fails to capture PCI device
  2022-06-11 13:40 bug#55907: VFIO kernel module fails to capture PCI device Nick Zalutskiy
  2023-09-08  8:43 ` Lars Rustand
@ 2024-07-04 21:37 ` Nikola Brković via Bug reports for GNU Guix
  1 sibling, 0 replies; 3+ messages in thread
From: Nikola Brković via Bug reports for GNU Guix @ 2024-07-04 21:37 UTC (permalink / raw)
  To: 55907@debbugs.gnu.org

I have managed to get VFIO working by creating a service of boot-service-type which overrides the GPU driver with vfio-pci and binds the GPU to VFIO:

> (simple-service 'vfio-override boot-service-type
>   '(and
>      ;; Allow only vfio-pci to bind to the device at this PCI address.
>      (call-with-output-file "/sys/bus/pci/devices/0000:04:00.0/driver_override"
>        (lambda (p)
>          (display "vfio-pci" p)))
>      ;; Register the GPU's vendor and device IDs with vfio-pci.
>      (call-with-output-file "/sys/bus/pci/drivers/vfio-pci/new_id"
>        (lambda (p)
>          (display "1002 665f" p)))))

Sorry for the hard-coded IDs; you should replace them with your own. You might also need to unbind the GPU's audio device from its driver once the system has fully booted. QEMU will refuse to pass through the GPU if the audio device is in the same IOMMU group and is not using vfio-pci.
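
To check which devices share an IOMMU group with the GPU and which driver each one currently uses, something like the following Guile sketch may help. It only reads sysfs, so it does not need root. The procedure name `iommu-group-drivers` is invented for this example, and the address in the sample call is the same placeholder as in the service above.

```scheme
(use-modules (ice-9 ftw))

;; Return an alist mapping every PCI device in the same IOMMU group
;; as ADDRESS to the name of its bound driver, or #f if it has none.
(define (iommu-group-drivers address)
  (let* ((group (string-append "/sys/bus/pci/devices/" address
                               "/iommu_group/devices"))
         ;; scandir returns #f when the directory cannot be read.
         (devices (or (scandir group
                               (lambda (name)
                                 (not (member name '("." "..")))))
                      '())))
    (map (lambda (dev)
           (let ((link (string-append "/sys/bus/pci/devices/" dev
                                      "/driver")))
             ;; The "driver" symlink is present only while a driver
             ;; is bound to the device.
             (cons dev
                   (and (file-exists? link)
                        (basename (readlink link))))))
         devices)))

;; Example call (placeholder address):
;; (iommu-group-drivers "0000:04:00.0")
```

Any entry in the result whose driver is something other than "vfio-pci" is a candidate for the unbinding described above.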

In my case, the service runs early enough in the boot process that amdgpu has not yet initialized the GPU. There might be a better way to accomplish this; I'm still new to Guix and Scheme.

Thanks,
Nikola
