- Install Docker and ensure you can run docker ps correctly, add yourself to the docker group if necessary.
- Run
build-rocm.sh
builds a ROCm docker image for your Linux System. It is configured to use therender
group configured in your Linux distribution and ensure that/dev/kfd
and/dev/dri
are writeable by the render group users. Ensure you are added to the render group if necessary. - Run
launch-rocm.sh
if you wish to only use ROCm with the docker image you built.
- Run
build-pytorch.sh
if you wish to build a PyTorch image for your Linux System. It is built as a separate docker image, on top of the ROCm docker image you built earlier. - Run
launch-pytorch.sh
if you wish to run PyTorch with the second image just now built. - Run
source /venv/bin/activate
within the container and you should be able to run all the cool PyTorch things you need.
I ended up needing to use a live USB to repair my Pop OS install (I believe one of the initramfs commands I ran messed something up because I was unable to login after a reboot). After that I installed ROCM again via these instructions. Then, instead of using docker desktop, I installed docker.io via these commands:
sudo apt install docker.io sudo usermod -aG docker $USER
Docker now only works via sudo, but I was able to get the ROCM-pytorch image to successfully run by running these commands as sudo:
For reference this is what /dev/kfd looks like:
Haven't tried stable diffusion or running anything in pytorch yet, but at least I'm past the previous error message. Thanks again for your help.