- Install Docker and make sure `docker ps` runs without errors; add yourself to the `docker` group if necessary.
- Run `build-rocm.sh` to build a ROCm Docker image for your Linux system. It is configured to use the `render` group configured in your Linux distribution and to ensure that `/dev/kfd` and `/dev/dri` are writable by members of that group. Make sure you are added to the `render` group if necessary.
- Run `launch-rocm.sh` if you only want to use ROCm with the Docker image you built.
- Run `build-pytorch.sh` if you want to build a PyTorch image for your Linux system. It is built as a separate Docker image on top of the ROCm Docker image you built earlier.
- Run `launch-pytorch.sh` if you want to run PyTorch with the second image you just built.
- Run `source /venv/bin/activate` within the container, and you should be able to run all the cool PyTorch things you need.
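
Before building or launching, it can help to confirm the prerequisites from the steps above. The following is a minimal pre-flight sketch, not part of the scripts in this gist: the helper names `in_group` and `check_device` are hypothetical, and it assumes the GPU render nodes appear as `/dev/dri/renderD*`, which is the usual layout but may differ on your system.

```shell
#!/usr/bin/env bash
# Pre-flight sketch (hypothetical helpers, not part of the gist's scripts).

# in_group GROUP [GROUP_LIST]
# Succeeds if GROUP appears in GROUP_LIST, which defaults to the groups of
# the current login session as reported by `id -nG`.
in_group() {
  local group="$1"
  local list="${2:-$(id -nG)}"
  case " $list " in
    *" $group "*) return 0 ;;
  esac
  return 1
}

# check_device PATH
# Reports whether PATH exists and is writable by the current user.
check_device() {
  local dev="$1"
  if [ ! -e "$dev" ]; then
    echo "missing: $dev"
    return 1
  elif [ ! -w "$dev" ]; then
    echo "not writable: $dev"
    return 2
  fi
  echo "ok: $dev"
}

# Group membership needed for Docker and for the GPU device nodes.
for g in docker render; do
  if ! in_group "$g"; then
    echo "not in group: $g (try: sudo usermod -aG $g \"\$USER\", then log in again)"
  fi
done

# Device nodes the ROCm container needs. If the glob matches nothing,
# the literal pattern is reported as missing.
for dev in /dev/kfd /dev/dri/renderD*; do
  check_device "$dev" || true
done
```

If every line prints `ok:` and no group warning appears, the host side is probably set up as the steps expect. Group changes made with `usermod` only take effect in a new login session, which is why the check reads the groups of the current session rather than the group database.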
Hi @guilt, thanks for documenting these steps. Not sure if this is the right place to comment, but I'm running into an issue when launching the ROCm Docker container. Even though /dev/kfd exists and I've given it all possible permissions, I get a "no such file or directory" error (below). I get a very similar error when I try to run the pytorch-rocm image per these instructions.
This is on Pop!_OS 22.04. Any ideas on how to debug?