- Install Docker and ensure you can run docker ps correctly, add yourself to the docker group if necessary.
- Run
build-rocm.sh
builds a ROCm docker image for your Linux System. It is configured to use therender
group configured in your Linux distribution and ensure that/dev/kfd
and/dev/dri
are writeable by the render group users. Ensure you are added to the render group if necessary. - Run
launch-rocm.sh
if you wish to only use ROCm with the docker image you built.
- Run
build-pytorch.sh
if you wish to build a PyTorch image for your Linux System. It is built as a separate docker image, on top of the ROCm docker image you built earlier. - Run
launch-pytorch.sh
if you wish to run PyTorch with the second image just now built. - Run
source /venv/bin/activate
within the container and you should be able to run all the cool PyTorch things you need.
Please see the updated instructions on the System76 documentation - they seem to have done a more up-to-date job of packaging it. With ROCM nearing 6.2.0 now I think it's high time newer versions are tested and documented too.
They seem to have a working
apt install rocm
step which is in many ways much simpler than this approach. The rest of the pytorch steps will be similar.Did your setup fail with a reboot and a
sudo
? Would be interesting to debug this; I currently do not have this setup to test, even. Feel free to email me and setup a screen share session if you want to go down this path. Best thing I can offer, may not be needed for your case though. 💁