- Install Docker and make sure `docker ps` runs without errors. Add yourself to the `docker` group if necessary.
- Run `build-rocm.sh` to build a ROCm Docker image for your Linux system. It is configured to use the `render` group configured in your Linux distribution and to ensure that `/dev/kfd` and `/dev/dri` are writable by users in the `render` group. Make sure you are a member of the `render` group; add yourself if necessary.
- Run `launch-rocm.sh` if you only want to use ROCm with the Docker image you built.
- Run `build-pytorch.sh` if you want to build a PyTorch image for your Linux system. It is built as a separate Docker image on top of the ROCm Docker image you built earlier.
- Run `launch-pytorch.sh` if you want to run PyTorch with the second image you just built.
- Run `source /venv/bin/activate` inside the container, and you should be able to run all the cool PyTorch things you need.
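
The group and device requirements in the steps above can be checked before building. The following is a minimal sketch, not part of the repository: the helper names `in_group`, `writable`, and `preflight` are hypothetical, and checking the `/dev/dri/renderD*` nodes (rather than the `/dev/dri` directory itself) is an assumption about how the render devices appear on a typical Linux system.

```shell
#!/usr/bin/env bash
# Preflight sketch for the setup steps. Helper names are hypothetical;
# only the group names (docker, render) and /dev/kfd, /dev/dri come from the README.

# Succeeds if user $1 is a member of group $2.
in_group() {
    id -nG "$1" 2>/dev/null | tr ' ' '\n' | grep -qx -- "$2"
}

# Succeeds if path $1 exists and the current user can write to it.
writable() {
    [ -e "$1" ] && [ -w "$1" ]
}

# Prints one status line per check; the return code is the number of failed checks.
preflight() {
    local user group dev failures=0
    user="$(id -un)"
    for group in docker render; do
        if in_group "$user" "$group"; then
            echo "ok: $user is in group $group"
        else
            echo "missing: run 'sudo usermod -aG $group $user', then log out and back in"
            failures=$((failures + 1))
        fi
    done
    # Assumption: render nodes live at /dev/dri/renderD*. If the glob matches
    # nothing, the literal pattern is reported as not writable.
    for dev in /dev/kfd /dev/dri/renderD*; do
        if writable "$dev"; then
            echo "ok: $dev is writable"
        else
            echo "not writable or missing: $dev"
            failures=$((failures + 1))
        fi
    done
    return "$failures"
}

# Usage (the ./ prefix assumes the scripts sit in the current directory):
#   preflight && ./build-rocm.sh
```

If `preflight` reports a missing group, the `usermod` change only takes effect in a new login session, so `docker ps` may still fail until you log out and back in.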
I tried a few different user:group configurations (chown) for /dev/kfd. I did change the group back to video, but that didn't work either.
I got `sudo docker` working, but I seem to have broken my ROCm setup in the process (`rocminfo` now says "ROCk module is NOT loaded, possibly no GPU devices"). I'll try to do some more debugging, but a screen share might be helpful. I'll send you an email.