Your GPU customers can't profile their CUDA/PyTorch workloads because NCU profiling is disabled by default:
$ ncu python script.py
==ERROR== ERR_NVGPUCTRPERM: Permission deniedRun this once on each GPU host:
echo "options nvidia NVreg_RestrictProfilingToAdminUsers=0" | sudo tee /etc/modprobe.d/nvprof.conf
sudo rebootThat's it. All containers on that host can now use NCU.
Docker probably needs one more flag to be added
Official NVIDIA guidance: NVIDIA Developer Docs