Running LLama 3 8B Instruct on a single consumer grade AMD graphics card
LLama 3 8B is one of the newest models released by Meta. It punches above its weight class, offering high performance with lower resource consumption than some 22B or 70B models. Its size and output quality makes it interesting for me for systems handeling sensitive data when run locally/offline.
Prerequisites
This guide uses AMD GPUs, with Nvidia you can skip the ROCm install and dependencies to run the script.
Operating System: Ubuntu 22.04