Automatic script to do this easily:
wget https://gist.githubusercontent.com/paralin/90623c77f167fe7a816ef6bae3caf574/raw/9b6ff17fabb26a2bb845f72dc94e941f0d241755/llama-metal.sh
To use it:
export LLM_PROMPT="Who is JFK?"
bash ./llama-metal.sh
Full information on running the 13b model using metal:
- git clone https://github.com/ggerganov/llama.cpp
- cd llama.cpp
- Building llama.cpp: MacOS: LLAMA_METAL=1 make