Skip to content

Instantly share code, notes, and snippets.

View fullstackwebdev's full-sized avatar

fullstack fullstackwebdev

View GitHub Profile
@migtissera
migtissera / mistral-7B-qlora.yaml
Last active March 30, 2024 22:08
Axolotl config to train a Mistral-7B base using QLoRA on 24GB GPU
base_model: /path/to/Mistral-7B-v0.1
model_type: AutoModelForCausalLM
tokenizer_type: AutoTokenizer
is_llama_derived_model: true
load_in_8bit: false
load_in_4bit: true
strict: false
datasets: