Jon Durbin jondurbin

## check_copyright.py
import aiohttp
import asyncio
import os
import re
from loguru import logger
from playwright.async_api import async_playwright


async def check_site(browser, domain, status):
    page = await browser.new_page()

## create_tokenizer.py
import re
import gc
import os
import glob
import json
from copy import deepcopy
from datasets import concatenate_datasets, Dataset
from transformers import AutoTokenizer
from huggingface_hub import snapshot_download

## airoboros-m-7b-3.1.2.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              2 stars
            
          
                jondurbin
                / airoboros-m-7b-3.1.2.md
            
            
              Last active
              October 21, 2023 12:52
            
              
                airoboros-m-7b-3.1.2.md
              
          
    Trained on 10x a6000 GPUs on runpod.io.
I actually ran many fine-tunes, including multiple full-finetunes, fp16 loras, and qloras, and the below qlora actually did best in my testing.
dataset: https://hf.co/datasets/jondurbin/airoboros-3.1 (plus a few unpublished de-censoring instructions)
training script: https://github.com/jondurbin/qlora specifically commit 8cd269bf9bd7753c92164934269019e12f23314f
export BASE_DIR=/workspace

  
## airoboros-m-7b-3.0-tuning.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                jondurbin
                / airoboros-m-7b-3.0-tuning.md
            
            
              Created
              October 2, 2023 15:07
            
              
                airoboros-m-7b-3.0-tuning.md
              
          
    My fork of qlora (but using the --full-finetune option), commit b5771f3caa9a5ea3ec397526a09720e957dc03d0 (main branch as of 2023-10-02):
6x 80GB a100s on runpod.io
Dataset:
https://hf.co/datasets/jondurbin/airoboros-3.0 (private currently, until fine-tunes are finished)
Script:


## airoboros-l2-7b-3.0-training.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                jondurbin
                / airoboros-l2-7b-3.0-training.md
            
            
              Created
              September 30, 2023 12:52
            
              
                airoboros-l2-7b-3.0-training.md
              
          
    Using my fork of qlora at this commit:
https://github.com/jondurbin/qlora/tree/ef708769c9365eb86cf173aa9fc7c37e2d772773
Airoboros dataset 3.0:
https://hf.co/datasets/jondurbin/airoboros-3.0
7x h100 servers on https://runpod.io/
export BASE_DIR=/workspace

  
## airoboros-l2-70b-2.2-tuning.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                jondurbin
                / airoboros-l2-70b-2.2-tuning.md
            
            
              Created
              September 12, 2023 14:32
            
              
                airoboros-l2-70b-2.2-tuning
              
          
    Trained on 7x 80gb a100 nodes in runpod.
Dataset: https://hf.co/datasets/jondurbin/airoboros-2.2 (specifically, instructions.jsonl)
My fork of qlora: https://github.com/jondurbin/qlora
Merged with qmerge.py from my fork of qlora, similar to:
python qlora/qmerge.py --base llama-2-70b-hf --peft spicyboros-70b-2.2-checkpoints/checkpoint-1173/model_adapter --out spicyboros-2.2

  
## spicyboros-70b-2.2-tuning.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                jondurbin
                / spicyboros-70b-2.2-tuning.md
            
            
              Last active
              September 10, 2023 23:47
            
              
                spicyboros-70b-2.2-tuning
              
          
    Trained on 8x 80gb a100 nodes in runpod.
Dataset: https://hf.co/datasets/jondurbin/airoboros-2.2 (specifically, instructions.jsonl)
My fork of qlora: https://github.com/jondurbin/qlora
Note: the final selected checkpoint used to merge the model was checkpoint-750!
Merged with qmerge.py from my fork of qlora, similar to:


## airoboros-l2-7b-2.2-tuning.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                jondurbin
                / airoboros-l2-7b-2.2-tuning.md
            
            
              Last active
              September 10, 2023 11:41
            
              
                airoboros-l2-7b-2.2-tuning
              
          
    Dataset used: https://huggingface.co/datasets/jondurbin/airoboros-2.2
Specifically, the instructions-clean.jsonl file.
Fine-tuned with my fork of qlora: https://github.com/jondurbin/qlora
This was a full fine-tune (yes, the script is called qlora, but I used the --full_finetune option)
export BASE_DIR=/workspace
export WANDB_API_KEY=[redacted]

  
## airoboros-l2-13b-2.2-tuning.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                jondurbin
                / airoboros-l2-13b-2.2-tuning.md
            
            
              Created
              September 10, 2023 11:32
            
              
                airoboros-l2-13b-2.2-tuning
              
          
    Dataset used: https://huggingface.co/datasets/jondurbin/airoboros-2.2
Specifically, the instructions-clean.jsonl file.
Fine-tuned with my fork of qlora: https://github.com/jondurbin/qlora
8x 80gb a100s
This was a full fine-tune (yes, the script is called qlora, but I used the --full_finetune option)


## spicyboros-13b-2.2-tuning.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                jondurbin
                / spicyboros-13b-2.2-tuning.md
            
            
              Last active
              September 10, 2023 11:33
            
              
                spicyboros-13b-2.2-tuning
              
          
    Dataset used: https://huggingface.co/datasets/jondurbin/airoboros-2.2
Specifically, the instructions.jsonl file.
Fine-tuned with my fork of qlora: https://github.com/jondurbin/qlora
8x 80gb a100s
This was a full fine-tune (yes, the script is called qlora, but I used the --full_finetune option)
	import aiohttp
	import asyncio
	import os
	import re
	from loguru import logger
	from playwright.async_api import async_playwright


	async def check_site(browser, domain, status):
	page = await browser.new_page()
	import re
	import gc
	import os
	import glob
	import json
	from copy import deepcopy
	from datasets import concatenate_datasets, Dataset
	from transformers import AutoTokenizer
	from huggingface_hub import snapshot_download