Skip to content

Instantly share code, notes, and snippets.

View TejaGollapudi's full-sized avatar

Teja Gollapudi TejaGollapudi

View GitHub Profile
@TejaGollapudi
TejaGollapudi / generate_alpaca_dataset.py
Created April 5, 2023 00:50
Convert Alpaca dataset into csv format
import json
import pandas as pd
with open('alpaca_data.json') as f:
data = json.load(f)
new_format = []
for i, point in enumerate(data):
@TejaGollapudi
TejaGollapudi / lora_seq2seq.py
Last active April 4, 2023 21:54
LORA training for T5 , UL2 models using Deepspeed and Accelerate
# Modified from https://github.com/huggingface/peft/blob/main/examples/conditional_generation/peft_lora_seq2seq_accelerate_ds_zero3_offload.py
import argparse
import gc
import logging
import os
import threading
import psutil
import torch
from accelerate import Accelerator
@TejaGollapudi
TejaGollapudi / template_config.yaml
Last active April 28, 2023 15:28
Template accelerate config for training the model with deepspeed and accelerate with CPU offsetting
compute_environment: LOCAL_MACHINE
deepspeed_config:
gradient_accumulation_steps: 22
offload_optimizer_device: cpu
offload_param_device: cpu
zero3_init_flag: true
zero3_save_16bit_model: false
zero_stage: 3
distributed_type: DEEPSPEED
downcast_bf16: 'no'
@TejaGollapudi
TejaGollapudi / merge_weights.py
Created April 4, 2023 21:40
Merge PEFT-LORA model's weights with the base model's weights
# Modified from https://github.com/lvwerra/trl/blob/main/examples/sentiment/scripts/gpt-neox-20b_peft/merge_peft_adapter.py
import argparse
import os
import peft
import torch
from peft import PeftConfig, PeftModel
from transformers import AutoTokenizer, T5ForConditionalGeneration
@TejaGollapudi
TejaGollapudi / conda_env.yaml
Created April 4, 2023 21:37
Conda environment to train a Seq2Seq model using LORA and Deepspeed
name: lora_training
channels:
- pytorch
- conda-forge
dependencies:
- cudatoolkit-dev=11.3.1
- numpy=1.24.2
- pip=23.0.1
- python=3.8.16
- pip:
@TejaGollapudi
TejaGollapudi / Demo.ipynb
Created June 30, 2022 04:16
BERT pretraining demo notebook
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.