@kaiokendev
kaiokendev / 4Bit Lora Trainer Notebook.ipynb
Last active October 16, 2023 17:26
4-bit LoRA Training Notebook
@karpathy
karpathy / pg-pong.py
Created May 30, 2016 22:50
Training a Neural Network ATARI Pong agent with Policy Gradients from raw pixels
""" Trains an agent with (stochastic) Policy Gradients on Pong. Uses OpenAI Gym. """
import numpy as np
import pickle  # Python 3; the original Python 2 script imported cPickle as pickle
import gym
# hyperparameters
H = 200 # number of hidden layer neurons
batch_size = 10 # number of episodes between parameter updates
learning_rate = 1e-4
gamma = 0.99 # discount factor for reward