Skip to content

Instantly share code, notes, and snippets.

View greydanus's full-sized avatar

Sam greydanus

View GitHub Profile
@karpathy
karpathy / pg-pong.py
Created May 30, 2016 22:50
Training a Neural Network ATARI Pong agent with Policy Gradients from raw pixels
""" Trains an agent with (stochastic) Policy Gradients on Pong. Uses OpenAI Gym. """
import numpy as np
import cPickle as pickle
import gym
# hyperparameters
H = 200 # number of hidden layer neurons
batch_size = 10 # every how many episodes to do a param update?
learning_rate = 1e-4
gamma = 0.99 # discount factor for reward
@Nagasaki45
Nagasaki45 / sa.py
Created November 12, 2013 21:53
Simplest simulated annealing algorithm.
import numpy as np
class Annealer():
def __init__(self, step_function, energy_function):
self.step_function = step_function
self.energy_function = energy_function
def run(
self, state, temperature, room_temperature, cooling_factor