Skip to content

Instantly share code, notes, and snippets.

@liyougeng
liyougeng / pg-pong.py
Created March 24, 2017 15:31 — forked from karpathy/pg-pong.py
Training a Neural Network ATARI Pong agent with Policy Gradients from raw pixels
""" Trains an agent with (stochastic) Policy Gradients on Pong. Uses OpenAI Gym. """
import numpy as np
import cPickle as pickle
import gym
# hyperparameters
H = 200 # number of hidden layer neurons
batch_size = 10 # every how many episodes to do a param update?
learning_rate = 1e-4
gamma = 0.99 # discount factor for reward
@liyougeng
liyougeng / 128x128_train.prototxt
Created November 5, 2016 02:24 — forked from kjw0612/128x128_train.prototxt
Learning to generate chairs proto
name: "CaffeNet"
layers {
name: "data"
type: DATA
top: "data"
top: "label"
data_param {
source: "@YOUR_PATH_TO_DATA@/chairs_128x128_reduced/data-lmdb"
batch_size: 64
scale: 0.00390625