Skip to content

Instantly share code, notes, and snippets.

Avatar
💭
I may be slow to respond.

Murphy mashoujiang

💭
I may be slow to respond.
View GitHub Profile
View Pong-Playing TensorFlow Neural Network.ipynb
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@mashoujiang
mashoujiang / pg-pong.py
Created Mar 21, 2018 — forked from karpathy/pg-pong.py
Training a Neural Network ATARI Pong agent with Policy Gradients from raw pixels
View pg-pong.py
""" Trains an agent with (stochastic) Policy Gradients on Pong. Uses OpenAI Gym. """
import numpy as np
import cPickle as pickle
import gym
# hyperparameters
H = 200 # number of hidden layer neurons
batch_size = 10 # every how many episodes to do a param update?
learning_rate = 1e-4
gamma = 0.99 # discount factor for reward
You can’t perform that action at this time.