Skip to content

Instantly share code, notes, and snippets.

View decastro-alex's full-sized avatar

alex-decastro decastro-alex

View GitHub Profile
@decastro-alex
decastro-alex / pg-pong.py
Created September 7, 2019 22:15 — forked from karpathy/pg-pong.py
Training a Neural Network ATARI Pong agent with Policy Gradients from raw pixels
""" Trains an agent with (stochastic) Policy Gradients on Pong. Uses OpenAI Gym. """
import numpy as np
import cPickle as pickle
import gym
# hyperparameters
H = 200 # number of hidden layer neurons
batch_size = 10 # every how many episodes to do a param update?
learning_rate = 1e-4
gamma = 0.99 # discount factor for reward

Keybase proof

I hereby claim:

  • I am decastro-alex on github.
  • I am alex_decastro (https://keybase.io/alex_decastro) on keybase.
  • I have a public key ASBLi2xgS0cjfMF1ySDmWOqN5aQLsFTIbZg4ytR2YsyeYQo

To claim this, I am signing this object: