Skip to content

Instantly share code, notes, and snippets.

View sebasibarguen's full-sized avatar

Sebastian Ibarguen sebasibarguen

View GitHub Profile
@sebasibarguen
sebasibarguen / rl_pong.py
Created May 17, 2018 03:54 — forked from greydanus/rl_pong.py
Solves Pong with Policy Gradients in Tensorflow.
'''Solves Pong with Policy Gradients in Tensorflow.'''
# written October 2016 by Sam Greydanus
# inspired by karpathy's gist.github.com/karpathy/a4166c7fe253700972fcbc77e4ea32c5
import numpy as np
import gym
import tensorflow as tf
# hyperparameters
n_obs = 80 * 80 # dimensionality of observations
h = 200 # number of hidden layer neurons