Mihir Ranade rmihir96
#!/usr/bin/env python
# coding: utf-8
from fastai.vision import *
from fastai.metrics import *
import glob
rmihir96 / pg-pong.py
Created November 23, 2018 22:04 — forked from karpathy/pg-pong.py
Training a Neural Network ATARI Pong agent with Policy Gradients from raw pixels
""" Trains an agent with (stochastic) Policy Gradients on Pong. Uses OpenAI Gym. """
import numpy as np
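try:
    import cPickle as pickle  # Python 2: faster C implementation
except ImportError:
    import pickle  # Python 3: cPickle was merged into pickle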
import gym
# hyperparameters
H = 200 # number of hidden layer neurons
batch_size = 10 # number of episodes between parameter updates
learning_rate = 1e-4
gamma = 0.99 # discount factor for reward