Skip to content

Instantly share code, notes, and snippets.

View amirreza-m95's full-sized avatar

Amir Reza amirreza-m95

View GitHub Profile
@amirreza-m95
amirreza-m95 / pg-pong.py
Last active September 1, 2020 16:19 — forked from karpathy/pg-pong.py
Training a Neural Network ATARI Pong agent with Policy Gradients from raw pixels
""" Trains an agent with (stochastic) Policy Gradients on Pong. Uses OpenAI Gym. """
import numpy as np
import _pickle as pickle
import gym
# hyperparameters
H = 200 # number of hidden layer neurons
batch_size = 10 # every how many episodes to do a param update?