Skip to content

Instantly share code, notes, and snippets.

View aonotas's full-sized avatar

Motoki Sato aonotas

View GitHub Profile
@Smerity
Smerity / cartpole.py
Last active May 26, 2017 13:47
Script for Cartpole using policy gradient via Chainer, two layer MLP, dropout, and rejection sampling of historical memories
''' Script for Cartpole using policy gradient via Chainer, two layer MLP, dropout, and rejection sampling of historical memories '''
import gym
import numpy as np
import chainer
from chainer import optimizers
from chainer import ChainList, Variable
import chainer.functions as F