This is a Deep Reinforcement Learning solution to some classic control problems. I've used it to solve MountainCar-v0 problem, CartPole-v0 and [CartPole-v1] (https://gym.openai.com/envs/CartPole-v1) in OpenAI's Gym. This code uses Tensorflow to model a value function for a Reinforcement Learning agent. The code is fundamentally a translation of necnec's algorithm with Theano & Lasagne to Tensorflow. I've run it on Python 3.5 under Windows 7.
- Deep Learning tutorial, David Silver, Google DeepMind.
- necnec's algorithm