Created
May 29, 2017 15:03
-
-
Save lmclupr/d6a324cb330e887640b2cd852d0a2252 to your computer and use it in GitHub Desktop.
Clean and concise, superb!
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Used TD(0) method to solve using the Lunar Lander V1. Used 4 individual feedforward neural networks (one per action) for function approximation using a Keras and Tensorflow stack.