Create a gist now

Instantly share code, notes, and snippets.

@tilarids /README
Last active Aug 30, 2016

What would you like to do?
Policy Gradients (with & without TRPO).
More details and reproducing: https://github.com/tilarids/reinforcement_learning_playground
This specific commit was using to reproduce this: https://github.com/tilarids/reinforcement_learning_playground/commit/fd442e78ee4c93dfa38a3e83677b3d3cb3eefc90
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment