@pkumusic
Created March 8, 2017 16:08
A3C with joint density model
Steps: 4.77M
Score: 2500, and it stays very stable once reached.
Outperforms the A3C+ score reported in the paper (142.5).
Speed: fast, same as the original A3C (11 iter/s).
Params: Nature,
ScheduledHyperParamSetter('learning_rate', [(80, 0.0003), (120, 0.0001)]),
ScheduledHyperParamSetter('entropy_beta', [(80, 0.005)]),
ScheduledHyperParamSetter('explore_factor', [(80, 2), (100, 3), (120, 4), (140, 5)]),
Next step: test other params, which could improve the result.
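
To make the schedules in the params easier to read, here is a minimal sketch of how such a list could resolve to a value at a given epoch. It assumes each pair is (epoch, value), that a value takes effect at its epoch and holds until the next pair, and that nothing is set before the first pair, so the initial value still applies. The names SCHEDULES and value_at are hypothetical helpers for illustration only and are not part of any library.

```python
import bisect

# Schedules copied from the params in these notes.
# Assumption: each pair is (epoch, value).
SCHEDULES = {
    "learning_rate": [(80, 0.0003), (120, 0.0001)],
    "entropy_beta": [(80, 0.005)],
    "explore_factor": [(80, 2), (100, 3), (120, 4), (140, 5)],
}


def value_at(schedule, epoch):
    """Return the latest scheduled value at `epoch` (step-wise, no interpolation).

    Returns None when the first schedule point has not been reached yet,
    meaning the parameter keeps whatever initial value it started with.
    """
    epochs = [e for e, _ in schedule]
    # Index of the last schedule point whose epoch is <= the query epoch.
    idx = bisect.bisect_right(epochs, epoch) - 1
    return schedule[idx][1] if idx >= 0 else None


if __name__ == "__main__":
    for name, schedule in SCHEDULES.items():
        for epoch in (79, 80, 100, 120, 140):
            print(name, epoch, value_at(schedule, epoch))
```

Under these assumptions, learning_rate is untouched before epoch 80, is 0.0003 for epochs 80 to 119, and is 0.0001 from epoch 120 onward.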