@dhpollack
Forked from awjuliani/SimplePolicy.ipynb
Last active February 9, 2017 12:23
Policy gradient method for solving n-armed bandit problems.
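
As a rough illustration of the technique named above, and not the notebook's own code, here is a minimal pure-Python sketch of a softmax policy trained with a REINFORCE-style update on an n-armed bandit. The bandit itself (`pull`, Gaussian rewards around `arm_means`), the running-mean baseline, and every hyperparameter are assumptions chosen for the example; the original gist uses TensorFlow and may define rewards and the loss differently.

```python
import math
import random


def softmax(weights):
    """Turn raw per-arm weights into a probability distribution."""
    peak = max(weights)  # subtract the max for numerical stability
    exps = [math.exp(w - peak) for w in weights]
    total = sum(exps)
    return [e / total for e in exps]


def pull(arm_mean, rng):
    """Hypothetical bandit: reward is the arm's mean plus unit Gaussian noise."""
    return arm_mean + rng.gauss(0.0, 1.0)


def train(arm_means, episodes=5000, lr=0.05, seed=0):
    """Learn one weight per arm by following the policy gradient."""
    rng = random.Random(seed)
    weights = [0.0] * len(arm_means)  # uniform policy to start
    baseline = 0.0                    # running mean of observed rewards
    for step in range(1, episodes + 1):
        probs = softmax(weights)
        # Sample an arm from the current policy.
        action = rng.choices(range(len(weights)), weights=probs)[0]
        reward = pull(arm_means[action], rng)
        advantage = reward - baseline
        # For a softmax policy, d log pi(action) / d weights[i]
        # equals 1[i == action] - probs[i].
        for i in range(len(weights)):
            grad = (1.0 if i == action else 0.0) - probs[i]
            weights[i] += lr * advantage * grad
        # Update the baseline after the gradient step (assumed design choice).
        baseline += (reward - baseline) / step
    return weights


if __name__ == "__main__":
    arm_means = [0.2, 0.0, -0.2, 2.0]  # assumed example values
    learned = train(arm_means)
    print("learned weights:", learned)
    print("preferred arm:", learned.index(max(learned)))
```

Subtracting a baseline does not change the expected gradient but usually reduces its variance; dropping it (setting `advantage = reward`) gives the plainest form of the update.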
@dhpollack (author) commented:

Using Python 3 with TensorFlow r0.12.
