Skip to content

Instantly share code, notes, and snippets.

@nunofernandes-plight
Forked from awjuliani/SimplePolicy.ipynb
Created February 7, 2018 13:26
Show Gist options
  • Save nunofernandes-plight/b549b0c9cd90c63d94dd2cb5fa0b2f9a to your computer and use it in GitHub Desktop.
Save nunofernandes-plight/b549b0c9cd90c63d94dd2cb5fa0b2f9a to your computer and use it in GitHub Desktop.
Policy gradient method for solving n-armed bandit problems.
Display the source blob
Display the rendered blob
Raw
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment