@dwleeKAIST
Forked from awjuliani/SimplePolicy.ipynb
Created August 3, 2017 00:47
Policy gradient method for solving n-armed bandit problems.