Skip to content

Instantly share code, notes, and snippets.

@bhaktipriya
Forked from awjuliani/ContextualPolicy.ipynb
Created March 21, 2017 15:25
Show Gist options
  • Save bhaktipriya/23793921a930eb5ee4ef00773c703150 to your computer and use it in GitHub Desktop.
Save bhaktipriya/23793921a930eb5ee4ef00773c703150 to your computer and use it in GitHub Desktop.
A Policy-Gradient algorithm that solves Contextual Bandit problems.
Display the source blob
Display the rendered blob
Raw
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment