Skip to content

Instantly share code, notes, and snippets.

@bhaktipriya
bhaktipriya / ContextualPolicy.ipynb
Created March 21, 2017 15:25 — forked from awjuliani/ContextualPolicy.ipynb
A Policy-Gradient algorithm that solves Contextual Bandit problems.
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@bhaktipriya
bhaktipriya / SimplePolicy.ipynb
Created March 21, 2017 15:19 — forked from awjuliani/SimplePolicy.ipynb
Policy gradient method for solving n-armed bandit problems.
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.