Do Not Disturb, learning in progress...

Krishna Kumar Mishra xkrishnam

  • India
@xkrishnam
xkrishnam / contextualPolicy-n-arm-bandit.ipynb
Last active September 29, 2022 06:17
TensorFlow 2 implementation of a policy-gradient method for solving n-armed bandit problems.
@awjuliani
awjuliani / ContextualPolicy.ipynb
Last active October 11, 2022 21:27
A Policy-Gradient algorithm that solves Contextual Bandit problems.
@awjuliani
awjuliani / Q-Table Learning-Clean.ipynb
Last active October 25, 2022 07:57
Q-Table learning in OpenAI grid world.