Skip to content

Instantly share code, notes, and snippets.

View xkrishnam's full-sized avatar
Do Not Disturb , Learning in progress....

Krishna Kumar Mishra xkrishnam

Do Not Disturb , Learning in progress....
  • India
View GitHub Profile
@xkrishnam
xkrishnam / contextualPolicy-n-arm-bandit.ipynb
Last active September 29, 2022 06:17
tensorflow 2 implementation of Policy gradient method for solving n-armed bandit problems.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.