@kiwamizamurai
Created January 25, 2020 07:29
Bandit-tutorial.ipynb
@kiwamizamurai

kiwamizamurai commented Jan 25, 2020

This has gotten long, so I'll stop here for now.

In the next Gist, I plan to implement

  • UCB
  • Thompson Sampling

and, if possible, go as far as a simple MDP.

@kiwamizamurai

I misread the final comparison with Softmax.

Orange is the Bandit, so it actually performs slightly better.
