- Week 1: Bandit optimalitiess
- Lecture 1: Epsilon-Greedy & the multiarmed bandit problem | StarAi Deep Reinforcement Learning Course
- The Multi-Armed Bandit Problem and Its Solutions
- (22) The Multi-Armed Bandit Problem and Thompson Sampling - YouTube
- Solving the Multi-Armed Bandit Problem from Scratch in Python
- The Credit Assignment Problem - LessWrong 2.0
- Multi-armed bandit - Wikipedia
Created
February 12, 2020 18:45
-
-
Save veb-101/1c910ddbfabfb5ff183aedb6b1975717 to your computer and use it in GitHub Desktop.
additional material along with weekly lectures
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment