veb-101/Reinforcement learning - Swayam.md

## Reinforcement learning - Swayam.md

      
Week 1: Bandit optimalities

- Lecture 1: Epsilon-Greedy & the multiarmed bandit problem | StarAi Deep Reinforcement Learning Course
- The Multi-Armed Bandit Problem and Its Solutions
- The Multi-Armed Bandit Problem and Thompson Sampling - YouTube
- Solving the Multi-Armed Bandit Problem from Scratch in Python
- The Credit Assignment Problem - LessWrong 2.0
- Multi-armed bandit - Wikipedia