shangeth/dl_drl_projects.md

## dl_drl_projects.md

      
Deep Reinforcement Learning


Monte Carlo Prediction

A reinforcement learning agent for the game of Blackjack, built with the Monte Carlo prediction method.
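As a rough illustration of the idea, the sketch below estimates state values by averaging first-visit returns over sampled episodes. It is a minimal sketch, not the project's code: the function name `first_visit_mc_prediction`, the `(player_sum, dealer_card, usable_ace)` state layout, and the three hand-written episodes are all assumptions made for the example.

```python
from collections import defaultdict


def first_visit_mc_prediction(episodes, gamma=1.0):
    """Estimate V(s) as the average first-visit return over episodes.

    Each episode is a list of (state, reward) pairs, where `reward` is the
    reward received after acting in `state`.
    """
    returns_sum = defaultdict(float)
    visit_count = defaultdict(int)
    for episode in episodes:
        # Record the time step at which each state is first visited.
        first_visit = {}
        for t, (state, _) in enumerate(episode):
            first_visit.setdefault(state, t)
        # Walk backwards, accumulating the discounted return G_t.
        g = 0.0
        for t in range(len(episode) - 1, -1, -1):
            state, reward = episode[t]
            g = gamma * g + reward
            if first_visit[state] == t:
                returns_sum[state] += g
                visit_count[state] += 1
    return {s: returns_sum[s] / visit_count[s] for s in returns_sum}


# Hypothetical episodes; state = (player_sum, dealer_card, usable_ace).
episodes = [
    [((13, 10, False), 0.0), ((20, 10, False), 1.0)],  # hit, then stick and win
    [((13, 10, False), -1.0)],                         # hit and go bust
    [((20, 10, False), 1.0)],                          # stick and win
]
values = first_visit_mc_prediction(episodes)
```

In a real experiment the episodes would come from simulating many Blackjack hands under a fixed policy rather than from a hand-written list.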


Temporal Difference methods

Temporal difference (TD) methods, including:

SARSA(0)
SARSAMAX (Q-learning)
Expected SARSA
Some experiments with TD methods
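The three methods listed above share one update rule and differ only in how they bootstrap from the next state. The sketch below shows that difference for a single transition. The helper names (`epsilon_greedy_probs`, `td_target`, `td_update`) and the numbers are hypothetical, and an epsilon-greedy behavior policy is assumed for Expected SARSA.

```python
def epsilon_greedy_probs(q_values, epsilon):
    """Action probabilities of an epsilon-greedy policy over q_values."""
    n = len(q_values)
    best = max(range(n), key=lambda a: q_values[a])
    probs = [epsilon / n] * n
    probs[best] += 1.0 - epsilon
    return probs


def td_target(method, reward, next_q, gamma, next_action=None, epsilon=0.1):
    """One-step TD target for the given control method."""
    if method == "sarsa":
        # Bootstrap from the action actually taken in the next state.
        bootstrap = next_q[next_action]
    elif method == "q_learning":
        # Bootstrap from the greedy action (Sarsamax).
        bootstrap = max(next_q)
    elif method == "expected_sarsa":
        # Bootstrap from the expectation under the epsilon-greedy policy.
        probs = epsilon_greedy_probs(next_q, epsilon)
        bootstrap = sum(p * q for p, q in zip(probs, next_q))
    else:
        raise ValueError(f"unknown method: {method}")
    return reward + gamma * bootstrap


def td_update(q_sa, target, alpha):
    """Move Q(s, a) a fraction alpha of the way toward the TD target."""
    return q_sa + alpha * (target - q_sa)


next_q = [1.0, 3.0]  # hypothetical Q-values of the next state
sarsa_target = td_target("sarsa", 1.0, next_q, gamma=0.5, next_action=0)
q_target = td_target("q_learning", 1.0, next_q, gamma=0.5)
expected_target = td_target("expected_sarsa", 1.0, next_q, gamma=0.5, epsilon=0.5)
new_q = td_update(0.0, q_target, alpha=0.5)
```

With these numbers the targets are 1.5 for SARSA, 2.5 for Q-learning, and 2.25 for Expected SARSA, so the same transition moves the estimate by a different amount under each method.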


Value Function Approximation with Non-Linear networks

Approximating the state-action value function (Q-function) with deep neural networks.

Deep Q Network
Double Deep Q network
Dueling Deep Q Network
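The three variants differ mainly in how the learning target is formed and how the network output is structured. The sketch below shows those differences on plain Python lists, with no neural network involved. The function names and the example Q-values are hypothetical, and the dueling aggregation uses the common mean-subtracted form, which may not match what the project's code does.

```python
def dqn_target(reward, next_q_target, gamma, done):
    """DQN target: bootstrap from the target network's maximum Q-value."""
    if done:
        return reward
    return reward + gamma * max(next_q_target)


def double_dqn_target(reward, next_q_online, next_q_target, gamma, done):
    """Double DQN: the online network picks the action, the target network scores it."""
    if done:
        return reward
    best = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    return reward + gamma * next_q_target[best]


def dueling_q_values(state_value, advantages):
    """Dueling head: Q(s, a) = V(s) + A(s, a) - mean over actions of A(s, a)."""
    mean_adv = sum(advantages) / len(advantages)
    return [state_value + a - mean_adv for a in advantages]


# Hypothetical network outputs for the next state.
next_q_online = [2.0, 1.0]
next_q_target = [0.5, 4.0]

y_dqn = dqn_target(1.0, next_q_target, gamma=0.5, done=False)
y_double = double_dqn_target(1.0, next_q_online, next_q_target, gamma=0.5, done=False)
q_dueling = dueling_q_values(2.0, [1.0, 2.0, 3.0])
```

Here the DQN target is 3.0 while the Double DQN target is 1.25, because the online network prefers action 0 and the target network values that action at only 0.5. Decoupling selection from evaluation in this way is what reduces overestimation.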


Policy Based DRL methods

Policy function approximation with deep neural networks.

Optimization methods such as hill climbing and the cross-entropy method.
Gradient-based methods:

Policy Gradient
Proximal Policy Optimization (PPO)
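For the gradient-based methods, the quantities being optimized can be sketched without a neural network. The example below computes discounted returns, the REINFORCE-style policy gradient loss, and the clipped surrogate objective used by PPO for a single sample. All function names, the clip range of 0.2, and the numeric inputs are assumptions chosen for illustration; in practice the log-probabilities would come from a policy network and the loss would be differentiated by an autograd library.

```python
import math


def discounted_returns(rewards, gamma):
    """Return G_t = r_t + gamma * G_{t+1} for every step of one episode."""
    returns = [0.0] * len(rewards)
    g = 0.0
    for t in range(len(rewards) - 1, -1, -1):
        g = rewards[t] + gamma * g
        returns[t] = g
    return returns


def reinforce_loss(log_probs, returns):
    """Policy gradient loss: minus the sum of log pi(a_t | s_t) * G_t."""
    return -sum(lp * g for lp, g in zip(log_probs, returns))


def ppo_clipped_objective(new_log_prob, old_log_prob, advantage, clip_eps=0.2):
    """PPO clipped surrogate for one sample (to be maximized)."""
    ratio = math.exp(new_log_prob - old_log_prob)
    clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    # Take the more pessimistic of the unclipped and clipped terms.
    return min(ratio * advantage, clipped_ratio * advantage)


returns = discounted_returns([1.0, 1.0, 1.0], gamma=0.5)
loss = reinforce_loss([-0.5, -1.0, -2.0], returns)

# The probability ratio is 1.5 in both calls; only the advantage sign differs.
gain_pos = ppo_clipped_objective(math.log(1.5), 0.0, advantage=2.0)
gain_neg = ppo_clipped_objective(math.log(1.5), 0.0, advantage=-2.0)
```

With a positive advantage the objective is capped at the clipped value (about 2.4 here), which removes the incentive to push the ratio further. With a negative advantage the unclipped term (about -3.0) is kept, so a harmful policy change is still fully penalized.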