Krishna Kumar Mishra xkrishnam

## contextualPolicy-n-arm-bandit.ipynb

      
              1 file
            
          
              0 forks
            
          
                3 comments
              
            
              3 stars
            
          
                xkrishnam
                / contextualPolicy-n-arm-bandit.ipynb
            
            
              Last active
              September 29, 2022 06:17
            
              
                tensorflow 2 implementation of Policy gradient method for solving n-armed bandit problems.
              
          
      Loading

      Sorry, something went wrong. Reload?
      Sorry, we cannot display this file.
      Sorry, this file is invalid so it cannot be displayed.
      
          Viewer requires iframe.
      
    
## ContextualPolicy.ipynb

      
              1 file
            
          
              11 forks
            
          
                14 comments
              
            
              18 stars
            
          
                awjuliani
                / ContextualPolicy.ipynb
            
            
              Last active
              October 11, 2022 21:27
            
              
                A Policy-Gradient algorithm that solves Contextual Bandit problems.
              
          
      Loading

      Sorry, something went wrong. Reload?
      Sorry, we cannot display this file.
      Sorry, this file is invalid so it cannot be displayed.
      
          Viewer requires iframe.
      
    
## Q-Table Learning-Clean.ipynb

      
              1 file
            
          
              30 forks
            
          
                5 comments
              
            
              39 stars
            
          
                awjuliani
                / Q-Table Learning-Clean.ipynb
            
            
              Last active
              October 25, 2022 07:57
            
              
                Q-Table learning in OpenAI grid world.
              
          
      Loading

      Sorry, something went wrong. Reload?
      Sorry, we cannot display this file.
      Sorry, this file is invalid so it cannot be displayed.
      
          Viewer requires iframe.