Krishna Kumar Mishra xkrishnam

## contextualPolicy-n-arm-bandit.ipynb

      
        
          
            
              
              1 file
            
          
          
            
              
              0 forks
            
          
          
            
              
              3 comments
            
          
          
            
              
              3 stars
            
          
        
        
          
              
          
          
            
                xkrishnam
                / contextualPolicy-n-arm-bandit.ipynb
            
            
              Last active
              September 29, 2022 06:17
            
              
                tensorflow 2 implementation of Policy gradient method for solving n-armed bandit problems.
              
          
        
      
        
  
    
    

          
    
        Loading

      Sorry, something went wrong. Reload?
      Sorry, we cannot display this file.
      Sorry, this file is invalid so it cannot be displayed.
      
          Viewer requires iframe.