Andrei Muntean andreimuntean

## README.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                andreimuntean
                / README.md
            
            
              Created
              March 12, 2017 00:09
            
          
    A3C

Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model written in TensorFlow.
This AI does not rely on hand-engineered rules or features. Instead, it masters the environment by looking at raw pixels and learning from experience, just as humans do.
For Pong, an average score of 18 was reached in 72 hours of training on an 8-core CPU. Training and evaluation code is available at github.com/andreimuntean/a3c.
Dependencies


OpenAI Gym 0.8
TensorFlow 1.0


## README.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                andreimuntean
                / README.md
            
            
              Created
              March 11, 2017 19:45
            
          
    Deep Q-Learning

Deep reinforcement learning for environments with small state spaces.
The CartPole Gym environment was mastered in just a few minutes of training. Training and evaluation code is available at github.com/andreimuntean/deep-q-learning-lite.
Dependencies


OpenAI Gym 0.8
TensorFlow 1.0

Learning Environment


## README.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              1 star
            
          
                andreimuntean
                / README.md
            
            
              Last active
              April 15, 2018 12:50
            
          
    A3C

Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model written in TensorFlow.
This AI does not rely on hand-engineered rules or features. Instead, it masters the environment by looking at raw pixels and learning from experience, just as humans do.
The Flappy Bird Gym environment was mastered in 48 hours of training using a CPU with 8 cores. Training and evaluation code is available at github.com/andreimuntean/a3c.
Dependencies


OpenAI Gym 0.8
TensorFlow 1.0


## TensorFlow_Windows.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                andreimuntean
                / TensorFlow_Windows.md
            
            
              Last active
              October 8, 2016 19:24
                — forked from ericjang/TensorFlow_Windows.md
            
              
                Setting up TensorFlow on Windows using Docker.
              
          
    TensorFlow development environment on Windows using Docker

Here are instructions to set up TensorFlow dev environment on Docker if you are running Windows, and configure it so that you can access Jupyter Notebook from within the VM + edit files in your text editor of choice on your Windows machine.
Installation

First, install
https://www.docker.com/docker-toolbox
Since this is Windows, creating the Docker group "docker" is not necessary.