Steven Kapturowski steveKapturowski

## readme.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                steveKapturowski
                / readme.md
            
            
              Created
              June 23, 2017 02:32
            
              
                PGQ
              
          
    Produced using PGQ implementation from tensorflow-rl
at commit daf75a33a20aae461a63c5b650b61216117b3f7b. Evaluation generated from checkpoint at 260M agent steps
To reproduce run:
python main.py Boxing-v0 \
	--initial_lr 0.00025 \
	--momentum .99 \
	--clip_norm_type ignore \
	--frame_skip 2 4 \


## readme.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                steveKapturowski
                / readme.md
            
            
              Created
              June 22, 2017 14:40
            
              
                PGQ
              
          
    Produced using PGQ implementation from tensorflow-rl
at commit daf75a33a20aae461a63c5b650b61216117b3f7b. Evaluation generated from checkpoint at 160M agent steps
To reproduce run:
python main.py Boxing-v0 \
	--initial_lr 0.00025 \
	--momentum .99 \
	--clip_norm_type ignore \
	--frame_skip 2 4 \


## readme.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                steveKapturowski
                / readme.md
            
            
              Created
              June 22, 2017 14:38
            
              
                Async DQN+CTS
              
          
    Produced using DQN+CTS implementation from tensorflow-rl
at commit daf75a33a20aae461a63c5b650b61216117b3f7b. Evaluation generated from checkpoint at 18M agent steps using epsilon of .01
To reproduce run:
python main.py Freeway-v0 \
	--load_config config/dqn-cts.yaml \
	--epsilon_annealing_steps 200000 \
	--clip_norm_type ignore \
	--q_target_update_steps 20000 \


## readme.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                steveKapturowski
                / readme.md
            
            
              Created
              June 22, 2017 07:25
            
              
                PGQ
              
          
    Produced using PGQ implementation from tensorflow-rl
at commit daf75a33a20aae461a63c5b650b61216117b3f7b. Evaluation generated from checkpoint at 300M agent steps
To reproduce run:
python main.py WizardOfWor-v0 \
	--initial_lr 0.0007 \
	--momentum .99 \
	--clip_norm_type ignore \
	--frame_skip 2 4 \


## readme.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                steveKapturowski
                / readme.md
            
            
              Created
              June 22, 2017 06:10
            
              
                PGQ
              
          
    Produced using PGQ implementation from tensorflow-rl
at commit daf75a33a20aae461a63c5b650b61216117b3f7b. Evaluation generated from checkpoint at 80M agent steps
To reproduce run:
python main.py Boxing-v0 \
	--initial_lr 0.00025 \
	--momentum .99 \
	--clip_norm_type ignore \
	--frame_skip 2 4 \


## readme.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                steveKapturowski
                / readme.md
            
            
              Created
              June 22, 2017 03:50
            
              
                PGQ
              
          
    Produced using PGQ implementation from tensorflow-rl
at commit daf75a33a20aae461a63c5b650b61216117b3f7b. Evaluation generated from checkpoint at 56M agent steps
To reproduce run:
python main.py Boxing-v0 \
	--initial_lr 0.00025 \
	--momentum .99 \
	--clip_norm_type ignore \
	--frame_skip 2 4 \


## readme.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                steveKapturowski
                / readme.md
            
            
              Last active
              June 21, 2017 19:47
            
              
                PGQ
              
          
    Produced using PGQ implementation from tensorflow-rl
at commit daf75a33a20aae461a63c5b650b61216117b3f7b. Evaluation generated from checkpoint at 300M agent steps
To reproduce run:
python main.py YarsRevenge-v0 \
	--initial_lr 0.0007 \
	--momentum .99 \
	--clip_norm_type ignore \
	--frame_skip 2 4 \


## readme.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                steveKapturowski
                / readme.md
            
            
              Last active
              June 21, 2017 04:39
            
              
                PGQ
              
          
    Produced using PGQ implementation from tensorflow-rl
at commit daf75a33a20aae461a63c5b650b61216117b3f7b. Evaluation generated from checkpoint at 180M agent steps
To reproduce run:
python main.py YarsRevenge-v0 \
	--initial_lr 0.0007 \
	--momentum .99 \
	--clip_norm_type ignore \
	--frame_skip 2 4 \


## readme.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                steveKapturowski
                / readme.md
            
            
              Last active
              June 21, 2017 04:38
            
              
                PGQ
              
          
    Produced using PGQ implementation from tensorflow-rl
at commit daf75a33a20aae461a63c5b650b61216117b3f7b. Evaluation generated from checkpoint at 80M agent steps
To reproduce run:
python main.py YarsRevenge-v0 \
	--initial_lr 0.0007 \
	--momentum .99 \
	--clip_norm_type ignore \
	--frame_skip 2 4 \


## readme.md

      
              1 file
            
          
              0 forks
            
          
              0 comments
            
          
              0 stars
            
          
                steveKapturowski
                / readme.md
            
            
              Created
              June 19, 2017 20:16
            
              
                A3C-LSTM
              
          
    Generated using a3c-lstm implementation from https://github.com/steveKapturowski/tensorflow-rl
Trained for 75M agent steps with 16 agents and --max_local_steps=20 --td_lambda=.97