Skip to content

Instantly share code, notes, and snippets.

View steveKapturowski's full-sized avatar

Steven Kapturowski steveKapturowski

  • N/A
  • United States
View GitHub Profile

Produced using PGQ implementation from tensorflow-rl at commit daf75a33a20aae461a63c5b650b61216117b3f7b. Evaluation generated from checkpoint at 260M agent steps

To reproduce run:

python main.py Boxing-v0 \
	--initial_lr 0.00025 \
	--momentum .99 \
	--clip_norm_type ignore \
	--frame_skip 2 4 \

Produced using PGQ implementation from tensorflow-rl at commit daf75a33a20aae461a63c5b650b61216117b3f7b. Evaluation generated from checkpoint at 160M agent steps

To reproduce run:

python main.py Boxing-v0 \
	--initial_lr 0.00025 \
	--momentum .99 \
	--clip_norm_type ignore \
	--frame_skip 2 4 \
@steveKapturowski
steveKapturowski / readme.md
Created June 22, 2017 14:38
Async DQN+CTS

Produced using DQN+CTS implementation from tensorflow-rl at commit daf75a33a20aae461a63c5b650b61216117b3f7b. Evaluation generated from checkpoint at 18M agent steps using epsilon of .01

To reproduce run:

python main.py Freeway-v0 \
	--load_config config/dqn-cts.yaml \
	--epsilon_annealing_steps 200000 \
	--clip_norm_type ignore \
	--q_target_update_steps 20000 \

Produced using PGQ implementation from tensorflow-rl at commit daf75a33a20aae461a63c5b650b61216117b3f7b. Evaluation generated from checkpoint at 300M agent steps

To reproduce run:

python main.py WizardOfWor-v0 \
	--initial_lr 0.0007 \
	--momentum .99 \
	--clip_norm_type ignore \
	--frame_skip 2 4 \

Produced using PGQ implementation from tensorflow-rl at commit daf75a33a20aae461a63c5b650b61216117b3f7b. Evaluation generated from checkpoint at 80M agent steps

To reproduce run:

python main.py Boxing-v0 \
	--initial_lr 0.00025 \
	--momentum .99 \
	--clip_norm_type ignore \
	--frame_skip 2 4 \

Produced using PGQ implementation from tensorflow-rl at commit daf75a33a20aae461a63c5b650b61216117b3f7b. Evaluation generated from checkpoint at 56M agent steps

To reproduce run:

python main.py Boxing-v0 \
	--initial_lr 0.00025 \
	--momentum .99 \
	--clip_norm_type ignore \
	--frame_skip 2 4 \

Produced using PGQ implementation from tensorflow-rl at commit daf75a33a20aae461a63c5b650b61216117b3f7b. Evaluation generated from checkpoint at 300M agent steps

To reproduce run:

python main.py YarsRevenge-v0 \
	--initial_lr 0.0007 \
	--momentum .99 \
	--clip_norm_type ignore \
	--frame_skip 2 4 \

Produced using PGQ implementation from tensorflow-rl at commit daf75a33a20aae461a63c5b650b61216117b3f7b. Evaluation generated from checkpoint at 180M agent steps

To reproduce run:

python main.py YarsRevenge-v0 \
	--initial_lr 0.0007 \
	--momentum .99 \
	--clip_norm_type ignore \
	--frame_skip 2 4 \

Produced using PGQ implementation from tensorflow-rl at commit daf75a33a20aae461a63c5b650b61216117b3f7b. Evaluation generated from checkpoint at 80M agent steps

To reproduce run:

python main.py YarsRevenge-v0 \
	--initial_lr 0.0007 \
	--momentum .99 \
	--clip_norm_type ignore \
	--frame_skip 2 4 \