- Install simple_dqn.
- Run
./train.sh Breakout-v0 --evironment gym
. - Check
results/Breakout-v0.csv
for best performing epoch (in my case it was 61). - Run
./test_gym.sh snapshots/Breakout-v0_61.pkl
(replace 61 with your best epoch). - Optional: run
./upload_gym.sh results/Breakout-v0 --api_key <your_key>
to upload the results.
The Simple DQN implementation uses network architecture and hyperparameters from DeepMind Nature paper.