
@tambetm
Last active October 22, 2016 18:16
  1. Install simple_dqn.
  2. Run ./train.sh Pong-v0 --environment gym.
  3. Check results/Pong-v0.csv for best performing epoch (in my case it was 81).
  4. Run ./test_gym.sh snapshots/Pong-v0_81.pkl (replace 81 with your best epoch).
  5. Optional: run ./upload_gym.sh results/Pong-v0 --api_key <your_key> to upload the results.

The Simple DQN implementation uses the network architecture and hyperparameters from the DeepMind Nature paper (Mnih et al., 2015).
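For reference, the Nature architecture takes 4 stacked 84x84 grayscale frames through three convolutional layers into a 512-unit fully connected layer. The layer shapes can be checked with a few lines of arithmetic; this is a shape sketch only, not the simple_dqn code:

```python
def conv_out(size, kernel, stride):
    # output size of a "valid" convolution along one spatial dimension
    return (size - kernel) // stride + 1

# Nature DQN (Mnih et al., 2015): input is 4 stacked 84x84 frames.
s = 84
s = conv_out(s, 8, 4)   # conv1: 32 filters, 8x8, stride 4 -> 20x20
s = conv_out(s, 4, 2)   # conv2: 64 filters, 4x4, stride 2 -> 9x9
s = conv_out(s, 3, 1)   # conv3: 64 filters, 3x3, stride 1 -> 7x7
flat = s * s * 64       # 3136 inputs to the 512-unit fully connected layer
```

The final fully connected layer then maps the 512 hidden units to one Q-value per action (6 actions for Pong-v0).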
