Last active
July 8, 2022 08:29
Deep-Q learning implementation in Tensorflow and Keras (solving CartPole-v0)
Hi dylan, HaiyangChen
I'm not associated with yingzwang, but I can give some information. This is an implementation of the DQN algorithm (https://deepmind.com/research/dqn/), so the architecture is essentially the same as the one presented in the paper. The difference is a soft update of the target network's weights, using an exponential moving average parameterized by tau. She also uses an exploration strategy that decreases over time, which clearly helps in this problem. The rest is just good hyperparameter tuning.
The code is also very good, with good coding practices all around.
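To make the two ideas above concrete, here is a minimal sketch in plain Python. It is not taken from the gist. The function names (`soft_update`, `decayed_epsilon`, `epsilon_greedy`), the multiplicative decay schedule, and the default values are all my own assumptions for illustration, and weights are shown as flat lists of floats rather than Keras tensors.

```python
import random


def soft_update(target_weights, online_weights, tau):
    """Exponential moving average of the online weights.

    Each target weight moves a fraction `tau` of the way toward the
    corresponding online weight: target <- tau * online + (1 - tau) * target.
    Weights are flat lists of floats here (an assumption for illustration).
    """
    return [tau * w + (1.0 - tau) * t
            for t, w in zip(target_weights, online_weights)]


def decayed_epsilon(step, eps_start=1.0, eps_min=0.01, decay=0.995):
    """Exploration rate after `step` decay steps.

    Assumes a multiplicative schedule with a floor at `eps_min`; the gist
    may use a different schedule.
    """
    return max(eps_min, eps_start * decay ** step)


def epsilon_greedy(q_values, epsilon, rng=random):
    """Pick a random action with probability `epsilon`, else the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


if __name__ == "__main__":
    target = [0.0, 0.0]
    online = [1.0, 2.0]
    # With a small tau the target network trails the online network slowly.
    target = soft_update(target, online, tau=0.1)
    print(target)
    print(decayed_epsilon(0), decayed_epsilon(100))
    print(epsilon_greedy([0.1, 0.9], epsilon=0.0))
```

With Keras models, the same update could presumably be applied to the arrays returned by `get_weights()` and written back with `set_weights()`. A small tau keeps the bootstrap targets stable, while the decaying epsilon shifts the agent from exploring to exploiting as training goes on.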