Uses A3C (Asynchronous Advantage Actor Critic). The result only shows the performance of one of 8 A3C threads. A separate network is used for the actor and another one for the critic. The code can be found in my Github repo.
Last active
May 28, 2017 20:02
-
-
Save arnomoonens/00cfdd0b4e32ab96e0a146be0943ff7e to your computer and use it in GitHub Desktop.
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment