Skip to content

Instantly share code, notes, and snippets.

@chuchro3
Created February 24, 2017 21:14
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save chuchro3/d444891c7c737ee686df324addf3e199 to your computer and use it in GitHub Desktop.
Save chuchro3/d444891c7c737ee686df324addf3e199 to your computer and use it in GitHub Desktop.
FrozenLake-v0
Model free Q-Learning in an MDP style environment
Utilized code from Berkeley's CS188 Reinforcement Learning project
Introduced an epsilon decay to offer a transition between early exploration and late exploitation
QLearning paramters:
alpha = 0.1
epsilon = 1.0
gamma = .99
epsilon_decay = .9995
learning_decay = 1.0
Program paramters:
python openai.py -v FrozenLake-v0
-a 0.1
-e 1.0
-g .99
--learningDecay 1.0
--explorationDecay .9995
-x 10000
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment