@Andrew-Pynch
Created March 22, 2020 02:05
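# Q-learning update: blend the old estimate with the discounted target.
new_q = (1 - LEARNING_RATE) * current_q + LEARNING_RATE * (reward + DISCOUNT * max_future_q)
# Store the updated estimate, not the raw reward, so new_q is actually used.
q_table[observation][action] = new_q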
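
The snippet above relies on names defined elsewhere (`current_q`, `max_future_q`, `q_table`, and the two constants). Below is a minimal, self-contained sketch of how they might fit together. It is not the gist's original surrounding code: the constant values, the `q_update` helper, the `next_observation` parameter, and the dict-of-lists table layout are all assumptions made for illustration.

```python
# Assumed hyperparameters; the gist does not state their values.
LEARNING_RATE = 0.1
DISCOUNT = 0.95


def q_update(q_table, observation, action, reward, next_observation):
    """Apply one tabular Q-learning update in place and return the new value.

    q_table is assumed to map each observation to a list of per-action values.
    """
    current_q = q_table[observation][action]
    # Best value reachable from the next state under the current estimates.
    max_future_q = max(q_table[next_observation])
    new_q = (1 - LEARNING_RATE) * current_q + LEARNING_RATE * (reward + DISCOUNT * max_future_q)
    q_table[observation][action] = new_q
    return new_q


# Hypothetical two-state, two-action table.
q_table = {0: [0.0, 0.0], 1: [1.0, 2.0]}
updated = q_update(q_table, observation=0, action=1, reward=1.0, next_observation=1)
print(updated)  # approximately 0.29
```

With these assumed values the target is `1.0 + 0.95 * 2.0 = 2.9`, and the old estimate of 0 moves one tenth of the way toward it.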