@msaroufim
Last active October 27, 2019 23:43
# src/Reinforce.jl
abstract type AbstractPolicy end
"""
    a = action(policy, r, s, A)

Take in the last reward `r`, current state `s`,
and set of valid actions `A = actions(env, s)`,
then return the next action `a`.

Note that a policy could do a 'sarsa-style' update simply by saving the last state and action `(s,a)`.
"""
function action end
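
# A minimal sketch of the 'sarsa-style' idea from the docstring above, assuming
# discrete, hashable states and actions. `SarsaPolicy`, its fields, and its
# keyword constructor are hypothetical names used for illustration only; they
# are not part of Reinforce.jl. The policy remembers the previous `(s, a)` pair
# so that each call to `action` can update that pair's value estimate with the
# reward just received before returning the next action.
mutable struct SarsaPolicy <: AbstractPolicy
    Q::Dict{Any,Float64}  # tabular action-value estimates, keyed by (s, a)
    alpha::Float64        # learning rate
    gamma::Float64        # discount factor
    epsilon::Float64      # probability of picking a random action
    last::Any             # previous (s, a) pair, or `nothing` before the first step
end

SarsaPolicy(; alpha = 0.1, gamma = 0.9, epsilon = 0.1) =
    SarsaPolicy(Dict{Any,Float64}(), alpha, gamma, epsilon, nothing)

function action(p::SarsaPolicy, r, s, A)
    q(x, y) = get(p.Q, (x, y), 0.0)  # unseen pairs default to 0.0
    as = collect(A)

    # Epsilon-greedy choice of the next action in state `s`.
    a = rand() < p.epsilon ? rand(as) : as[argmax([q(s, x) for x in as])]

    # SARSA update for the previous pair. On the first call there is no
    # previous pair, so `r` is ignored.
    if p.last !== nothing
        s0, a0 = p.last
        p.Q[(s0, a0)] = q(s0, a0) + p.alpha * (r + p.gamma * q(s, a) - q(s0, a0))
    end

    p.last = (s, a)
    return a
end

# Usage sketch (resetting `last` between episodes is left out for brevity):
#   p = SarsaPolicy(epsilon = 0.0)
#   a = action(p, 0.0, 1, [:left, :right])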