@pragatibaheti
Last active March 13, 2020 02:10
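import numpy as np

# Run 700 training episodes. Q, gamma, available_actions, sample_next_action
# and update are not defined in this snippet and must exist beforehand.
scores = []
for i in range(700):
    # Start each episode from a randomly chosen state (a row index of Q).
    current_state = np.random.randint(0, int(Q.shape[0]))
    available_act = available_actions(current_state)
    action = sample_next_action(available_act)
    # Update Q for this (state, action) pair and record the returned score.
    score = update(current_state, action, gamma)
    scores.append(score)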
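
The loop above depends on names the snippet does not define. As a rough, self-contained sketch of how those pieces might fit together, the block below assumes a small hypothetical reward matrix `R` (with -1 marking a disallowed move), a discount factor of 0.8, and a score defined as the normalized sum of `Q`. The helper bodies are illustrative guesses and not necessarily the author's implementation.

```python
import numpy as np

# Hypothetical 4-state environment: R[s, a] is the reward for moving from
# state s to state a; -1 marks a move that is not allowed.
R = np.array([
    [-1,  0, -1,  -1],
    [ 0, -1,  0,  -1],
    [-1,  0, -1, 100],
    [-1, -1,  0, 100],
])
Q = np.zeros(R.shape)
gamma = 0.8  # assumed discount factor
rng = np.random.default_rng(0)


def available_actions(state):
    """Return the actions allowed from `state` (entries of R that are >= 0)."""
    return np.flatnonzero(R[state] >= 0)


def sample_next_action(available_act):
    """Pick one of the allowed actions uniformly at random."""
    return int(rng.choice(available_act))


def update(current_state, action, gamma):
    """Apply one Q-learning update and return a normalized score for Q."""
    # In this setup the chosen action is also the next state.
    Q[current_state, action] = R[current_state, action] + gamma * Q[action].max()
    q_max = Q.max()
    return float(Q.sum() / q_max * 100) if q_max > 0 else 0.0


scores = []
for _ in range(700):
    current_state = int(rng.integers(0, Q.shape[0]))
    action = sample_next_action(available_actions(current_state))
    scores.append(update(current_state, action, gamma))
```

Plotting `scores` against the episode index is a simple way to check that the values in `Q` level off as training proceeds.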