Skip to content

Instantly share code, notes, and snippets.

@awjuliani
Last active May 2, 2018 18:55
Show Gist options
  • Save awjuliani/86ae316a231bceb96a3e2ab3ac8e646a to your computer and use it in GitHub Desktop.
Save awjuliani/86ae316a231bceb96a3e2ab3ac8e646a to your computer and use it in GitHub Desktop.
Reinforcement Learning Tutorial 2 (Cart Pole problem)
Display the source blob
Display the rendered blob
Raw
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@LoveDLWujing
Copy link

log(P(y|x)) = (1-input_y)log(probability) + input_ylog(1-probability) , and the loss function in above code happened to the same result as this maximum likelihood.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment