Skip to content

Instantly share code, notes, and snippets.

@d0znpp
Created December 12, 2017 00:46
Show Gist options
  • Save d0znpp/35989385d1db1fa2a7215a00cdd07589 to your computer and use it in GitHub Desktop.
Save d0znpp/35989385d1db1fa2a7215a00cdd07589 to your computer and use it in GitHub Desktop.
def store_rollout(self, state, reward):
self.reward_buffer.append(reward)
self.state_buffer.append(state[0])
def train_step(self, steps_count):
states = np.array(self.state_buffer[-steps_count:])/self.division_rate
rewars = self.reward_buffer[-steps_count:]
_, ls = self.sess.run([self.train_op, self.loss],
{self.states: states,
self.discounted_rewards: rewars})
return ls
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment