Skip to content

Instantly share code, notes, and snippets.

Embed
What would you like to do?
counter = tf.Variable(0)
agent = DqnAgent(
train_env.time_step_spec(),
train_env.action_spec(),
q_network = q_network,
optimizer = tf.compat.v1.train.AdamOptimizer(learning_rate=1e-3),
td_errors_loss_fn = common.element_wise_squared_loss,
train_step_counter = counter)
agent.initialize()
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment