chuchro3/cartpole

## cartpole
Tabular Q-Learning on CartPole-v1:

Utilized code from Berkeley's CS188 Q-Learning project

Discretized the state space from continuous values

#cart_x, cart_velocity, pole_theta, pole_velocity
[5,10,20,10]

Introduced an epsilon decay to offer a transition between early exploration and late exploitation

Q-Learning Parameters:

epsilon = 1.0
alpha = .4
gamma = .997
epsilon_decay = .997
alpha_decay = 1.0
	Tabular Q-Learning on CartPole-v1:

	Utilized code from Berkeley's CS188 Q-Learning project

	Discretized the state space from continuous values

	#cart_x, cart_velocity, pole_theta, pole_velocity
	[5,10,20,10]

	Introduced an epsilon decay to offer a transition between early exploration and late exploitation

	Q-Learning Parameters:

	epsilon = 1.0
	alpha = .4
	gamma = .997
	epsilon_decay = .997
	alpha_decay = 1.0