
Jesse Cooper JKCooper2

  • EthicalJobs.com.au
  • Melbourne, Australia
JKCooper2 / utility.cls
Last active September 19, 2017 00:33
APEX sObject sObjectHasProperty, sObjectHasPath, getSObjectPath (including default)
/*
Provides utility functions for checking/accessing sObject properties
Updates:
- Namespace no longer required
- Replaced getPopulatedFieldsAsMap() with a plain try/catch to improve speed
- Allowed for returning SObjects
- Throws SObjectException where the field exists but wasn't queried
To Do:
*/

Random agent with action_space value checking

import numpy as np

class Discrete:
    def __init__(self, values):
        self.values = values
        self.max = np.prod(self.values)

    def __validate(self, observation):
        for i in range(len(self.values)):
            pass  # loop body truncated in gist preview
JKCooper2 / README
Created June 10, 2016 01:05
Hill Climbing Linear Model w/ Biased Update
Alteration to [standard hill climbing model](https://gym.openai.com/algorithms/alg_WKinUO3TNabzwPeaD7A)
Uses a biased update that allows worse performance to become the new standard with reduced probability
For the CartPole environment this should result in a larger percentage of runs solving the problem
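The biased acceptance rule described above can be sketched as follows; `biased_accept` and the exponential probability schedule are assumptions for illustration, not the gist's actual code:

```python
import math
import random

def biased_accept(new_reward, best_reward, temperature=1.0):
    """Accept a candidate if it scores at least as well as the best so far,
    or with a reduced probability when it scores worse (hypothetical sketch)."""
    if new_reward >= best_reward:
        return True
    # Worse candidates are accepted with a probability that shrinks
    # as the performance gap grows.
    gap = best_reward - new_reward
    return random.random() < math.exp(-gap / temperature)
```

Letting occasional worse candidates through is what helps the model escape local optima that plain hill climbing gets stuck in.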
JKCooper2 / README
Last active June 15, 2016 12:35
Linear Model Hill Climbing for CartPole
For part 1 of https://openai.com/requests-for-research/#cartpole
Quite often it doesn't solve (because of local minima)
JKCooper2 / README
Created June 9, 2016 08:30
Cartpole: for newcomers to RL - Part 1
For Section 1: https://openai.com/requests-for-research/#cartpole
Requirements of the environment for the algorithm to work:
- Action space has two discrete actions
- A ratio of the observations can determine the best action to take
# CARTPOLE MULTI AGENT
# Set up to allow for using a pool of agents
import logging
import gym
from CrossEntropyMethod import CrossEntropyMethodPool
import gym.scoreboard.scoring
import gym.monitoring.monitor
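The requirement above, that a ratio of the observations decides the best action, suggests a linear policy over the observation vector. A minimal sketch (function name and weights are hypothetical):

```python
import numpy as np

def linear_action(observation, weights):
    """Pick one of two discrete actions from the sign of a weighted
    sum of the observation components."""
    return 1 if np.dot(weights, observation) > 0 else 0
```

For CartPole this maps the four observation values through a single weight vector, so searching over `weights` (hill climbing, annealing, etc.) is enough to solve the environment.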
Unachievable score resulting from the action returning 20 rather than 2
The environment currently sets velocity += (action-1)*0.001 + math.cos(3*position)*(-0.0025) without a bounds check on the input
Will look at adding a range check to the environment
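The quoted velocity update can be reproduced directly to see why an out-of-range action inflates the score; `step_velocity` is an illustrative stand-in, not the environment's actual code:

```python
import math

def step_velocity(velocity, position, action):
    """MountainCar-style velocity update as quoted above; note there is
    no bounds check on `action`, so action=20 gives a 19x larger push
    than the maximum legal action of 2."""
    return velocity + (action - 1) * 0.001 + math.cos(3 * position) * (-0.0025)
```

With `action=2` the push term is 0.001, while `action=20` yields 0.019, letting the car climb the hill far faster than intended.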
JKCooper2 / Acrobot-v0.py
Last active June 30, 2016 21:55
[Open AI] Acrobot-v0 Simulated Annealing v1
import logging
import gym
from SimulatedAnnealing import SimulatedAnnealingAgent

def main():
    logger = logging.getLogger()
    logger.setLevel(logging.DEBUG)
    env = gym.make('Acrobot-v0')
JKCooper2 / CartPole-v0.py
Last active November 1, 2018 18:03
[Open AI] CartPole-v0 - Simulated Annealing v0
import logging
import gym
from SimulatedAnnealing import SimulatedAnnealingAgent

def main():
    logger = logging.getLogger()
    logger.setLevel(logging.DEBUG)
    env = gym.make('CartPole-v0')
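A minimal sketch of what a `SimulatedAnnealingAgent` like the one imported above might look like; the class name, cooling schedule, and update rule here are assumptions, not the gist's actual implementation:

```python
import numpy as np

class SimulatedAnnealingSketch:
    """Hypothetical stand-in for the SimulatedAnnealingAgent used above."""
    def __init__(self, n_obs, temperature=1.0, cooling=0.95):
        self.weights = np.zeros(n_obs)
        self.temperature = temperature
        self.cooling = cooling
        self.best_reward = -np.inf
        self.best_weights = self.weights.copy()

    def act(self, observation):
        # Two discrete actions from the sign of a weighted sum.
        return 1 if np.dot(self.weights, observation) > 0 else 0

    def end_episode(self, episode_reward):
        # Keep the new weights if they scored at least as well,
        # otherwise fall back to the best weights seen so far.
        if episode_reward >= self.best_reward:
            self.best_reward = episode_reward
            self.best_weights = self.weights.copy()
        # Perturb with noise that shrinks as the temperature cools.
        noise = np.random.randn(len(self.weights)) * self.temperature
        self.weights = self.best_weights + noise
        self.temperature *= self.cooling
```

Each episode the noise scale decays, so the search moves from broad exploration toward fine-tuning around the best weights found.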