
nagataka / math_in_english.md
Created February 28, 2021 15:06
Mathematical expressions in English
nagataka / study_lstm.md
Last active February 6, 2021 01:24
Studying LSTM
nagataka / blocking_maze_env01.py
Last active January 15, 2021 05:33
Blocking Maze for OpenAI Gym
# OpenAI Gym custom environment mimicking the Blocking Maze
# See Sutton and Barto, "Reinforcement Learning: An Introduction",
# Example 8.2: Blocking Maze
from enum import Enum
import sys
import copy
import gym
from gym import error, spaces, utils
from gym.utils import seeding
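The preview above stops at the imports; the overall shape of such an environment is roughly the following (a minimal sketch assuming the 6x9 grid of Example 8.2 — the class name, wall placement, and reward handling are illustrative assumptions, not the gist's actual code):

class BlockingMazeEnv(gym.Env):
    """Grid maze whose wall layout changes after `switch_time` steps."""

    def __init__(self, switch_time=1000):
        self.action_space = spaces.Discrete(4)        # up, right, down, left
        self.observation_space = spaces.Discrete(54)  # 6 rows x 9 columns
        self.switch_time = switch_time
        self.t = 0
        self.pos = (5, 3)  # start cell (row, col), bottom row

    def _walls(self):
        # The wall sits in row 3; its gap moves from the right end
        # to the left end after switch_time steps (the "blocking" event).
        gap = 8 if self.t < self.switch_time else 0
        return {(3, c) for c in range(9) if c != gap}

    def reset(self):
        self.pos = (5, 3)
        return self.pos[0] * 9 + self.pos[1]

    def step(self, action):
        self.t += 1
        dr, dc = [(-1, 0), (0, 1), (1, 0), (0, -1)][action]
        r = min(max(self.pos[0] + dr, 0), 5)
        c = min(max(self.pos[1] + dc, 0), 8)
        if (r, c) not in self._walls():
            self.pos = (r, c)
        done = self.pos == (0, 8)  # goal in the top-right corner
        return self.pos[0] * 9 + self.pos[1], float(done), done, {}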
nagataka / settings.json
Created September 4, 2020 22:55
VS Code settings.json
{
    "python.formatting.provider": "black",
    "python.linting.pylintEnabled": false,
    "python.linting.flake8Enabled": true,
    "python.linting.flake8Args": [
        "--ignore=E501,W503"
    ],
    "python.sortImports.args": [
        "-m 3"
    ],
nagataka / kelly_criterion.py
Created May 11, 2020 05:34
Experiment on a coin-flipping game
import random
import numpy as np

np.random.seed(0)

def kerri(p, b):
    """Kelly fraction f* = (p*(b + 1) - 1) / b.

    https://en.wikipedia.org/wiki/Kelly_criterion
    """
    return (p * (b + 1) - 1) / b

N = 300
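Continuing from that snippet, the Kelly fraction plausibly drives a betting loop like the one below (a sketch — the bankroll handling and the p = 0.6, even-odds setup are illustrative assumptions, not the gist's actual experiment):

# Bet the Kelly fraction of the current bankroll on each of N biased flips.
p, b = 0.6, 1.0  # win probability and net odds (win b per 1 staked)
f = kerri(p, b)  # optimal fraction of wealth to stake, here 0.2
wealth = 100.0
for _ in range(N):
    stake = f * wealth
    if np.random.rand() < p:
        wealth += stake * b  # win: gain b times the stake
    else:
        wealth -= stake      # lose the stake
print(f"final wealth after {N} flips: {wealth:.2f}")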
nagataka / minimal_rllib.py
Created April 21, 2020 22:15
Initial example of using RLlib
import gym
import ray
from ray.rllib.agents.ppo import PPOTrainer, DEFAULT_CONFIG
import pprint as pp
#tune.run(PPOTrainer, config={"env": "Breakout-v0", "use_pytorch": True})
ray.init(num_gpus=1, ignore_reinit_error=True, log_to_driver=False)
# https://github.com/ray-project/ray/blob/master/rllib/agents/ppo/ppo.py#L15
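Picking up from those imports, a training loop in the Ray 0.8-era API (matching the PPOTrainer/DEFAULT_CONFIG import, which predates Ray 2.0) would look roughly like this — a sketch; the environment choice and hyperparameters are illustrative, and the "use_pytorch" flag is taken from the commented-out line above:

config = DEFAULT_CONFIG.copy()
config["num_gpus"] = 1
config["num_workers"] = 2
config["use_pytorch"] = True  # as in the commented-out tune.run() line

trainer = PPOTrainer(config=config, env="CartPole-v0")
for i in range(10):
    result = trainer.train()  # one training iteration
    pp.pprint({"iter": i, "episode_reward_mean": result["episode_reward_mean"]})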
nagataka / notify_slack.sh
Created March 4, 2020 19:28
Send a Slack notification
#!/bin/bash
set -eu
### Incoming WebHooks URL
WEBHOOKURL="https://hooks.slack.com/services/FILL_YOUR_WEBHOOKURL"
### channel
CHANNEL=${CHANNEL:-"#notifications"}
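The preview ends before the actual POST; for reference, the same notification can be sent from Python's standard library (a sketch — the payload follows Slack's Incoming WebHooks format, but the message text is made up):

import json
import urllib.request

payload = {"channel": "#notifications", "text": "job finished"}
req = urllib.request.Request(
    "https://hooks.slack.com/services/FILL_YOUR_WEBHOOKURL",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
urllib.request.urlopen(req)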
nagataka / README.md
Last active November 20, 2019 19:23
README_template.md

The repository is organized as follows:

  • src: Contains the source code for all .... The source code is written in Python and takes advantage of NumPy and Matplotlib. To run a simulation, use the file run_xxxx.py.

  • tools: In this folder you can find some tools for.... With yyy.py you can reproduce the figures found in ().

  • data: All results are saved here once you run a simulation.

  • params: Here you can find the configuration files containing all the parameters (one for each experiment).

nagataka / gym_template.py
Last active April 10, 2020 01:13
A template to start a project using OpenAI Gym with PyTorch
"""A template to implement RL agent with OpenAI Gym
Usage: python ./gym_template.py --env=CarRacing-v0 --algo=policy_gradient --epochs 1
implementation of algorithms need to be ./algorithms/ directory, or change the following line to your env
> algo = import_module('algorithms.'+args.algo)
"""
import argparse
import numpy as np
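The preview cuts off after the imports; given the docstring, the dispatch presumably continues along these lines (a sketch — the argument defaults mirror the usage line, and everything past the quoted import_module call is an assumption):

from importlib import import_module
import gym

parser = argparse.ArgumentParser()
parser.add_argument("--env", default="CarRacing-v0")
parser.add_argument("--algo", default="policy_gradient")
parser.add_argument("--epochs", type=int, default=1)
args = parser.parse_args()

algo = import_module('algorithms.' + args.algo)  # the line quoted in the docstring
env = gym.make(args.env)
# From here, the loaded algorithm module would drive training for args.epochs epochs.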
nagataka / policy_evaluation.py
Created September 5, 2019 02:12
RL book: Grid World example (Figure 4.1)
import gym
import sys
sys.path.append("reinforcement-learning/lib/envs")
import gridworld
import random
import numpy as np
import copy
NUM_EPOCHS = 10000
GAMMA = 1.0
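The preview stops at the constants; iterative policy evaluation for Figure 4.1 follows the standard in-place update V(s) <- sum_a pi(a|s) sum_{s',r} p(s',r|s,a) [r + GAMMA * V(s')]. A sketch of that loop is below — the env.P, env.nS, env.nA, and env.shape attributes follow the dennybritz/reinforcement-learning GridworldEnv that the sys.path line points at, but treat them as assumptions:

env = gridworld.GridworldEnv()
V = np.zeros(env.nS)
policy = np.ones([env.nS, env.nA]) / env.nA  # equiprobable random policy

for _ in range(NUM_EPOCHS):
    delta = 0.0
    for s in range(env.nS):
        v = 0.0
        for a, pi_a in enumerate(policy[s]):
            for prob, next_s, reward, done in env.P[s][a]:
                v += pi_a * prob * (reward + GAMMA * V[next_s])
        delta = max(delta, abs(v - V[s]))
        V[s] = v
    if delta < 1e-8:  # stop early once the value function has converged
        break
print(V.reshape(env.shape))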