Skip to content

Instantly share code, notes, and snippets.

View ArturNiederfahrenhorst's full-sized avatar

Artur Niederfahrenhorst ArturNiederfahrenhorst

View GitHub Profile
@ArturNiederfahrenhorst
ArturNiederfahrenhorst / multiagent.py
Last active August 15, 2022 22:22
Training multiple policies in RLlib, reporting rewards separately @ Ray 2.0
import numpy as np
from pettingzoo.sisl import waterworld_v3
import ray
from ray.tune import CLIReporter
from ray import air, tune
from ray.rllib.algorithms.ppo import PPOConfig
from ray.rllib.env.wrappers.pettingzoo_env import PettingZooEnv
from ray.tune.registry import register_env