Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning

Last update: Jan 06, 2023

Overview

RIIT

Open-source code for RIIT: Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning. We implement numerous QMIX variant algorithms that achieve SOTA results and standardize their hyperparameters.

Python MARL framework

PyMARL is WhiRL's framework for deep multi-agent reinforcement learning and includes implementations of the following algorithms:

Value-based Methods:

Actor Critic Methods:

PyMARL is written in PyTorch and uses SMAC as its environment.

Installation instructions

Install Python packages

# requires Anaconda 3 or Miniconda 3
bash install_dependecies.sh

Set up StarCraft II and SMAC:

bash install_sc2.sh

This downloads SC2 into the 3rdparty folder and copies over the maps needed to run experiments.

Run an experiment

# For SMAC
python3 src/main.py --config=qmix --env-config=sc2 with env_args.map_name=corridor

# For Cooperative Predator-Prey
python3 src/main.py --config=qmix_prey --env-config=stag_hunt with env_args.map_name=stag_hunt

The config files act as defaults for an algorithm or environment.

They are all located in src/config. --config refers to the config files in src/config/algs, and --env-config refers to the config files in src/config/envs.
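To make the lookup and the "defaults" behaviour concrete, here is a minimal Python sketch. It is not the framework's own loader: the helper names resolve_config_paths and apply_override are hypothetical, the .yaml extension is an assumption, and the default values in the usage lines are placeholders. It only illustrates how a name passed to --config or --env-config could map to a file under src/config, and how a key=value pair given after `with` could override a default.

```python
import copy
from pathlib import PurePosixPath

CONFIG_ROOT = PurePosixPath("src/config")


def resolve_config_paths(config, env_config):
    """Map the --config and --env-config names to the files they would refer to.

    The .yaml extension is an assumption made for this sketch.
    """
    return {
        "alg": CONFIG_ROOT / "algs" / f"{config}.yaml",
        "env": CONFIG_ROOT / "envs" / f"{env_config}.yaml",
    }


def apply_override(defaults, dotted_key, value):
    """Return a copy of `defaults` with a dotted key (e.g. env_args.map_name) set."""
    merged = copy.deepcopy(defaults)
    node = merged
    *parents, leaf = dotted_key.split(".")
    for key in parents:
        # Create intermediate dictionaries when the default does not define them.
        node = node.setdefault(key, {})
    node[leaf] = value
    return merged


# Hypothetical defaults, standing in for what an environment config might contain.
env_defaults = {"env_args": {"map_name": "placeholder_map", "seed": None}}

paths = resolve_config_paths("qmix", "sc2")
overridden = apply_override(env_defaults, "env_args.map_name", "corridor")
```

With these helpers, the SMAC command above would correspond to the files src/config/algs/qmix.yaml and src/config/envs/sc2.yaml, with env_args.map_name replaced by corridor and every other default left untouched.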

Run parallel experiments:

# bash run.sh config_name map_name_list (threads_num arg_list gpu_list experiments_num)
bash run.sh qmix corridor 2 epsilon_anneal_time=500000 0,1 5

Each xxx_list argument is a comma-separated list (for example, 0,1 for gpu_list).

All results are stored in the Results folder and named after map_name.
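The comma-separated arguments can be pictured with the following Python sketch. It does not reproduce run.sh: the function names split_list and plan_runs are hypothetical, and assigning GPUs to runs in round-robin order is an assumption made for illustration. It splits each xxx_list argument and lists the single-run commands that one run.sh call of the form shown above could stand for.

```python
from itertools import cycle


def split_list(arg):
    """Split a comma-separated argument such as "0,1" into its items."""
    return [item for item in arg.split(",") if item]


def plan_runs(config_name, map_name_list, arg_list, gpu_list, experiments_num):
    """List (gpu, command) pairs, one per experiment and map.

    Round-robin GPU assignment is an assumption of this sketch.
    """
    maps = split_list(map_name_list)
    extra_args = split_list(arg_list)
    gpus = cycle(split_list(gpu_list))
    plan = []
    for map_name in maps:
        for _ in range(experiments_num):
            command = [
                "python3", "src/main.py",
                f"--config={config_name}", "--env-config=sc2",
                "with", f"env_args.map_name={map_name}",
                *extra_args,
            ]
            plan.append((next(gpus), command))
    return plan


# Mirrors the example call: bash run.sh qmix corridor 2 epsilon_anneal_time=500000 0,1 5
plan = plan_runs("qmix", "corridor", "epsilon_anneal_time=500000", "0,1", 5)
for gpu, command in plan:
    print(f"GPU {gpu}: {' '.join(command)}")
```

The threads_num argument is left out of the sketch because it only limits how many of these runs execute at once.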

Force all training processes to exit

# All Python and game processes of the current user will quit.
bash clean.sh

Some test results on Super Hard scenarios

Cite

@article{hu2021riit,
      title={RIIT: Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning}, 
      author={Jian Hu and Siyang Jiang and Seth Austin Harding and Haibin Wu and Shih-wei Liao},
      year={2021},
      eprint={2102.03479},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}
