Reinforcement Learning Tricks, Index

This repository contains the code for the paper "Distilling Reinforcement Learning Tricks for Video Games".

Short story shorter: RL algorithms are neat and all, but to get it to work in video games (RL competitions and whatnot), there are some nifty little tricks involved that need bit of expertise in the domain. This includes reward shaping, curriculum learning, splitting task into subtasks by hand and guiding agent's actions. We took some of these tricks and tried them on three environments with DQN. With right setup you get more out of DQN.

Code authors: Anssi Kanervisto, Christian Scheller and Yanick Schraner.

The experiments in the three environments are split into three git branches:

vizdoom for ViZDoom Deathmatch experiments
minerl for MineRL ObtainDiamond experiments
gfootball for Football environment experiments

To run the experiments, checkout the repository you want to run experiments for with git checkout [branch name], and follow the instructions in the README file there.

After running all the experiments, collect the results as described the respective branches. You should have three directories

vizdoom-runs
minerl-runs
football-runs

After this, running python plot_paper.py should create a figures/learning_curves.pdf file which summarizes the results.

Evaluating different engineering tricks that make RL work

Related tags

Overview

Reinforcement Learning Tricks, Index

Owner

Anssi

[ECCV 2020] Reimplementation of 3DDFAv2, including face mesh, head pose, landmarks, and more.

Gapmm2: gapped alignment using minimap2 (align transcripts to genome)

CRF-RNN for Semantic Image Segmentation - PyTorch version

Metadata-Extractor - Metadata Extractor Script can be used to read in exif metadata

Lorien: A Unified Infrastructure for Efficient Deep Learning Workloads Delivery

PiRank: Learning to Rank via Differentiable Sorting

Official implementation of "Learning Not to Reconstruct" (BMVC 2021)

Semantic Segmentation in Pytorch

Code for Two-stage Identifier: "Locate and Label: A Two-stage Identifier for Nested Named Entity Recognition"

Library for implementing reservoir computing models (echo state networks) for multivariate time series classification and clustering.

Rank 3 : Source code for OPPO 6G Data Generation Challenge

Semi-supervised Adversarial Learning to Generate Photorealistic Face Images of New Identities from 3D Morphable Model

Official implementation of the Implicit Behavioral Cloning (IBC) algorithm

A framework for annotating 3D meshes using the predictions of a 2D semantic segmentation model.

Minecraft Hack Detection With Python

A generalized framework for prototyping full-stack cooperative driving automation applications under CARLA+SUMO.

Implementation of Perceiver, General Perception with Iterative Attention in TensorFlow

This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers.

Implementation of Research Paper "Learning to Enhance Low-Light Image via Zero-Reference Deep Curve Estimation"

Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI