Official code repository for Continual Learning In Environments With Polynomial Mixing Times

Last update: Dec 19, 2021

Overview

Continual Learning in Environments with Polynomial Mixing Times

This repository provides the official codebase for the paper "Continual Learning in Environments with Polynomial Mixing Times".

Basic Setup

Clone this repository, then change into its directory:

cd polynomial-mixing-times

Create and activate either a Python virtualenv or a conda environment. For example, with virtualenv:

pip install virtualenv
virtualenv -p /usr/bin/python3.7 mixing-times
source mixing-times/bin/activate

To install all the relevant packages use the following command:

pip install -e .

Running the experiments

We provide a run script, run_bottleneck.sh, that contains all the hyperparameters used for both the baselines and our proposed model. Running it runs all the models.

To run the proposed models on the Example 2 Bottleneck MDP class with 4 rooms, "random" task evolution, and a random seed of 1, use the following command:

bash run_bottleneck.sh 1 4 "random"
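To repeat the same experiment over several seeds, a small dry-run helper along the following lines may be useful. It is only a sketch: `sweep_commands` is a hypothetical function that is not part of this repository, and the argument order (seed, number of rooms, task evolution) is inferred from the example command above rather than from the script itself.

```shell
# Hypothetical dry-run helper (not part of the repository): prints one
# run_bottleneck.sh command per seed instead of executing it.
# Assumed argument order for the script: <seed> <rooms> <task evolution>.
sweep_commands() {
  rooms="$1"
  evolution="$2"
  shift 2
  for seed in "$@"; do
    printf 'bash run_bottleneck.sh %s %s "%s"\n' "$seed" "$rooms" "$evolution"
  done
}

# Print the commands for seeds 1, 2 and 3 with 4 rooms and "random" evolution.
sweep_commands 4 random 1 2 3
```

Printing the commands first makes it easy to check them before launching anything. Once they look right, the output can be piped to `sh` from the repository root to run the experiments one after another.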

Available Models

Online Q-learning
Q-learning with Replay
Q-learning with Dyna
Model-based n-step TD
Vanilla Policy Gradient
On-policy rho learning
Off-policy rho learning
rho Policy Gradient

List of Environments

ScaleClass-v0
NBottleneckClass-v0
NCycleClass-v0

System requirements

We used Python 3.7 to run all our experiments.

Owner

Sharath Raparthy
