Deep Reinforcement Learning Agents

This repository contains a collection of reinforcement learning algorithms written in Tensorflow. The ipython notebook here were written to go along with a still-underway tutorial series I have been publishing on Medium. If you are new to reinforcement learning, I recommend reading the accompanying post for each algorithm.

The repository currently contains the following algorithms:

Q-Table - An implementation of Q-learning using tables to solve a stochastic environment problem.
Q-Network - A neural network implementation of Q-Learning to solve the same environment as in Q-Table.
Simple-Policy - An implementation of policy gradient method for stateless environments such as n-armed bandit problems.
Contextual-Policy - An implementation of policy gradient method for stateful environments such as contextual bandit problems.
Policy-Network - An implementation of a neural network policy-gradient agent that solves full RL problems with states and delayed rewards, and two opposite actions (ie. CartPole or Pong).
Vanilla-Policy - An implementation of a neural network vanilla-policy-gradient agent that solves full RL problems with states, delayed rewards, and an arbitrary number of actions.
Model-Network - An addition to the Policy-Network algorithm which includes a separate network which models the environment dynamics.
Double-Dueling-DQN - An implementation of a Deep-Q Network with the Double DQN and Dueling DQN additions to improve stability and performance.
Deep-Recurrent-Q-Network - An implementation of a Deep Recurrent Q-Network which can solve reinforcement learning problems involving partial observability.
Q-Exploration - An implementation of DQN containing multiple action-selection strategies for exploration. Strategies include: greedy, random, e-greedy, Boltzmann, and Bayesian Dropout.
A3C-Doom - An implementation of Asynchronous Advantage Actor-Critic (A3C) algorithm. It utilizes multiple agents to collectively improve a policy. This implementation can solve RL problems in 3D environments such as VizDoom challenges.

A set of Deep Reinforcement Learning Agents implemented in Tensorflow.

Related tags

Overview

Deep Reinforcement Learning Agents

Owner

Arthur Juliani

Video Contrastive Learning with Global Context

Diverse Object-Scene Compositions For Zero-Shot Action Recognition

Parameter-ensemble-differential-evolution - Shows how to do parameter ensembling using differential evolution.

Link prediction using Multiple Order Local Information (MOLI)

This folder contains the python code of UR5E's advanced forward kinematics model.

Repo for our ICML21 paper Unsupervised Learning of Visual 3D Keypoints for Control

AAAI 2022: Stationary diffusion state neural estimation

Code for CoMatch: Semi-supervised Learning with Contrastive Graph Regularization

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

A Protein-RNA Interface Predictor Based on Semantics of Sequences

bio_inspired_min_nets_improve_the_performance_and_robustness_of_deep_networks

This tutorial repository is to introduce the functionality of KGTK to first-time users

Designing a Practical Degradation Model for Deep Blind Image Super-Resolution (ICCV, 2021) (PyTorch) - We released the training code!

Experiments for Operating Systems Lab (ETCS-352)

MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks

Lightweight stereo matching network based on MobileNetV1 and MobileNetV2

Top #1 Submission code for the first https://alphamev.ai MEV competition with best AUC (0.9893) and MSE (0.0982).

WSDM‘2022: Knowledge Enhanced Sports Game Summarization

The official implementation of Equalization Loss for Long-Tailed Object Recognition (CVPR 2020) based on Detectron2

BADet: Boundary-Aware 3D Object Detection from Point Clouds (Pattern Recognition 2022)