A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

Last update: Dec 28, 2022

Overview

Reinforcement-Learning-Notebooks

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

I wrote these notebooks in March 2017 while I took the COMP 767: Reinforcement Learning [5] class by Prof. Doina Precup at McGill, Montréal. I highly recommend you to go through the class notes and references of all the papers the intructors have posted on the website.

These notebooks should be used while you read the book and go beyond the same with the referenced papers. I would suggest watching David Silver's videos and reading the book simultaneously. And when you are done with a few chapters, start implementing them. The algorithms follow a pattern and mostly are variants of each other. I have tried my best to explain each notebook's results and possible future directions.

Disclaimer: The code is a little messy. I'd written this when I was not a Pythonista. If you would like to clean them up and want to make it into a nice interface, feel free to contact me. I will be very pleased to collaborate. If you use them then please cite the source and also mention the credits as listed below. Also, email me with ways to improve, let me know if you find any bugs.

Feel free to reach me at [email protected] or see my website here

Special Credits:

[1] Denny Britz

[2] Monica Patel

[3] Sutton and Barto

[4] David Silver

[5] Doina Precup's course

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

Related tags

Overview

Reinforcement-Learning-Notebooks

A collection of Reinforcement Learning algorithms from Sutton and Barto's book and other research papers implemented in Python.

Owner

Pulkit Khandelwal

Iterative Normalization: Beyond Standardization towards Efficient Whitening

Introducing neural networks to predict stock prices

Energy consumption estimation utilities for Jetson-based platforms

ONNX Command-Line Toolbox

Deploying PyTorch Model to Production with FastAPI in CUDA-supported Docker

Generative Exploration and Exploitation - This is an improved version of GENE.

Towards Understanding Quality Challenges of the Federated Learning: A First Look from the Lens of Robustness

Implementation of E(n)-Transformer, which extends the ideas of Welling's E(n)-Equivariant Graph Neural Network to attention

Pytorch implementation of Masked Auto-Encoder

Accelerating BERT Inference for Sequence Labeling via Early-Exit

📚 A collection of Jupyter notebooks for learning and experimenting with OpenVINO 👓

The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021)

Self-Supervised Generative Style Transfer for One-Shot Medical Image Segmentation

Implementation of self-attention mechanisms for general purpose. Focused on computer vision modules. Ongoing repository.

VID-Fusion: Robust Visual-Inertial-Dynamics Odometry for Accurate External Force Estimation

Pytorch implementation for "Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets" (ECCV 2020 Spotlight)

Learning to Prompt for Vision-Language Models.

Integrated physics-based and ligand-based modeling.

StyleMapGAN - Official PyTorch Implementation

MDETR: Modulated Detection for End-to-End Multi-Modal Understanding