Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning

Last update: Dec 26, 2022

Related tags

Deep Learning DQN-tensorflow

Overview

Human-Level Control through Deep Reinforcement Learning

Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning.

This implementation contains:

Deep Q-network and Q-learning
Experience replay memory
- to reduce the correlations between consecutive updates
Network for Q-learning targets are fixed for intervals
- to reduce the correlations between target and predicted Q-values

Requirements

Python 2.7 or Python 3.3+
gym
tqdm
SciPy or OpenCV2
TensorFlow 0.12.0

Usage

First, install prerequisites with:

$ pip install tqdm gym[all]

To train a model for Breakout:

$ python main.py --env_name=Breakout-v0 --is_train=True
$ python main.py --env_name=Breakout-v0 --is_train=True --display=True

To test and record the screen with gym:

$ python main.py --is_train=False
$ python main.py --is_train=False --display=True

Results

Result of training for 24 hours using GTX 980 ti.

Simple Results

Details of Breakout with model m2(red) for 30 hours using GTX 980 Ti.

Details of Breakout with model m3(red) for 30 hours using GTX 980 Ti.

Detailed Results

[1] Action-repeat (frame-skip) of 1, 2, and 4 without learning rate decay

[2] Action-repeat (frame-skip) of 1, 2, and 4 with learning rate decay

[1] & [2]

[3] Action-repeat of 4 for DQN (dark blue) Dueling DQN (dark green) DDQN (brown) Dueling DDQN (turquoise)

The current hyper parameters and gradient clipping are not implemented as it is in the paper.

[4] Distributed action-repeat (frame-skip) of 1 without learning rate decay

[5] Distributed action-repeat (frame-skip) of 4 without learning rate decay

References

License

MIT License.

Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning

Related tags

Overview

Human-Level Control through Deep Reinforcement Learning

Requirements

Usage

Results

Simple Results

Detailed Results

References

License

Owner

Devsisters Corp.

Tensor-Based Quantum Machine Learning

Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)

MapReader: A computer vision pipeline for the semantic exploration of maps at scale

Discovering Explanatory Sentences in Legal Case Decisions Using Pre-trained Language Models.

Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources

Official implementation for paper: Feature-Style Encoder for Style-Based GAN Inversion

A new data augmentation method for extreme lighting conditions.

Arch-Net: Model Distillation for Architecture Agnostic Model Deployment

Structure-Preserving Deraining with Residue Channel Prior Guidance (ICCV2021)

The devkit of the nuScenes dataset.

deep learning model that learns to code with drawing in the Processing language

Open-Domain Question-Answering for COVID-19 and Other Emergent Domains

PyTorch DepthNet Training on Still Box dataset

Python Fanduel API (2021) - Lineup Automation

Propose a principled and practically effective framework for unsupervised accuracy estimation and error detection tasks with theoretical analysis and state-of-the-art performance.

Distilling Motion Planner Augmented Policies into Visual Control Policies for Robot Manipulation (CoRL 2021)

Grammar Induction using a Template Tree Approach

DeepVoxels is an object-specific, persistent 3D feature embedding.

Anti-Adversarially Manipulated Attributions for Weakly and Semi-Supervised Semantic Segmentation (CVPR 2021)

TensorFlow implementation of "Attention is all you need (Transformer)"