PyTorch implementation of ENet

Last update: Dec 29, 2022

Overview

PyTorch-ENet

PyTorch (v1.1.0) implementation of ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation, ported from the lua-torch implementation ENet-training created by the authors.

This implementation has been tested on the CamVid and Cityscapes datasets. Pre-trained models for both CamVid and Cityscapes are currently available here.

Dataset	Classes ¹	Input resolution	Batch size	Epochs	Mean IoU (%)	GPU memory (GiB)	Training time (hours)²
CamVid	11	480x360	10	300	52.1³	4.2	1
Cityscapes	19	1024x512	4	300	59.5⁴	5.4	20

¹ When referring to the number of classes, the void/unlabeled class is always excluded.
² These figures are for reference only. Changes in implementation, datasets, and hardware can lead to very different results. Reference hardware: Nvidia GTX 1070 and an AMD Ryzen 5 3600 3.6GHz. Training for about 100 epochs also gives a similar mean IoU (± 2%).
³ Test set.
⁴ Validation set.
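
For readers unfamiliar with the metric in the table, mean IoU with the void/unlabeled class excluded can be computed from a confusion matrix roughly as follows. This is a dependency-free sketch rather than the code in the metric folder; the function name mean_iou, the ignore_index parameter, and the example matrix are hypothetical.

```python
def mean_iou(conf_matrix, ignore_index=None):
    """Mean intersection-over-union from a square confusion matrix.

    conf_matrix[t][p] counts pixels with true class t and predicted class p.
    ignore_index (hypothetical) is the void/unlabeled class; its row and
    column are left out of every sum.
    """
    n = len(conf_matrix)
    keep = [c for c in range(n) if c != ignore_index]
    ious = []
    for c in keep:
        tp = conf_matrix[c][c]
        fp = sum(conf_matrix[t][c] for t in keep if t != c)
        fn = sum(conf_matrix[c][p] for p in keep if p != c)
        denom = tp + fp + fn
        if denom > 0:  # skip classes absent from both labels and predictions
            ious.append(tp / denom)
    return sum(ious) / len(ious) if ious else float("nan")


if __name__ == "__main__":
    # Made-up counts for two real classes plus an unlabeled class (index 2).
    cm = [
        [3, 1, 0],
        [1, 2, 0],
        [5, 5, 5],
    ]
    print(mean_iou(cm, ignore_index=2))
```

Dropping both the row and the column of the ignored class means unlabeled pixels count neither as errors nor as hits, which matches the footnote about excluding the void class from the class count.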

Installation

Local pip

Install Python 3 and pip
Set up a virtual environment (optional, but recommended)
Install dependencies using pip: pip install -r requirements.txt

Docker image

Build the image: docker build -t enet .
Run: docker run -it --gpus all --ipc host enet

Usage

Run main.py, the main script file used for training and/or testing the model. The following options are supported:

python main.py [-h] [--mode {train,test,full}] [--resume]
               [--batch-size BATCH_SIZE] [--epochs EPOCHS]
               [--learning-rate LEARNING_RATE] [--lr-decay LR_DECAY]
               [--lr-decay-epochs LR_DECAY_EPOCHS]
               [--weight-decay WEIGHT_DECAY] [--dataset {camvid,cityscapes}]
               [--dataset-dir DATASET_DIR] [--height HEIGHT] [--width WIDTH]
               [--weighing {enet,mfb,none}] [--with-unlabeled]
               [--workers WORKERS] [--print-step] [--imshow-batch]
               [--device DEVICE] [--name NAME] [--save-dir SAVE_DIR]

For help on the optional arguments run: python main.py -h
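
To make the option list easier to read, the sketch below rebuilds an equivalent parser with argparse. It is only an illustration derived from the usage text, not the contents of args.py: the argument types are assumptions, no defaults are set because the usage text does not state them, and options printed without a value (such as --resume) are assumed to be boolean switches.

```python
import argparse


def build_parser():
    """Rebuild the option set from the usage text; types are assumptions."""
    parser = argparse.ArgumentParser(prog="main.py")
    parser.add_argument("--mode", "-m", choices=["train", "test", "full"])
    # Options printed without a value in the usage text are treated as switches.
    parser.add_argument("--resume", action="store_true")
    parser.add_argument("--batch-size", type=int)
    parser.add_argument("--epochs", type=int)
    parser.add_argument("--learning-rate", type=float)
    parser.add_argument("--lr-decay", type=float)
    parser.add_argument("--lr-decay-epochs", type=int)
    parser.add_argument("--weight-decay", type=float)
    parser.add_argument("--dataset", choices=["camvid", "cityscapes"])
    parser.add_argument("--dataset-dir")
    parser.add_argument("--height", type=int)
    parser.add_argument("--width", type=int)
    parser.add_argument("--weighing", choices=["enet", "mfb", "none"])
    parser.add_argument("--with-unlabeled", action="store_true")
    parser.add_argument("--workers", type=int)
    parser.add_argument("--print-step", action="store_true")
    parser.add_argument("--imshow-batch", action="store_true")
    parser.add_argument("--device")
    parser.add_argument("--name")
    parser.add_argument("--save-dir")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args(
        ["-m", "train", "--resume", "--dataset", "camvid", "--batch-size", "10"]
    )
    print(args.mode, args.resume, args.dataset, args.batch_size)
```

argparse converts dashes in option names to underscores, so --batch-size is read back as args.batch_size.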

Examples: Training

python main.py -m train --save-dir save/folder/ --name model_name --dataset name --dataset-dir path/root_directory/
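
The --weighing option listed under Usage selects how class weights for the training loss are computed. As an illustration of the enet choice, the ENet paper defines the weight of a class as 1 / ln(c + p_class) with c = 1.02, where p_class is the fraction of pixels belonging to that class. The sketch below assumes per-class pixel counts are already available; the function name and the example counts are hypothetical, and this is not necessarily how this repository computes the weights.

```python
import math


def enet_weights(pixel_counts, c=1.02):
    """Class weights w = 1 / ln(c + p), with p the per-class pixel fraction.

    pixel_counts is a hypothetical list holding one pixel count per class.
    """
    total = sum(pixel_counts)
    if total <= 0:
        raise ValueError("pixel_counts must contain at least one pixel")
    # p lies in [0, 1], so c + p stays above 1 and the logarithm is positive.
    return [1.0 / math.log(c + count / total) for count in pixel_counts]


if __name__ == "__main__":
    # Made-up counts: one dominant class, one common class, one rare class.
    for count, weight in zip([900, 90, 10], enet_weights([900, 90, 10])):
        print(count, round(weight, 2))
```

Rare classes receive larger weights, and the constant c caps the weight at 1 / ln(1.02), roughly 50, so a class with almost no pixels cannot dominate the loss.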

Examples: Resuming training

python main.py -m train --resume --save-dir save/folder/ --name model_name --dataset name --dataset-dir path/root_directory/

Examples: Testing

python main.py -m test --save-dir save/folder/ --name model_name --dataset name --dataset-dir path/root_directory/

Project structure

Folders

data: Contains instructions on how to download the datasets and the code that handles data loading.
metric: Evaluation-related metrics.
models: ENet model definition.
save: By default, main.py will save models in this folder. The pre-trained models can also be found here.

Files

args.py: Contains all command-line options.
main.py: Main script file used for training and/or testing the model.
test.py: Defines the Test class which is responsible for testing the model.
train.py: Defines the Train class which is responsible for training the model.
transforms.py: Defines image transformations to convert an RGB image encoding classes to a torch.LongTensor and vice versa.
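
To illustrate the kind of conversion transforms.py describes, here is a dependency-free sketch that maps an RGB label image to class indices and back using a color table. The three-entry palette, the function names, and the nested-list image representation are assumptions made for illustration; the actual module produces a torch.LongTensor, which is not shown here.

```python
# Hypothetical palette: class index -> RGB color used in the label image.
PALETTE = {
    0: (128, 128, 128),
    1: (128, 0, 0),
    2: (0, 0, 0),
}


def rgb_to_index(image, palette=PALETTE):
    """Convert rows of RGB pixels into rows of class indices.

    Raises KeyError if a pixel color is not present in the palette.
    """
    color_to_index = {color: idx for idx, color in palette.items()}
    return [[color_to_index[tuple(pixel)] for pixel in row] for row in image]


def index_to_rgb(labels, palette=PALETTE):
    """Convert rows of class indices back into rows of RGB tuples."""
    return [[palette[idx] for idx in row] for row in labels]


if __name__ == "__main__":
    label_image = [
        [(128, 0, 0), (0, 0, 0)],
        [(128, 128, 128), (128, 0, 0)],
    ]
    indices = rgb_to_index(label_image)
    print(indices)
    print(index_to_rgb(indices) == label_image)
```

Inverting the palette once per call keeps the per-pixel lookup a constant-time dictionary access, and the round trip only recovers the original image when every color in the palette is unique.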

Owner

David Silva
