An implementation of RetinaNet in PyTorch.

Last update: Jan 04, 2023

Overview

RetinaNet

An implementation of RetinaNet in PyTorch.

Installation
Training
Evaluation
Todo
Credits

Installation

Install PyTorch and torchvision.
For faster data augmentation, install pillow-simd:

pip uninstall -y pillow
pip install pillow-simd

Training

COCO 2017

First, install pycocotools:

git clone https://github.com/pdollar/coco/
cd coco/PythonAPI
make
python setup.py install
cd ../..
rm -r coco

Then download COCO 2017 into ./datasets/COCO/:

cd datasets
mkdir COCO
cd COCO

If your using wget:

wget http://images.cocodataset.org/zips/train2017.zip &&
wget http://images.cocodataset.org/zips/val2017.zip &&
wget http://images.cocodataset.org/annotations/annotations_trainval2017.zip

If your using aria2c (recommended on for higher bandwidth connections and for allowing resumption of the download. Tune the number of max concurrent downloads (-j) and max connections per server (-x) as needed:

aria2c -x 10 -j 10 http://images.cocodataset.org/zips/train2017.zip &&
aria2c -x 10 -j 10 http://images.cocodataset.org/zips/val2017.zip &&
aria2c -x 10 -j 10 http://images.cocodataset.org/annotations/annotations_trainval2017.zip

unzip *.zip
rm *.zip

Then just run:

python train_coco.py

Pascal VOC

cd datasets
mkdir VOC
cd VOC

wget http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtrainval_06-Nov-2007.tar &&
wget http://host.robots.ox.ac.uk/pascal/VOC/voc2012/VOCtrainval_11-May-2012.tar &&
wget http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtest_06-Nov-2007.tar

aria2c -x 10 -j 10 http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtrainval_06-Nov-2007.tar &&
aria2c -x 10 -j 10 http://host.robots.ox.ac.uk/pascal/VOC/voc2012/VOCtrainval_11-May-2012.tar &&
aria2c -x 10 -j 10 http://host.robots.ox.ac.uk/pascal/VOC/voc2007/VOCtest_06-Nov-2007.tar

tar xf *.tar
rm *.tar

Then just run:

python train_voc.py

Custom Dataset

Lots to write here. 😉

Evaluation

To evaluate an image on a trained model:

python eval.py [checkpoint_path] [image_path]

This will create an image (output.jpg) with bounding box annotations.

Todo

Finish converting the COCO dataset class to work with batches.
Train COCO 2017 for 90,000 iterations and save a reusable checkpoint.
Try training on Pascal VOC and add download instructions.
Produce bounding box outputs for a few sanity check images.
Upload trained weights to Github releases.
Train on the 🔮 magic proprietary dataset ✨ .

An implementation of RetinaNet in PyTorch.

Related tags

Overview

RetinaNet

Installation

Training

COCO 2017

Pascal VOC

Custom Dataset

Evaluation

Todo

Credits

Owner

Conner Vercellino

Privacy-Preserving Portrait Matting [ACM MM-21]

MediaPipe Kullanarak İleri Seviye Bilgisayarla Görü

[ICCV21] Self-Calibrating Neural Radiance Fields

OHLC Average Prediction of Apple Inc. Using LSTM Recurrent Neural Network

Fully Convolutional DenseNets for semantic segmentation.

Tutorial repo for an end-to-end Data Science project

mmfewshot is an open source few shot learning toolbox based on PyTorch

Final project code: Implementing MAE with downscaled encoders and datasets, for ESE546 FA21 at University of Pennsylvania

Facial Action Unit Intensity Estimation via Semantic Correspondence Learning with Dynamic Graph Convolution

Cryptocurrency Prediction with Artificial Intelligence (Deep Learning via LSTM Neural Networks)

This repository contains notebook implementations of the following Neural Process variants: Conditional Neural Processes (CNPs), Neural Processes (NPs), Attentive Neural Processes (ANPs).

The code repository for "RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection" (ACM MM'21)

Identifying a Training-Set Attack’s Target Using Renormalized Influence Estimation

A working implementation of the Categorical DQN (Distributional RL).

Julia package for contraction of tensor networks, based on the sweep line algorithm outlined in the paper General tensor network decoding of 2D Pauli codes

MoCap-Solver: A Neural Solver for Optical Motion Capture Data

Official codebase for running the small, filtered-data GLIDE model from GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models.

Awesome Remote Sensing Toolkit based on PaddlePaddle.

Video Contrastive Learning with Global Context

Repo for the paper "DiLBERT: Cheap Embeddings for Disease Related Medical NLP"