A PyTorch library and evaluation platform for end-to-end compression research

Last update: Jan 06, 2023

Overview

CompressAI

CompressAI (compress-ay) is a PyTorch library and evaluation platform for end-to-end compression research.

CompressAI currently provides:

custom operations, layers and models for deep learning based data compression
a partial port of the official TensorFlow compression library
pre-trained end-to-end compression models for learned image compression
evaluation scripts to compare learned models against classical image/video compression codecs

Note: Multi-GPU support is now experimental.

Installation

CompressAI supports python 3.6+ and PyTorch 1.7+.

pip:

pip install compressai

Note: wheels are available for Linux and MacOS.

From source:

A C++17 compiler, a recent version of pip (19.0+), and common python packages are also required (see setup.py for the full list).

To get started locally and install the development version of CompressAI, run the following commands in a virtual environment:

git clone https://github.com/InterDigitalInc/CompressAI compressai
cd compressai
pip install -U pip && pip install -e .

For a custom installation, you can also run one of the following commands:

pip install -e '.[dev]': install the packages required for development (testing, linting, docs)
pip install -e '.[tutorials]': install the packages required for the tutorials (notebooks)
pip install -e '.[all]': install all the optional packages

Note: Docker images will be released in the future. Conda environments are not officially supported.

Documentation

Usage

Examples

Script and notebook examples can be found in the examples/ directory.

To encode/decode images with the provided pre-trained models, run the codec.py example:

python3 examples/codec.py --help

An examplary training script with a rate-distortion loss is provided in examples/train.py. You can replace the model used in the training script with your own model implemented within CompressAI, and then run the script for a simple training pipeline:

python3 examples/train.py -d /path/to/my/image/dataset/ --epochs 300 -lr 1e-4 --batch-size 16 --cuda --save

Note: the training example uses a custom ImageFolder structure.

A jupyter notebook illustrating the usage of a pre-trained model for learned image compression is also provided in the examples directory:

pip install -U ipython jupyter ipywidgets matplotlib
jupyter notebook examples/

Evaluation

To evaluate a trained model on your own dataset, CompressAI provides an evaluation script:

python3 -m compressai.utils.eval_model checkpoint /path/to/images/folder/ -a $ARCH -p $MODEL_CHECKPOINT...

To evaluate traditional image/video codecs:

python3 -m compressai.utils.bench --help
python3 -m compressai.utils.bench bpg --help
python3 -m compressai.utils.bench vtm --help

Tests

Run tests with pytest:

pytest -sx --cov=compressai --cov-append --cov-report term-missing tests

Slow tests can be skipped with the -m "not slow" option.

License

CompressAI is licensed under the Apache License, Version 2.0

Contributing

We welcome feedback and contributions. Please open a GitHub issue to report bugs, request enhancements or if you have any questions.

Before contributing, please read the CONTRIBUTING.md file.

Authors

Jean Bégaint, Fabien Racapé, Simon Feltman and Akshay Pushparaja, InterDigital AI Lab.

Citation

If you use this project, please cite the relevant original publications for the models and datasets, and cite this project as:

@article{begaint2020compressai,
	title={CompressAI: a PyTorch library and evaluation platform for end-to-end compression research},
	author={B{\'e}gaint, Jean and Racap{\'e}, Fabien and Feltman, Simon and Pushparaja, Akshay},
	year={2020},
	journal={arXiv preprint arXiv:2011.03029},
}

A PyTorch library and evaluation platform for end-to-end compression research

Related tags

Overview

CompressAI

Installation

Documentation

Usage

Examples

Evaluation

Tests

License

Contributing

Authors

Citation

Related links

Owner

InterDigital

Geometric Algebra package for JAX

PyExplainer: A Local Rule-Based Model-Agnostic Technique (Explainable AI)

Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021

MultiLexNorm 2021 competition system from ÚFAL

Orthogonal Over-Parameterized Training

Resources for our AAAI 2022 paper: "LOREN: Logic-Regularized Reasoning for Interpretable Fact Verification".

A curated (most recent) list of resources for Learning with Noisy Labels

A custom DeepStack model that has been trained detecting ONLY the USPS logo

Create animations for the optimization trajectory of neural nets

PyTorch implementations of the paper: "Learning Independent Instance Maps for Crowd Localization"

Contextual Attention Localization for Offline Handwritten Text Recognition

Implementation of temporal pooling methods studied in [ICIP'20] A Comparative Evaluation Of Temporal Pooling Methods For Blind Video Quality Assessment

This code is for eCaReNet: explainable Cancer Relapse Prediction Network.

A Learning-based Camera Calibration Toolbox

Semantic Segmentation Architectures Implemented in PyTorch

Music Source Separation; Train & Eval & Inference piplines and pretrained models we used for 2021 ISMIR MDX Challenge.

Human4D Dataset tools for processing and visualization

Data loaders and abstractions for text and NLP

The Medical Detection Toolkit contains 2D + 3D implementations of prevalent object detectors such as Mask R-CNN, Retina Net, Retina U-Net, as well as a training and inference framework focused on dealing with medical images.

Self-driving car env with PPO algorithm from stable baseline3