The official repo for OC-SORT: Observation-Centric SORT on video Multi-Object Tracking. OC-SORT is simple, online and robust to occlusion/non-linear motion.

Last update: Jan 05, 2023

Overview

OC-SORT

Observation-Centric SORT (OC-SORT) is a pure motion-model-based multi-object tracker. It aims to improve tracking robustness in crowded scenes and when objects are in non-linear motion. It is designed by recognizing and fixing limitations in Kalman filter and SORT. It is flexible to integrate with different detectors and matching modules, such as appearance similarity. It remains, Simple, Online and Real-time.

News

[04/27/2022]: Support intergration with BYTE and multiple cost metrics, such as GIoU, CIoU, etc.
[04/02/2022]: A preview version is released after a primary cleanup and refactor.
[03/27/2022]: The arxiv preprint of OC-SORT is released.

Benchmark Performance

Dataset	HOTA	AssA	IDF1	MOTA	FP	FN	IDs	Frag
MOT17 (private)	63.2	63.2	77.5	78.0	15,129	107,055	1,950	2,040
MOT17 (public)	52.4	57.6	65.1	58.2	4,379	230,449	784	2,006
MOT20 (private)	62.4	62.5	76.4	75.9	20,218	103,791	938	1,004
MOT20 (public)	54.3	59.5	67.0	59.9	4,434	202,502	554	2,345
KITTI-cars	76.5	76.4	-	90.3	2,685	407	250	280
KITTI-pedestrian	54.7	59.1	-	65.1	6,422	1,443	204	609
DanceTrack-test	55.1	38.0	54.2	89.4	114,107	139,083	1,992	3,838
CroHD HeadTrack	44.1	-	62.9	67.9	102,050	164,090	4,243	10,122

Results are from reusing detections of previous methods and shared hyper-parameters. Tune the implementation adaptive to datasets may get higher performance.
The inference speed is ~28FPS by a RTX 2080Ti GPU. If the detections are provided, the inference speed of OC-SORT association is 700FPS by a i9-3.0GHz CPU.
A sample from DanceTrack-test set is as below and more visualizatiosn are available on Google Drive

Get Started

See INSTALL.md for instructions of installing required components.
See GET_STARTED.md for how to get started with OC-SORT.
See MODEL_ZOO.md for available YOLOX weights.
See DEPLOY.md for deployment support over ONNX, TensorRT and ncnn.

Demo

To run the tracker on a provided demo video from Youtube:

python3 tools/demo_track.py --demo_type video -f exps/example/mot/yolox_dancetrack_test.py -c pretrained/ocsort_dance_model.pth.tar --path videos/dance_demo.mp4 --fp16 --fuse --save_result --out_path demo_out.mp4

Roadmap

We are still actively updating OC-SORT. We always welcome contributions to make it better for the community. We have some high-priorty to-dos as below:

Add more asssocitaion cost choices: GIoU, CIoU, etc.
Support OC-SORT in mmtracking.
Add more deployment options and improve the inference speed.
Make OC-SORT adaptive to customized detector.

Acknowledgement and Citation

The codebase is built highly upon YOLOX, filterpy, and ByteTrack. We thank their wondeful works. OC-SORT, filterpy and ByteTrack are available under MIT License. And YOLOX uses Apache License 2.0 License.

If you find this work useful, please consider to cite our paper:

@article{cao2022observation,
  title={Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking},
  author={Cao, Jinkun and Weng, Xinshuo and Khirodkar, Rawal and Pang, Jiangmiao and Kitani, Kris},
  journal={arXiv preprint arXiv:2203.14360},
  year={2022}
}

The official repo for OC-SORT: Observation-Centric SORT on video Multi-Object Tracking. OC-SORT is simple, online and robust to occlusion/non-linear motion.

Related tags

Overview

OC-SORT

News

Benchmark Performance

Get Started

Demo

Roadmap

Acknowledgement and Citation

Owner

Jinkun Cao

Notebooks, slides and dataset of the CorrelAid Machine Learning Winter School

Official implementation of "Learning to Discover Cross-Domain Relations with Generative Adversarial Networks"

(Preprint) Official PyTorch implementation of "How Do Vision Transformers Work?"

SalFBNet: Learning Pseudo-Saliency Distribution via Feedback Convolutional Networks

Semantic Segmentation for Real Point Cloud Scenes via Bilateral Augmentation and Adaptive Fusion (CVPR 2021)

This was initially the repo for the project of [email protected] of Asaf Mazar, Millad Kassaie and Georgios Chochlakis named "Powered by the Will? Exploring Lay Theories of Behavior Change through Social Media"

Definition of a business problem according to Wilson Lower Bound Score and Time Based Average Rating

Custom TensorFlow2 implementations of forward and backward computation of soft-DTW algorithm in batch mode.

A PyTorch implementation for PyramidNets (Deep Pyramidal Residual Networks)

LETR: Line Segment Detection Using Transformers without Edges

Code for "Learning Graph Cellular Automata"

Restricted Boltzmann Machines in Python.

Scalable, event-driven, deep-learning-friendly backtesting library

Code and dataset for AAAI 2021 paper FixMyPose: Pose Correctional Describing and Retrieval Hyounghun Kim, Abhay Zala, Graham Burri, Mohit Bansal.

MDMM - Learning multi-domain multi-modality I2I translation

Very simple NCHW and NHWC conversion tool for ONNX. Change to the specified input order for each and every input OP. Also, change the channel order of RGB and BGR. Simple Channel Converter for ONNX.

NVIDIA container runtime

Official code for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR2022)

PyTorch implementation of the paper Dynamic Data Augmentation with Gating Networks

Deep Hedging Demo - An Example of Using Machine Learning for Derivative Pricing.