Welcome to the comma.ai Calibration Challenge!

Your goal is to predict the direction of travel (in camera frame) from provided dashcam video.

This repo provides 10 videos. Each video is 1 minute long and recorded at 20 fps.
5 videos are labeled with a 2D array describing the direction of travel at every frame of the video with a pitch and yaw angle in radians.
5 videos are unlabeled. It is your task to generate the labels for them.
The example labels were generated using a neural network and confirmed with a SLAM algorithm.
You can estimate the focal length to be 910 pixels.
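
To make the role of the focal length concrete, the sketch below maps a pitch/yaw pair to the pixel where the direction of travel would appear in the image (its vanishing point), and back again, under a simple pinhole model. The axis and sign conventions, the principal point arguments `cx` and `cy`, and the helper names `vanishing_point` and `angles_from_vanishing_point` are assumptions made for illustration; this README does not specify them, so check them against the labeled examples before relying on them.

```python
import math

FOCAL_LENGTH_PX = 910.0  # focal length estimate given in this README


def vanishing_point(pitch, yaw, cx, cy, focal=FOCAL_LENGTH_PX):
    """Project the direction of travel onto the image plane (pinhole model).

    Assumed conventions (not stated in the README): camera x points right,
    y points down, z points forward; positive yaw turns right, positive
    pitch tilts up. (cx, cy) is the principal point in pixels.
    """
    # Unit direction vector in the assumed camera frame.
    x = math.cos(pitch) * math.sin(yaw)
    y = -math.sin(pitch)
    z = math.cos(pitch) * math.cos(yaw)
    # Perspective division, then scale by the focal length.
    u = cx + focal * x / z
    v = cy + focal * y / z
    return u, v


def angles_from_vanishing_point(u, v, cx, cy, focal=FOCAL_LENGTH_PX):
    """Inverse of vanishing_point under the same assumed conventions."""
    yaw = math.atan2(u - cx, focal)
    pitch = math.atan2(-(v - cy) * math.cos(yaw), focal)
    return pitch, yaw
```

With these conventions, zero pitch and yaw land exactly on the principal point, and small angles shift the point by roughly `focal * angle` pixels.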

Context

The devices that run openpilot are not mounted perfectly. The camera is not exactly aligned to the vehicle. There is some pitch and yaw angle between the camera of the device and the vehicle, which can vary between installations. Estimating these angles is essential for accurate control of the vehicle. The best way to start estimating these values is to predict the direction of motion in camera frame. More info can be found in this readme.

Deliverable

Your deliverable is the 5 label files, named 5.txt to 9.txt. Each file should contain a 2D array with the pitch and yaw angles of the direction of travel (in camera frame) for every frame of the respective video. Zip them up and e-mail them to [email protected].
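
As a rough sketch of producing such a file, the helpers below write one row per frame with pitch in the first column and yaw in the second, then read it back. The whitespace-separated text layout and the names `save_labels` and `load_labels` are assumptions; compare the output with the labeled example files in the repo before submitting.

```python
import numpy as np


def save_labels(path, pitch, yaw):
    """Write one (pitch, yaw) row per frame, in radians, as plain text.

    Assumes a whitespace-separated layout with pitch first; the README
    only says the labels are a 2D array of pitch and yaw.
    """
    labels = np.column_stack([pitch, yaw]).astype(float)
    if labels.ndim != 2 or labels.shape[1] != 2:
        raise ValueError(f"expected shape (n_frames, 2), got {labels.shape}")
    np.savetxt(path, labels)
    return labels


def load_labels(path):
    """Read a label file back; 'nan' entries are parsed as float NaN."""
    return np.loadtxt(path, ndmin=2)
```

A 1 minute video at 20 fps has about 1200 frames, so each file should have roughly that many rows.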

Evaluation

We will evaluate your mean squared error against our ground truth labels. Errors for frames where the car speed is less than 4 m/s will be ignored. Those frames are also labeled as NaN in the example labels.

This repo includes an eval script that will give an error score (lower is better). You can use it to test your solutions against the labeled examples. We will use this script to evaluate your solution.
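
For intuition only, here is a minimal sketch of a mean squared error that skips the frames labeled NaN. It is not the repo's eval script, which remains the reference. The function names are hypothetical, and expressing the error as a percentage of an all-zeros prediction is an assumption about how a percentage score could be obtained.

```python
import numpy as np


def masked_mse(ground_truth, prediction):
    """Mean squared error over frames whose ground truth has no NaN."""
    gt = np.asarray(ground_truth, dtype=float)
    pred = np.asarray(prediction, dtype=float)
    if gt.shape != pred.shape:
        raise ValueError(f"shape mismatch: {gt.shape} vs {pred.shape}")
    # Keep only frames where both pitch and yaw are labeled.
    valid = ~np.isnan(gt).any(axis=1)
    if not valid.any():
        raise ValueError("no labeled frames to evaluate")
    return float(np.mean((gt[valid] - pred[valid]) ** 2))


def percent_of_zero_baseline(ground_truth, prediction):
    """Error as a percentage of the error of predicting all zeros.

    Assumed normalization, chosen for illustration only.
    """
    gt = np.asarray(ground_truth, dtype=float)
    baseline = masked_mse(gt, np.zeros_like(gt))
    return 100.0 * masked_mse(gt, prediction) / baseline
```

Note that a NaN in your own prediction on a labeled frame makes the result NaN, so predictions should be filled in for every frame.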

Hints

Keep the goal and evaluation script in mind; creative solutions are allowed.
Look at plots of your solutions before submitting.

$500 Prize CLAIMED

The first submission that scores an error under 25% on the unlabeled set will receive a $500 prize.

Owner

comma.ai
