10th place solution for Google Smartphone Decimeter Challenge at kaggle.

Last update: Oct 25, 2022

Overview

Under refactoring

Global Navigation Satellite System (GNSS) provides raw signals, which the GPS chipset uses to compute a position.
Current mobile phones only offer 3-5 meters of positioning accuracy. While useful in many cases,
it can create a “jumpy” experience. For many use cases the results are neither fine nor stable enough to be reliable.

This competition, hosted by the Android GPS team, is being presented at the ION GNSS+ 2021 Conference.
They seek to advance research in smartphone GNSS positioning accuracy
and help people better navigate the world around them.

In this competition, you'll use data collected from the host team’s own Android phones
to compute location down to decimeter or even centimeter resolution, if possible.
You'll have access to precise ground truth, raw GPS measurements,
and assistance data from nearby GPS stations, in order to train and test your submissions.

Predictions with the host baseline are really good for the highway area (upper figure), but noisy for the downtown area (lower figure) due to multipath effects.

Overview

Predicting the noise, Noise = Ground Truth - Baseline, like denoising in computer vision.
Using the speed, (latDeg(t + dt) - latDeg(t)) / dt, as input instead of the absolute position to prevent overfitting on the train dataset.
Making a 2D image input with the Short Time Fourier Transform (STFT), and then using ImageNet-pretrained convolutional neural networks.
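
The first two ideas above can be sketched as follows. This is a minimal illustration, not the repository's code: the helper name `speed` and all numeric values are hypothetical, and the "model output" is replaced by the true noise just to show how the prediction is added back to the baseline.

```python
import numpy as np

def speed(deg, t):
    """Finite-difference speed: (deg(t + dt) - deg(t)) / dt."""
    return np.diff(deg) / np.diff(t)

# Hypothetical toy values (degrees and seconds), not taken from the dataset.
t = np.array([0.0, 1.0, 2.0, 3.0])
baseline_lat = np.array([37.0000, 37.0002, 37.0003, 37.0007])
truth_lat = np.array([37.0000, 37.0001, 37.0003, 37.0006])

# Regression target: Noise = Ground Truth - Baseline.
noise = truth_lat - baseline_lat

# Model input: speed of the baseline, which does not depend on absolute position.
lat_speed = speed(baseline_lat, t)

# At inference time the predicted noise is added back to the baseline.
predicted_noise = noise  # stand-in for a real model output
corrected_lat = baseline_lat + predicted_noise
```

Because the input is a difference of consecutive positions, shifting a whole trajectory by a constant offset leaves it unchanged, which is presumably why it helps against memorizing train locations.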

STFT and Conv Network Part

Input: STFT of both latDeg and lngDeg speeds, generated with librosa.
- Each phone sequence is split into 256-second sequences, then transformed by STFT with n_fft=256, hop_length=1 and win_length=16, resulting in a (256, 127, 2) feature for each degree. The following 2D images are generated from a 1D sequence.
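
To make the transform concrete, here is a NumPy-only sketch of a centered STFT with the parameters listed above. The solution itself uses librosa, whose `librosa.stft` accepts `n_fft`, `hop_length` and `win_length`; this re-implementation, the zero padding at the edges, the function name `stft` and the random input are assumptions made only for illustration.

```python
import numpy as np

def stft(x, n_fft=256, hop_length=1, win_length=16):
    """Centered STFT: Hann window of win_length, zero-padded to n_fft."""
    n = np.arange(win_length)
    window = 0.5 - 0.5 * np.cos(2.0 * np.pi * n / win_length)  # periodic Hann
    left = (n_fft - win_length) // 2
    window = np.pad(window, (left, n_fft - win_length - left))
    padded = np.pad(x, n_fft // 2)  # zero padding so frames are centered
    n_frames = 1 + (len(padded) - n_fft) // hop_length
    frames = np.stack(
        [padded[i * hop_length : i * hop_length + n_fft] for i in range(n_frames)],
        axis=1,
    )
    return np.fft.rfft(frames * window[:, None], axis=0)

rng = np.random.default_rng(0)
lat_speed = rng.normal(size=256)  # hypothetical 256-second speed sequence
lng_speed = rng.normal(size=256)

# Stack the magnitudes of both channels into one image-like array.
image = np.stack([np.abs(stft(lat_speed)), np.abs(stft(lng_speed))], axis=-1)
print(image.shape)
```

The raw output here has 129 frequency bins and 257 frames per channel. How the solution crops or arranges this into the (256, 127, 2) feature is not described in this README, so the sketch stops at the raw spectrogram.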

Model: Regression and Segmentation
- Regression: EfficientNet B3, predicts latDeg&lngDeg noise.
- Segmentation: UNet++ with an EfficientNet encoder (Segmentation Models PyTorch), predicts STFT noise.
  - segmentation prediction + input STFT -> inverse STFT -> prediction of latDeg&lngDeg speeds
  - this speed prediction was used for:
    1. Low speed mask: points in a low-speed area are replaced with their median.
    2. Speed disagreement mask: if the speed from the position prediction and this speed prediction differ a lot, such points are removed and interpolated.
  - prediction example for the segmentation.
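
The two masks can be sketched roughly as below. This is an interpretation under stated assumptions, not the repository's implementation: "low-speed area" is taken to mean a run of consecutive low-speed points, the thresholds are hypothetical, and the function names are made up for this example.

```python
import numpy as np

def low_speed_mask(pos, speed, threshold):
    """Replace every run of consecutive low-speed points with the run's median."""
    out = pos.copy()
    low = np.abs(speed) < threshold
    start = None
    for i, flag in enumerate(np.append(low, False)):  # sentinel closes the last run
        if flag and start is None:
            start = i
        elif not flag and start is not None:
            out[start:i] = np.median(pos[start:i])
            start = None
    return out

def disagreement_mask(pos, speed_from_pos, speed_pred, threshold):
    """Drop points where the two speed estimates disagree, then interpolate."""
    keep = np.abs(speed_from_pos - speed_pred) <= threshold
    idx = np.arange(len(pos))
    return np.interp(idx, idx[keep], pos[keep])

# Hypothetical 1D example: the vehicle stops around position 2.0.
pos = np.array([0.0, 1.0, 2.0, 2.1, 1.9, 2.0, 3.0])
speed_pred = np.array([1.0, 1.0, 0.0, 0.0, 0.0, 1.0, 1.0])
stopped = low_speed_mask(pos, speed_pred, threshold=0.5)

# Hypothetical outlier at index 2 that the predicted speed does not support.
noisy = np.array([0.0, 1.0, 10.0, 3.0, 4.0])
speed_from_pos = np.array([1.0, 1.0, 9.0, 1.0, 1.0])
repaired = disagreement_mask(noisy, speed_from_pos, np.ones(5), threshold=2.0)
```

In the example, the jitter while stopped collapses to a single position, and the outlier is replaced by a linear interpolation between its neighbours.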

LightGBM Part

Input: IMU data excluding magnetic field features
- also excluding y acceleration and z gyro because of the phone mounting condition
- adding moving averages as additional features, window_size=5, 15, 45
Predicts latDeg&lngDeg noise
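
A minimal sketch of the moving-average features follows. The feature names (`acc_x`, `gyro_y`), the centered window, and the edge handling are assumptions for illustration; the README only states the window sizes.

```python
import numpy as np

def moving_average(x, window):
    """Centered moving average; edges average over the samples available."""
    kernel = np.ones(window)
    total = np.convolve(x, kernel, mode="same")
    count = np.convolve(np.ones_like(x), kernel, mode="same")
    return total / count

def add_moving_averages(features, windows=(5, 15, 45)):
    """Return each raw feature plus one smoothed copy per window size."""
    out = dict(features)
    for name, values in features.items():
        for w in windows:
            out[f"{name}_ma{w}"] = moving_average(values, w)
    return out

rng = np.random.default_rng(0)
# Hypothetical IMU channels, 100 samples each.
imu = {"acc_x": rng.normal(size=100), "gyro_y": rng.normal(size=100)}
table = add_moving_averages(imu)
print(sorted(table))
```

The resulting columns would then be passed to the gradient boosting model together with the raw channels.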

KNN at downtown Part

Similar to Snap to Grid, but using both global and local features. The local re-ranking idea comes from the host baseline of GLR2021.

Use train ground truth as database
Global search: query(latDeg&lngDeg) -> find 10 candidates
Local re-ranking: query(latDeg&lngDeg speeds and its moving averages) -> find 3 candidates -> taking mean over candidates
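
The two-stage search above can be sketched with brute-force distances. This is an illustrative assumption rather than the repository's code: plain Euclidean distance, the function name `knn_correct`, and the toy database are all hypothetical.

```python
import numpy as np

def knn_correct(query_pos, query_local, db_pos, db_local, n_global=10, n_local=3):
    """Global search on position, then local re-ranking on speed features."""
    # Global search: nearest database points by position.
    d_global = np.linalg.norm(db_pos - query_pos, axis=1)
    candidates = np.argsort(d_global)[:n_global]
    # Local re-ranking: among the candidates, nearest by local features.
    d_local = np.linalg.norm(db_local[candidates] - query_local, axis=1)
    best = candidates[np.argsort(d_local)[:n_local]]
    # Final estimate: mean ground-truth position of the re-ranked candidates.
    return db_pos[best].mean(axis=0)

# Hypothetical database: 20 ground-truth points on a straight road,
# each with one local feature (e.g. a speed value).
db_pos = np.column_stack([np.arange(20.0), np.zeros(20)])
db_local = 0.1 * np.arange(20.0)[:, None]

estimate = knn_correct(
    query_pos=np.array([5.3, 0.2]),   # noisy predicted position
    query_local=np.array([0.5]),      # local feature of the query
    db_pos=db_pos,
    db_local=db_local,
)
```

Here the query is pulled back onto the road: the three re-ranked candidates are the database points 4, 5 and 6, so the estimate is their mean position.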

Public Post Process Part

There are lots of nice and effective PPs in public notebooks. Thanks to all the authors. I used the following notebooks.

score

Each idea was checked with late submissions.
Actually, the conv position prediction part was implemented near the deadline; before that I used only the segmentation model on the STFT images.

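| status | Host baseline + Public PP | conv position pred | gbm | speed mask | knn global | knn local | Private Board Score |
|---|---|---|---|---|---|---|---|
| 1 day before deadline | ✓ | | | ✓ | ✓ | ✓ | 3.07323 |
| 10 hours before deadline | ✓ | | ✓ | ✓ | ✓ | ✓ | 2.80185 |
| my best submission | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | 2.61693 |
| late sub | | | | | | | 5.423 |
| late sub | ✓ | | | | | | 3.61910 |
| late sub | ✓ | ✓ | | | | | 3.28516 |
| late sub | ✓ | ✓ | ✓ | | | | 3.19016 |
| late sub | ✓ | ✓ | ✓ | ✓ | | | 2.81074 |
| late sub | ✓ | ✓ | ✓ | ✓ | ✓ | | 2.66377 |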
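
To read the table as an ablation, the gain of each added component can be computed from the cumulative rows. The scores are copied from the table; the step labels are shorthand introduced here, and the last step uses the best submission, which is the only row with knn local added on top of everything else.

```python
# Private scores of the cumulative configurations, copied from the table above.
steps = [
    ("nothing enabled", 5.423),
    ("+ host baseline + public PP", 3.61910),
    ("+ conv position pred", 3.28516),
    ("+ gbm", 3.19016),
    ("+ speed mask", 2.81074),
    ("+ knn global", 2.66377),
    ("+ knn local (best submission)", 2.61693),
]

# Improvement contributed by each step (lower score is better).
gains = [
    (name, round(prev_score - score, 5))
    for (_, prev_score), (name, score) in zip(steps, steps[1:])
]

for name, gain in gains:
    print(f"{name}: -{gain:.5f}")
```

After the baseline with public post-processing, the largest single gains come from the speed mask (0.37942) and the conv position prediction (0.33394).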

How to run

environment

Ubuntu 18.04
Python with Anaconda
NVIDIA GPUx1

Data Preparation

First, download the data, here, and then place it as shown below.

../input/
    └ google-smartphone-decimeter-challenge/

During a run, temporary cache files are stored under ../data/ and outputs are stored under ../working/ via Hydra.

Code & Package Installation

# clone project
git clone https://github.com/Fkaneko/kaggle_Google_Smartphone_Decimeter_Challenge

# install project
cd kaggle_Google_Smartphone_Decimeter_Challenge
conda create -n gsdc_conv python==3.8.0
yes | bash install.sh
# in my case I needed an additional run of `yes | bash install.sh` to complete the installation.

Training/Testing

3 different models

For conv training, run python train.py on each branch. Please check src/config/config.yaml for the training configuration.
For LightGBM position, you need to run mv ./src/notebook/lightgbm_position_prediction.ipynb ./ and then start Jupyter Notebook.

| model | branch | training | test |
|---|---|---|---|
| conv stft segmentation | main | ./train.py | ./test.py |
| conv position | conv_position | ./train.py | ./test.py |
| LightGBM position | main | ./src/notebook/lightgbm_position_prediction.ipynb | included in the training notebook |

Testing

10th place solution trained weights

I've uploaded the pretrained weights as a Kaggle dataset, here. Extract it into ./ so that ./model_weights appears. Then running python test.py yields submission.csv. This csv scores ~2.61 on the Kaggle private leaderboard, which corresponds to 10th place.

your trained weights

For conv stft segmentation, please change the paths in the config, src/config/test_weights/compe_sub_github.yaml, and then run the following.

# at main branch
python test.py  \
     conv_pred_path="your conv position prediction csv path"\
     gbm_pred_path="your lightgbm position prediction path"

Regarding conv_pred_path and gbm_pred_path, you need to create each prediction csv following the table above before running this code. Alternatively, you can use my prediction results, which are in the same Kaggle dataset as the pretrained weights.

License

Code

Apache 2.0

Dataset

Please check the kaggle page -> https://www.kaggle.com/c/google-smartphone-decimeter-challenge/rules

pretrained weights

These trained weights were trained starting from ImageNet pretrained weights, so please check the ImageNet license if you use them for a serious case.
