GeDML is an easy-to-use generalized deep metric learning library

Last update: Dec 05, 2022

Overview

News

[2021-9-6]: v0.0.0 has been released.

Introduction

GeDML is an easy-to-use generalized deep metric learning library, which contains:

State-of-the-art DML algorithms: We contrain 18+ losses functions and 6+ sampling strategies, and divide these algorithms into three categories (i.e., collectors, selectors, and losses).
Bridge bewteen DML and SSL: We attempt to bridge the gap between deep metric learning and self-supervised learning through specially designed modules, such as collectors.
Auxiliary modules to assist in building: We also encapsulates the upper interface for users to start programs quickly and separates the codes and configs for managing hyper-parameters conveniently.

Installation

Pip

pip install gedml

Framework

This project is modular in design. The pipeline diagram is as follows:

Code structure

_debug: Debug files.
demo: Demos of configuration files.
docs: Documentation.
src: Source code.
- core: Losses, selectors, collectors, etc.
- client: Tmux manager.
- config: Config files including link, convert, assert and params.
- launcher: Manager, Trainer, Tester, etc.
- recorder: Recorder.

Method

Collectors

method	description
BaseCollector	Base class
DefaultCollector	Do nothing
ProxyCollector	Maintain a set of proxies
MoCoCollector	paper: Momentum Contrast for Unsupervised Visual Representation Learning
SimSiamCollector	paper: Exploring Simple Siamese Representation Learning
HDMLCollector	paper: Hardness-Aware Deep Metric Learning
DAMLCollector	paper: Deep Adversarial Metric Learning
DVMLCollector	paper: Deep Variational Metric Learning

Losses

classifier-based

method	description
CrossEntropyLoss	Cross entropy loss for unsupervised methods
LargeMarginSoftmaxLoss	paper: Large-Margin Softmax Loss for Convolutional Neural Networks
ArcFaceLoss	paper: ArcFace: Additive Angular Margin Loss for Deep Face Recognition
CosFaceLoss	paper: CosFace: Large Margin Cosine Loss for Deep Face Recognition

pair-based

method	description
ContrastiveLoss	paper: Learning a Similarity Metric Discriminatively, with Application to Face Verification
MarginLoss	paper: Sampling Matters in Deep Embedding Learning
TripletLoss	paper: Learning local feature descriptors with triplets and shallow convolutional neural networks
AngularLoss	paper: Deep Metric Learning with Angular Loss
CircleLoss	paper: Circle Loss: A Unified Perspective of Pair Similarity Optimization
FastAPLoss	paper: Deep Metric Learning to Rank
LiftedStructureLoss	paper: Deep Metric Learning via Lifted Structured Feature Embedding
MultiSimilarityLoss	paper: Multi-Similarity Loss With General Pair Weighting for Deep Metric Learning
NPairLoss	paper: Improved Deep Metric Learning with Multi-class N-pair Loss Objective
SignalToNoiseRatioLoss	paper: Signal-To-Noise Ratio: A Robust Distance Metric for Deep Metric Learning
PosPairLoss	paper: Exploring Simple Siamese Representation Learning

proxy-based

method	description
ProxyLoss	paper: No Fuss Distance Metric Learning Using Proxies
ProxyAnchorLoss	paper: Proxy Anchor Loss for Deep Metric Learning
SoftTripleLoss	paper: SoftTriple Loss: Deep Metric Learning Without Triplet Sampling

Selectors

method	description
BaseSelector	Base class
DefaultSelector	Do nothing
DenseTripletSelector	Select all triples
DensePairSelector	Select all pairs

Quickstart

Please set the environment variable WORKSPACE first to indicate where to manage your project.

Initialization

Use ConfigHandler to create all objects.

config_handler = ConfigHandler()
config_handler.get_params_dict()
objects_dict = config_handler.create_all()

Start

Use manager to automatically call trainer and tester.

manager = utils.get_default(objects_dict, "managers")
manager.run()

Directly use trainer and tester.

trainer = utils.get_default(objects_dict, "trainers")
tester = utils.get_default(objects_dict, "testers")
recorder = utils.get_default(objects_dict, "recorders")

# start to train
utils.func_params_mediator(
    [objects_dict],
    trainer.__call__
)

# start to test
metrics = utils.func_params_mediator(
    [
        {"recorders": recorder},
        objects_dict,
    ],
    tester.__call__
)

Document

For more information, please refer to:

📖 👉 Docs

Some specific guidances:

Configs

We will continually update the optimal parameters of different configs in TsinghuaCloud

Code Reference

TODO:

assert parameters
distributed methods and Non-distributed methods!!!
write github action to automate unit-test, package publish and docs building.
add cross-validation splits protocol.

GeDML is an easy-to-use generalized deep metric learning library

Related tags

Overview

News

Introduction

Installation

Pip

Framework

Code structure

Method

Collectors

Losses

classifier-based

pair-based

proxy-based

Selectors

Quickstart

Initialization

Start

Document

Configs

Code Reference

TODO:

Owner

Borui Zhang

Phy-Q: A Benchmark for Physical Reasoning

A framework for GPU based high-performance medical image processing and visualization

3D detection and tracking viewer (visualization) for kitti & waymo dataset

Adversarial Attacks are Reversible via Natural Supervision

TabNet for fastai

This repo provides the official code for TransBTS: Multimodal Brain Tumor Segmentation Using Transformer (https://arxiv.org/pdf/2103.04430.pdf).

Sinkformers: Transformers with Doubly Stochastic Attention

A Marvelous ChatBot implement using PyTorch.

A Light in the Dark: Deep Learning Practices for Industrial Computer Vision

A simple software for capturing human body movements using the Kinect camera.

This repo holds codes of the ICCV21 paper: Visual Alignment Constraint for Continuous Sign Language Recognition.

WiFi-based Multi-task Sensing

Generate images from texts. In Russian

Differentiable Annealed Importance Sampling (DAIS)

Code repo for "RBSRICNN: Raw Burst Super-Resolution through Iterative Convolutional Neural Network" (Machine Learning and the Physical Sciences workshop in NeurIPS 2021).

A deep-learning pipeline for segmentation of ambiguous microscopic images.

Official repository for Jia, Raghunathan, Göksel, and Liang, "Certified Robustness to Adversarial Word Substitutions" (EMNLP 2019)

Search Youtube Video and Get Video info

NALSM: Neuron-Astrocyte Liquid State Machine

Pytorch implementation of Decoupled Spatial-Temporal Transformer for Video Inpainting