https://arxiv.org/abs/2102.11005

Last update: Dec 19, 2022

LogME

LogME: Practical Assessment of Pre-trained Models for Transfer Learning

How to use

Pass the features f (extracted by the pre-trained model) and the labels y to the function. It returns a score that correlates well with transfer learning performance.

from LogME import LogME
# f: features extracted by the pre-trained model; y: labels of the downstream task
score = LogME(f, y)

You can then use the score to quickly select a good pre-trained model: the larger the score, the better the expected transfer performance.
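
When several candidate models are available, the selection step can be sketched as below. This is a minimal illustration, not part of the LogME package: `rank_models` and `features_by_model` are hypothetical names, and `score_fn` stands for any function with the `score_fn(f, y)` call shape shown above, such as `LogME`.

```python
from typing import Any, Callable, Dict, List, Tuple


def rank_models(
    features_by_model: Dict[str, Any],
    labels: Any,
    score_fn: Callable[[Any, Any], float],
) -> List[Tuple[str, float]]:
    """Score every candidate model and return (name, score) pairs, best first.

    features_by_model maps a model name to the features that model produced
    for the downstream dataset; labels are shared by all models.
    """
    scores = {
        name: float(score_fn(features, labels))
        for name, features in features_by_model.items()
    }
    # A larger score suggests better transfer performance, so sort descending.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical usage, assuming f_a and f_b were extracted beforehand:
# ranking = rank_models({"model_a": f_a, "model_b": f_b}, y, LogME)
# best_name, best_score = ranking[0]
```

Only the top-ranked model (or the top few) then needs to be fine-tuned.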

Experimental results

We extensively validate the generality and performance of LogME on 14 pre-trained models and 17 downstream tasks, covering supervised and unsupervised pre-training, classification and regression tasks, and both vision and language modalities. See the paper for the full results.
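
One common way to quantify how well a score "indicates" transfer performance is rank correlation between the scores and the accuracies obtained by actually fine-tuning each model. The sketch below computes a plain Kendall's tau in pure Python. It is an illustration only: `kendall_tau` is a hypothetical helper, ties are simply counted as neither concordant nor discordant, and the paper's exact metric (which may be a weighted variant) should be taken from the paper itself.

```python
from itertools import combinations
from typing import Sequence


def kendall_tau(scores: Sequence[float], performances: Sequence[float]) -> float:
    """Kendall's tau (tau-a) between two equally long sequences.

    Each pair of models is concordant if the score and the fine-tuned
    performance order the two models the same way, discordant otherwise.
    """
    if len(scores) != len(performances):
        raise ValueError("scores and performances must have the same length")
    if len(scores) < 2:
        raise ValueError("need at least two models to compare rankings")

    concordant = 0
    discordant = 0
    for (s1, p1), (s2, p2) in combinations(zip(scores, performances), 2):
        agreement = (s1 - s2) * (p1 - p2)
        if agreement > 0:
            concordant += 1
        elif agreement < 0:
            discordant += 1

    n_pairs = len(scores) * (len(scores) - 1) // 2
    return (concordant - discordant) / n_pairs
```

A value of 1 means the score ranks the models exactly as fine-tuning does, 0 means no agreement, and -1 means the order is reversed.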

Computer vision

9 datasets and 10 pre-trained models. LogME is a reasonably good indicator for transfer performance.

NLP

7 tasks and 4 pre-trained models. LogME is a good indicator for transfer performance.

Speedup

LogME provides a dramatic speedup for assessing pre-trained models. The speedup comes from two aspects:

LogME does not need hyper-parameter tuning whereas vanilla fine-tuning requires extensive hyper-parameter tuning.
We designed a fast algorithm to further speed up the computation of LogME.
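
This page does not spell out the fast algorithm. As a rough illustration of where such savings can come from, the sketch below maximizes the evidence of a Bayesian linear regression from features to one target vector, using the textbook fixed-point updates for the prior precision `alpha` and the noise precision `beta`. The matrix F^T F is decomposed once, and every iteration then costs only a few vector operations. Everything here is an assumption made for illustration: `log_max_evidence`, its parameters, and the stopping rule are hypothetical and are not the repository's implementation. For classification, one would presumably apply it to each one-hot label column and average the results.

```python
import numpy as np


def log_max_evidence(f, y, max_iter=100, tol=1e-6):
    """Per-sample log evidence of y ~ N(f @ w, 1/beta) with w ~ N(0, 1/alpha),
    after maximizing over alpha and beta by fixed-point iteration."""
    f = np.asarray(f, dtype=np.float64)
    y = np.asarray(y, dtype=np.float64)
    n, d = f.shape

    # Decompose F^T F once; every iteration below reuses the result.
    sigma, v = np.linalg.eigh(f.T @ f)
    sigma = np.clip(sigma, 0.0, None)  # guard against tiny negative eigenvalues
    z = v.T @ (f.T @ y)                # F^T y expressed in the eigenbasis
    y2 = float(y @ y)

    def posterior_stats(alpha, beta):
        m = beta * z / (alpha + beta * sigma)  # posterior mean in the eigenbasis
        m2 = float(m @ m)
        # ||F m - y||^2 = m^T F^T F m - 2 m^T F^T y + y^T y
        res2 = float(np.sum(sigma * m * m) - 2.0 * (m @ z) + y2)
        return m2, max(res2, 1e-12)

    alpha, beta = 1.0, 1.0
    for _ in range(max_iter):
        m2, res2 = posterior_stats(alpha, beta)
        gamma = float(np.sum(beta * sigma / (alpha + beta * sigma)))
        new_alpha = gamma / (m2 + 1e-12)  # small constant avoids division by zero
        new_beta = (n - gamma) / res2
        # Stop once the ratio alpha / beta no longer changes noticeably.
        done = abs(new_alpha / new_beta - alpha / beta) <= tol * (alpha / beta)
        alpha, beta = new_alpha, new_beta
        if done:
            break

    m2, res2 = posterior_stats(alpha, beta)
    evidence = (
        0.5 * d * np.log(alpha)
        + 0.5 * n * np.log(beta)
        - 0.5 * np.sum(np.log(alpha + beta * sigma))
        - 0.5 * beta * res2
        - 0.5 * alpha * m2
        - 0.5 * n * np.log(2.0 * np.pi)
    )
    return float(evidence / n)
```

Features that explain the target well should receive a higher value than unrelated features of the same shape, and no learning rate or other training hyper-parameter is involved.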

Citation

If you find LogME useful, please cite the following paper:

@article{you_logme:_2021,
	title = {LogME: Practical Assessment of Pre-trained Models for Transfer Learning},
	author = {You, Kaichao and Liu, Yong and Long, Mingsheng and Wang, Jianmin},
	journal = {arxiv},
	volume = {abs/2102.11005},
	year = {2021},
	url = {https://arxiv.org/abs/2102.11005},
}

Contact

If you have any questions or want to use the code, please contact [email protected].

Owner

THUML: Machine Learning Group @ THSS
