E2e music remastering system - End-to-end Music Remastering System Using Self-supervised and Adversarial Training

Last update: Dec 15, 2022

Overview

End-to-end Music Remastering System

This repository includes the source code and pre-trained models of the work "End-to-end Music Remastering System Using Self-supervised and Adversarial Training" by Junghyun Koo, Seungryeol Paik, and Kyogu Lee.

We provide inference code for the proposed system, which aims to alter the mastering style of a song to match that of a desired reference track.

Pre-trained Models

Model	Number of Epochs Trained	Details
Music Effects Encoder	1000	Trained with MTG-Jamendo Dataset
Mastering Cloner	1000	Trained with the above pre-trained Music Effects Encoder and Projection Discriminator

Inference

To run the inference code:

Download the pre-trained models above and place them in the folder named 'model_checkpoints' (default).
Prepare input and reference tracks in the folder named 'inference_samples' (default).
Target files should be organized as follows:

    "path_to_data_directory"/"song_name_#1"/input.wav
    "path_to_data_directory"/"song_name_#1"/reference.wav
    ...
    "path_to_data_directory"/"song_name_#n"/input.wav
    "path_to_data_directory"/"song_name_#n"/reference.wav

Run 'inference.py'

python inference.py \
    --ckpt_dir "path_to_checkpoint_directory" \
    --data_dir_test "path_to_directory_containing_inference_samples"

Outputs will be stored in the folder 'inference_samples' (default).
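Before running 'inference.py', it can help to confirm that every song folder follows the layout described above. The following is a minimal sketch, not part of the repository: the helper name `find_song_dirs` is hypothetical, and it only checks that "input.wav" and "reference.wav" exist in each sub-folder of the data directory.

```python
from pathlib import Path

# File names the README says each song folder must contain.
REQUIRED_FILES = ("input.wav", "reference.wav")


def find_song_dirs(data_dir):
    """Hypothetical helper: split song folders into ready and incomplete.

    Returns (ready, incomplete), where `ready` is a sorted list of folder
    names holding both required files, and `incomplete` maps a folder name
    to the list of required files it is missing.
    """
    ready, incomplete = [], {}
    song_dirs = sorted(p for p in Path(data_dir).iterdir() if p.is_dir())
    for song_dir in song_dirs:
        missing = [name for name in REQUIRED_FILES
                   if not (song_dir / name).is_file()]
        if missing:
            incomplete[song_dir.name] = missing
        else:
            ready.append(song_dir.name)
    return ready, incomplete


if __name__ == "__main__":
    # 'inference_samples' is the default data folder named in this README.
    ready, incomplete = find_song_dirs("inference_samples")
    print("ready:", ready)
    for name, missing in incomplete.items():
        print(f"{name}: missing {', '.join(missing)}")
```

Folders reported as incomplete are missing at least one of the two required files and should be fixed before running inference.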

Note: The system accepts stereo WAV files sampled at 44.1 kHz with 16-bit depth. Target files should be named "input.wav" and "reference.wav".
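The format requirements in the note can be checked ahead of time with Python's standard `wave` module. This is a sketch under stated assumptions rather than the repository's own validation: the function name `check_wav_format` is hypothetical, and "16-bit" is interpreted as a sample width of 2 bytes of PCM audio.

```python
import wave


def check_wav_format(path, channels=2, sample_rate=44100, sample_width_bytes=2):
    """Hypothetical helper: list the ways a WAV file deviates from the format.

    Defaults mirror the README note: stereo, 44.1 kHz, 16-bit (2 bytes per
    sample). An empty list means the file matches. Files that the standard
    `wave` module cannot parse (for example non-PCM WAV) raise wave.Error.
    """
    problems = []
    with wave.open(str(path), "rb") as wav:
        if wav.getnchannels() != channels:
            problems.append(
                f"expected {channels} channels, found {wav.getnchannels()}")
        if wav.getframerate() != sample_rate:
            problems.append(
                f"expected {sample_rate} Hz, found {wav.getframerate()} Hz")
        if wav.getsampwidth() != sample_width_bytes:
            problems.append(
                f"expected {8 * sample_width_bytes}-bit samples, "
                f"found {8 * wav.getsampwidth()}-bit")
    return problems


if __name__ == "__main__":
    import sys

    # Usage: python check_wav.py path/to/input.wav path/to/reference.wav
    for wav_path in sys.argv[1:]:
        issues = check_wav_format(wav_path)
        print(wav_path, "OK" if not issues else "; ".join(issues))
```

Files that fail the check would need to be converted with an audio tool of your choice before being passed to the system.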

Configurations of each sub-network

The detailed configuration of each sub-network can also be found in

Self_Supervised_Music_Remastering_System/configs.yaml

Owner

Junghyun (Tony) Koo
