music-ai

Deep learning transformer model that generates unique music sequences.

Abstract

In 2017, a new state-of-the-art was published for natural language processing: the Transformer. Relying solely on attention mechanisms, the Transformer outperformed existing solutions based on recurrent and convolutional neural networks¹. However, recurrent neural networks, long short-term memory, and gated recurrent neural networks remain dominant in the field of generative music. I aim to introduce the Transformer into the field of music, with the goal of teaching the deep learning model to predict the second half of a composition given the first half. A Transformer equipped with 32 attention heads and sinusoidal positional encoding was trained on the Nottingham MIDI dataset for 5000 epochs over a period of 48 hours, optimized by stochastic gradient descent and measured with cross entropy loss, and regulated by an exponential learning rate decrease schedule. For the first thousand epochs, the model had noticeable improvement but lacked arrangement to the generated sequences. By five thousand epochs, the model clearly demonstrated the knowledge of general music trends used to better predict how classical composers write their pieces, and most tracks were melodic to the human ear. Future applications of this technique include generating tracks for various instruments, rating the quality of existing music tracks, and complete originality if combined with a generative network mapping melodies to latent space.

¹ Attention Is All You Need

Video

Hardware

Ubuntu

32 GB RAM
Intel Core i3-4170 CPU @3.70 GHz x4 (4 GB RAM)
NVIDIA GeForce GTX 1050 Ti

Deep learning transformer model that generates unique music sequences.

Related tags

Overview

music-ai

Abstract

Video

Hardware

Owner

xacer

Analyze, visualize and process sound field data recorded by spherical microphone arrays.

Bot duniya Music Player

Telegram Bot to play music in VoiceChat with Channel Support and autostarts Radio.

Simple, hackable offline speech to text - using the VOSK-API.

This is a realtime voice translator program which gets input from user at any language and converts it to the desired language that the user asks

XA Music Player - Telegram Music Bot

Accompanying code for our paper "Point Cloud Audio Processing"

Using python to generate a bat script of repetitive lines of code that differ in some way but can sort out a group of audio files according to their common names

A simple voice detection system which can be applied practically for designing a device with capability to detect a baby’s cry and automatically turning on music

A Simple Script that will help you to Play / Change Songs with just your Voice

Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch

A telegram bot for which is help to play songs in vc 🥰 give 🌟 and fork this repo before use 😏

Reading list for research topics in sound event detection

𝙰 𝙼𝚞𝚜𝚒𝚌 𝙱𝚘𝚝 𝙲𝚛𝚎𝚊𝚝𝚎𝚍 𝙱𝚢 𝚃𝚎𝚊𝚖𝙳𝚕𝚝 💖

A library for augmenting annotated audio data

A Python wrapper for the high-quality vocoder "World"

convert-to-opus-cli is a Python CLI program for converting audio files to opus audio format.

Sequencer: Deep LSTM for Image Classification

A GUI-based audio player with support for a large variety of formats

A fast MDCT implementation using SciPy and FFTs