A PyTorch implementation of WaveGlow: A Flow-based Generative Network for Speech Synthesis

Last update: Jul 14, 2022

Overview

WaveGlow

Quick Start:

Install requirements:

pip install -r requirements.txt

Download dataset:

wget http://festvox.org/cmu_arctic/cmu_arctic/packed/cmu_us_slt_arctic-0.95-release.tar.bz2
tar xf cmu_us_slt_arctic-0.95-release.tar.bz2
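After unpacking, a quick sanity check of the audio can catch a truncated download before feature extraction. The sketch below uses only the Python standard library. The helper name `summarize_wavs` is hypothetical, and the directory `cmu_us_slt_arctic/wav` in the usage note is an assumption about where the archive places its recordings.

```python
import wave
from pathlib import Path


def summarize_wavs(wav_dir):
    """Return (file name, sample rate in Hz, duration in seconds) per .wav file."""
    rows = []
    for path in sorted(Path(wav_dir).glob("*.wav")):
        with wave.open(str(path), "rb") as wf:
            rate = wf.getframerate()
            seconds = wf.getnframes() / rate
        rows.append((path.name, rate, seconds))
    return rows
```

For example, `summarize_wavs("cmu_us_slt_arctic/wav")[:5]` would list the first five recordings, assuming that directory exists.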

Extract features: the feature extraction pipeline is the same as Tacotron's.

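This README does not say which Tacotron implementation it follows, so the following is only a rough sketch of three scalar steps that open-source Tacotron pipelines commonly apply around the STFT and mel filterbank: pre-emphasis, amplitude-to-decibel conversion, and normalization to [0, 1]. The function names and the default values (`coef=0.97`, `floor=1e-5`, `min_level_db=-100.0`) are assumptions, not values taken from this repository. The STFT and mel filterbank themselves are omitted.

```python
import math


def preemphasis(samples, coef=0.97):
    """Apply y[n] = x[n] - coef * x[n-1], which boosts high frequencies."""
    if not samples:
        return []
    out = [samples[0]]
    for prev, cur in zip(samples, samples[1:]):
        out.append(cur - coef * prev)
    return out


def amp_to_db(magnitude, floor=1e-5):
    """Convert a linear magnitude to decibels, flooring tiny values."""
    return 20.0 * math.log10(max(floor, magnitude))


def normalize_db(level_db, min_level_db=-100.0):
    """Map [min_level_db, 0] dB onto [0, 1], clipping values outside."""
    scaled = (level_db - min_level_db) / -min_level_db
    return min(1.0, max(0.0, scaled))
```

In a full pipeline these would be applied per sample (pre-emphasis) and per mel bin (decibel conversion and normalization); check the repository's own hyperparameters before relying on these defaults.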
Train with default hyperparameters:

python train.py

Synthesize from model:

python generate.py --checkpoint=/path/to/model --local_condition_file=/path/to/local_condition
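As background for what synthesis does: in the WaveGlow paper, audio is generated by drawing Gaussian noise and running it backwards through a stack of invertible layers conditioned on the mel spectrogram. The toy below illustrates a single affine coupling layer in pure Python and is not this repository's code. `toy_conditioner` is a hypothetical stand-in for the network that predicts the scale and shift, and `sigma=0.6` is an assumed sampling temperature. The paper stacks several such layers, interleaved with invertible 1x1 convolutions.

```python
import math
import random


def toy_conditioner(x_a, cond):
    """Hypothetical stand-in for the network predicting (log_s, t).

    It can be any function of x_a and cond; it is never inverted itself.
    """
    log_s = [0.1 * math.tanh(a + c) for a, c in zip(x_a, cond)]
    t = [0.5 * a - c for a, c in zip(x_a, cond)]
    return log_s, t


def coupling_forward(x, cond):
    """Audio-to-latent direction: x_a passes through, x_b is scaled and shifted."""
    half = len(x) // 2
    x_a, x_b = x[:half], x[half:]
    log_s, t = toy_conditioner(x_a, cond)
    z_b = [math.exp(ls) * b + ti for ls, b, ti in zip(log_s, x_b, t)]
    return x_a + z_b


def coupling_inverse(z, cond):
    """Latent-to-audio direction: undo the shift, then undo the scale."""
    half = len(z) // 2
    z_a, z_b = z[:half], z[half:]
    log_s, t = toy_conditioner(z_a, cond)
    x_b = [(b - ti) * math.exp(-ls) for ls, b, ti in zip(log_s, z_b, t)]
    return z_a + x_b


def sample(cond, sigma=0.6, seed=0):
    """Draw Gaussian noise and map it through the inverse layer."""
    rng = random.Random(seed)
    z = [rng.gauss(0.0, sigma) for _ in range(2 * len(cond))]
    return coupling_inverse(z, cond)
```

Because the first half of the input passes through unchanged, the inverse can recompute exactly the same scale and shift, which is why the conditioner itself never has to be invertible.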

Notes:

This is not an official implementation; some details may be incorrect.
Work in progress.

Owner

Yuchao Zhang

speech synthesis/machine learning
