Large-scale open domain KNOwledge grounded conVERsation system based on PaddlePaddle

Last update: Dec 28, 2022

Related tags

Overview

Knover

Knover is a toolkit for knowledge grounded dialogue generation based on PaddlePaddle. Knover allows researchers and developers to carry out efficient training/inference of large-scale dialogue generation models.

What's New:

December 2021: We are opening the dialogue generation model of PLATO-XL, with up to 11 billion parameters.
October 2021: We are opening AG-DST, an amendable generation for dialogue state tracking.
February 2021: We are opening our implementation (Team 19) in DSTC9-Track1.
July 2020: We are opening PLATO-2, a large-scale generative model with latent space for open-domain dialogue systems.

Requirements and Installation

python version >= 3.7
paddlepaddle-gpu version >= 2.0.0
- You can install PaddlePaddle following the instructions.
- The specific version of PaddlePaddle is also based on your CUDA version (recommended version: 10.1) and CuDNN version (recommended version: 7.6). See more information on PaddlePaddle document about GPU support
sentencepiece
termcolor
If you want to run distributed training, you'll also need NCCL
Install Knover locally:

git clone https://github.com/PaddlePaddle/Knover.git
cd Knover
pip3 install -e .

Or you can setup PYTHONPATH only:

export PYTHONPATH=/abs/path/to/Knover:$PYTHONPATH

Basic usage

See usage document.

Disclaimer

This project aims to facilitate further research progress in dialogue generation. Baidu is not responsible for the 3rd party's generation with the pre-trained system.

Contact information

For help or issues using Knover, please submit a GitHub issue.

Large-scale open domain KNOwledge grounded conVERsation system based on PaddlePaddle

Related tags

Overview

Knover

What's New:

Requirements and Installation

Basic usage

Disclaimer

Contact information

Owner

TruthfulQA: Measuring How Models Imitate Human Falsehoods

Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation (SIGGRAPH Asia 2021)

Intent parsing and slot filling in PyTorch with seq2seq + attention

Code for lyric-section-to-comment generation based on huggingface transformers.

Russian GPT3 models.

VD-BERT: A Unified Vision and Dialog Transformer with BERT

用Resnet101+GPT搭建一个玩王者荣耀的AI

An automated program that helps customers of Pizza Palour place their pizza orders

Official code for Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset

source code for paper: WhiteningBERT: An Easy Unsupervised Sentence Embedding Approach.

A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP

Refactored version of FastSpeech2

Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context

Unsupervised Language Model Pre-training for French

自然言語で書かれた時間情報表現を抽出/規格化するルールベースの解析器

Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃

Pattern Matching in Python

Conditional probing: measuring usable information beyond a baseline

ChatBotProyect - This is an unfinished project about a simple chatbot.

Korean stereoypte detector with TUNiB-Electra and K-StereoSet