This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 13,042 2,829 Updated Jun 22, 2025

espnet / espnet

End-to-End Speech Processing Toolkit

Python 9,858 2,408 Updated Jun 15, 2026

kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 15,413 5,357 Updated Sep 22, 2025

chrisdonahue / wavegan

WaveGAN: Learn to synthesize raw audio with generative adversarial networks

Python 1,383 282 Updated Nov 27, 2022

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,230 6,680 Updated Sep 30, 2025

gentaiscool / end2end-asr-pytorch

End-to-End Automatic Speech Recognition on PyTorch

Python 304 62 Updated Jun 2, 2022

flashlight / wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

C++ 6,444 992 Updated Jan 12, 2026

syhw / wer_are_we

Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.

1,862 225 Updated Jun 27, 2022

SeanNaren / deepspeech.pytorch

Speech Recognition using DeepSpeech2.

Python 2,135 620 Updated Dec 13, 2022

mkotha / WaveRNN

A WaveRNN implementation

Python 201 46 Updated Oct 14, 2019

mmorise / World

A high-quality speech analysis, manipulation and synthesis system

C++ 1,321 265 Updated Feb 18, 2026

swasun / VQ-VAE-Speech

PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]

Python 271 54 Updated Aug 13, 2019

holoviz / datashader

Quickly and accurately render even the largest data.

Python 3,551 378 Updated Jun 4, 2026

r9y9 / wavenet_vocoder

WaveNet vocoder

Python 2,371 493 Updated Jul 29, 2023

openai / glow

Code for reproducing results in "Glow: Generative Flow with Invertible 1x1 Convolutions"

Python 3,184 525 Updated Jul 23, 2024

xiph / rnnoise

Recurrent neural network for audio noise reduction

C 5,650 1,059 Updated Feb 22, 2025

xiph / LPCNet

Efficient neural speech synthesis

C 1,213 306 Updated Sep 21, 2024

NVIDIA / waveglow

A Flow-based Generative Network for Speech Synthesis

Python 2,337 536 Updated Oct 19, 2023

npuichigo / waveglow

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

Python 205 35 Updated Nov 6, 2018

ksw0306 / FloWaveNet

A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"

Python 490 108 Updated Apr 23, 2019

ifnspaml / Enhancement-Coded-Speech

MATLAB 24 14 Updated Apr 25, 2022

JeremyCCHsu / vqvae-speech

Tensorflow implementation of the speech model described in Neural Discrete Representation Learning (a.k.a. VQ-VAE)

Python 129 31 Updated Jul 26, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sen Li triwoods

Block or report triwoods

Stars

DonkeyHang / pulseaudio_speech_enhancement

rosinality / vq-vae-2-pytorch

facebookresearch / encodec

santi-pdp / segan

xiph / opus

jagger2048 / WebRtc_noise_suppression

kuleshov / audio-super-res

lucidrains / vector-quantize-pytorch

Rudrabha / Wav2Lip