Stars
Real-time speech enhancement for a better workflow. PyTorch deep learning model deployed as a Flask application on GCP.
Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Speech Enhancement Generative Adversarial Network in TensorFlow
A demo of the WebRTC noise suppression module.
Audio super resolution using neural networks
Vector (and Scalar) Quantization, in PyTorch
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, try Sync Labs.
kaldi-asr/kaldi is the official location of the Kaldi project.
WaveGAN: Learn to synthesize raw audio with generative adversarial networks
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
End-to-End Automatic Speech Recognition on PyTorch
Facebook AI Research's Automatic Speech Recognition Toolkit
An attempt at tracking the state of the art and recent results (bibliography) on speech recognition.
Speech Recognition using DeepSpeech2.
A high-quality speech analysis, manipulation and synthesis system
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
Quickly and accurately render even the largest data.
Code for reproducing results in "Glow: Generative Flow with Invertible 1x1 Convolutions"
Recurrent neural network for audio noise reduction
A Flow-based Generative Network for Speech Synthesis
A PyTorch implementation of "WaveGlow: A Flow-based Generative Network for Speech Synthesis"
A PyTorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"
TensorFlow implementation of the speech model described in "Neural Discrete Representation Learning" (a.k.a. VQ-VAE)