- South Africa
- https://orcid.org/0000-0002-8168-7857
Lists (1)
Sort Name ascending (A-Z)
Stars
🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation
Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.
Official implementation of Meta-StyleSpeech and StyleSpeech
Implementation code of non-parallel sequence-to-sequence VC
A repository for benchmarking neural vocoders by their quality and speed.
The official Implementation of PeriodWave and PeriodWave-Turbo
SubFix: Efficient Web-Based Audio Subtitle Editing and Multilingual Automatic Annotation Tool.
Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
A tool for automatic phoneme transcription
Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.
The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.
Full version of wav2lip-onnx including face alignment and face enhancement and more...
geneing / WaveRNN-Pytorch
Forked from G-Wang/WaveRNN-PytorchFatcord's Alternative WaveRNN (Faster training)
The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023
Implementation of Kaneko et al.'s MaskCycleGAN-VC model for non-parallel voice conversion.
MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.
Implementation of TTS model based on NVIDIA P-Flow TTS Paper
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
Solution for Zalo AI Challenge 2022 - Lyrics Alignment
Voice swapping with VQ-VAE and diffusion models
Sequence-to-sequence TTS based on Kyubyong's dc_tts
Jason Riggle's chart of phonological features in JSON format + extras
An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.