Stars
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.
End to end text to speech system using gruut and onnx
Local version of Deforum Stable Diffusion, supports txt settings file input and animation features!
Grapheme to phoneme conversion with deep learning.
A tokenizer, text cleaner, and phonemizer for many human languages.
ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques …
Some simple wrappers around eSpeak NG intended to make using this excellent TTS for waveform and IPA generation as convenient as possible.
Uses ctypes and libespeak-ng to transform test into IPA phonemes
An implementation of GlowTTS designed to work with Gruut
Transform audio files into mel spectrograms for text-to-speech model training
Flexible tool for assigning integer ids to phonemes