🗣️ Enable text-to-speech with Qwen TTS, a simple API solution that seamlessly integrates into your applications using Docker and Home Assistant.
Updated Mar 31, 2026 - Python
🔊 Convert text into MP3 audio files quickly, enhancing content, accessibility, and automation with clear, spoken audio.
🗣️ Deploy high-quality text-to-speech services with Gemini, OpenAI, and Microsoft Azure TTS on your own platform easily.
Generate AI-voiced short videos with synced subtitles using Python, ElevenLabs TTS, and FFmpeg for clear, automated social media and educational content.
Unofficial PyTorch Lightning implementation of "Real-time Speech Frequency Bandwidth Extension"
Zero-shot real-time TTS system, fully offline, free, and open source
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
MelGAN with the Catalyst framework
MelGAN multi-GPU implementation
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
Unofficial implementation of Multi-band MelGAN
A neural network (GAN) trained to apply metal screaming effects, turning vocals from songs, speeches or whispers into realistic screams and growls.
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)