-
chatterbox Public
Forked from resemble-ai/chatterboxSoTA open-source TTS
-
Tortoise TTS is a high-quality text-to-speech model with voice cloning capabilities
Python MIT License UpdatedDec 1, 2025 -
TTS-WebUI Public
A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, …
-
Audiocraft provides MusicGen and MAGNeT models for high-quality music and audio generation
Python MIT License UpdatedNov 23, 2025 -
-
tts_webui_extension.bark Public
Bark: A text-to-speech model
Python MIT License UpdatedNov 21, 2025 -
-
stable-audio-tools Public
Forked from Stability-AI/stable-audio-toolsGenerative models for conditional audio generation
-
audiotools Public
Forked from descriptinc/audiotoolsObject-oriented handling of audio data, with GPU-powered augmentations, and more.
Python MIT License UpdatedNov 13, 2025 -
StyleTTS2 is a text-to-speech model that generates high-quality speech with controllable style
Python MIT License UpdatedNov 13, 2025 -
StyleTTS2 Public
Forked from sidharthrajaram/StyleTTS2🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
Python Other UpdatedNov 13, 2025 -
audiocraft Public
Forked from facebookresearch/audiocraftAudiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Python MIT License UpdatedNov 13, 2025 -
tts_webui_extension.vall_e_x Public
Vall-E-X: Multilingual text-to-speech model supporting English, Chinese, and Japanese
-
VALL-E-X Public
Forked from Plachtaa/VALL-E-XAn open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
Python MIT License UpdatedNov 13, 2025 -
tts_webui_extension.rvc Public
RVC: Retrieval-based Voice Conversion
-
Retrieval-based-Voice-Conversion Public
Forked from RVC-Project/Retrieval-based-Voice-Conversionin preparation...
-
-
-
-
-
-
-
-
tts_webui_extension.maha_tts Public
Maha TTS allows generating speech from text using the MahaTTS model.
Python MIT License UpdatedNov 1, 2025 -
-
tts_webui_extension.xtts_rvc_ui Public
Forked from Vali-98/XTTS-RVC-UIA Gradio UI for XTTSv2 and RVC.
Python MIT License UpdatedOct 31, 2025 -
-
-
tts_webui_extension.vocos Public
Vocos is a neural audio codec for high-quality audio compression and reconstruction
-
Stable Audio is a text-to-audio model for generating high-quality music and sound effects
Python MIT License UpdatedOct 31, 2025