Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
-
Updated
May 25, 2026 - Python
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
AIGCPanel 是一个简单易用的一站式AI数字人系统,支持视频合成、声音合成、声音克隆,简化本地模型管理、一键导入和使用AI模型。
A single Gradio + React WebUI with extensions for ACE-Step, OmniVoice, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
开箱即用的本地私有化部署语音服务,快速搭建Qwen3ASR/FunASR与CosyVoice2/3后端
🎙️ CosyVoice All-in-One Docker - Production-ready TTS with Web UI, REST API & Voice Cloning
将 ZipEnhancer 模型从 ModelScope pipeline 中剥离,用纯 PyTorch 重新实现推理流程,并封装为高性能 FastAPI 批量语音降噪服务,为 CosyVoice 提供干净的人声输入
LightTTS is a lightweight TTS inference framework optimized for CosyVoice2 and CosyVoice3, enabling fast and scalable speech synthesis in Python and supports stream and bistream modes.
Home Assistant integrates Alibaba Cloud's BaiLian Platform TTS
Self-hosted text-to-speech platform with multi-backend support, voice cloning, and a modern web UI.
CosyVoice3 LoRA fine-tuning companion repo — PEFT integration, 9 known pitfalls from IMDA NSC production runs, corrected from failed full-SFT attempt
🎙️ CosyVoice LoRA 微调框架:LLM+Flow 联合训练,实现无 Prompt 语音合成
Add a description, image, and links to the cosyvoice topic page so that developers can more easily learn about it.
To associate your repository with the cosyvoice topic, visit your repo's landing page and select "manage topics."