A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
Updated Oct 2, 2025 - Python
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
ComfyUI node for highly expressive speech and realistic zero-shot voice cloning
ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text
🎙️ Generate natural-sounding speech and clone voices without a tokenizer using VoxCPM's context-aware TTS. Experience true-to-life voice generation today.
Code written while researching my dissertation, exploring screenless technologies for a learning-through-play experience
🎙️ Enhance speech generation and voice cloning using ComfyUI with the VoxCPM integration for token-free, context-aware TTS.