Stars
Handle multiprompts and images within one run. Quick OutputLists from spreadsheet, JSON, multiline text, numberranges for sequential processing. Combinations of lists and prompts. Load any file wit…
ComfyUI workflows to create smooth transitions between video clips using Wan VACE. Works with video from any model or other source-LTX-2, drone footage, stock video, personal recordings, etc.
FSampler is a training‑free, sampler‑agnostic acceleration layer for diffusion sampling.
ComfyUI Chatterbox TTS & Voice Conversion Node
Sarania / blissful-tuner
Forked from kohya-ss/musubi-tunerAdvanced CLI diffusion inference/training suite based on Musubi Tuner
Very customizable imageboard/booru downloader with powerful filenaming features.
A sd-webui extension for utilizing DanTagGen to "upsample prompts".
Simple string manipulation nodes for ComfyUI (strip/remove text strings, search and replace text strings, boolean check for string, preview modified string outputs). Useful for modifying prompts or…
ComfyUI-RealESRGAN_Upscaler
Create and render complex ffmpeg filtergraphs in the browser.
https://wavespeed.ai/ [WIP] The all in one inference optimization solution for ComfyUI, universal, flexible, and fast.
A set of ComfyUI nodes providing additional control for the LTX Video model
Calculate the execution time of all nodes.
Execution Time Analysis, Reroute Enhancement, Remote Python Logs, For ComfyUI developers.
Webui for using XTTS and for finetuning it
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, D…
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design