Stars
Run ComfyUI workflows on multiple local GPUs/networked machines.
ComfyUI extension that enables multi-GPU processing locally, remotely and in the cloud
Download Hugging Face and CivitAI models and other assets used in ComfyUI workflows
Dead simple FLUX LoRA training UI with LOW VRAM support
Free Download Manager Add-On. Provides support for downloading videos from various sites.
A Web UI that simplifies AI video generation using the Hunyuan Video Diffusion Model
all workflow packs for ComfyUI from @driftjohnson @scuffedepoch
A pipeline parallel training script for diffusion models.
ComfyUI nodes for training AnimateDiff motion LoRAs
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
openvpi / DiffSinger
Forked from MoonInTheRiver/DiffSinger
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs
🔊 Text-Prompted Generative Audio Model
A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, …
Easily train a good VC model with voice data <= 10 mins!
Tumor Cell localization using Capsule Networks & U-Net.
Abhi0323 / RAG-Powered-AI-Assistant-Transforming-Data-Retrieval-and-Analysis-Across-the-Web-and-PDFs
Harness the power of Retrieval-Augmented Generation with the Personal AI Assistant, an innovative tool designed to extract and synthesize information from web and PDF sources efficiently. This cutt…
A Python bot to apply to all LinkedIn Easy Apply jobs based on your preferences.
A Playwright bot that scrapes LinkedIn and stores advertisement data in a database and a Telegram channel
OpenUI lets you describe UI using your imagination, then see it rendered live.
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation