Stars
Talk to any LLM with hands-free voice interaction, voice interruption, and Live2D taking face running locally across platforms
KevinAHM / echo-tts-api
Forked from jordandare/echo-ttsEcho-TTS OpenAI Compatible Speech Endpoint w/ Streaming
One-click 3D Gaussian Splatting generation from a single image.
[ICCV 2023] DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders
An Open-Source Multimodal AIGC Solution based on ComfyUI + MCP + LLM https://pixelle.ai
An AI-powered custom node for ComfyUI designed to enhance workflow automation and provide intelligent assistance
Python SDK for ComfyUI - Support Local or Cloud - Generate images, videos, audio in 3 lines. https://puke3615.github.io/ComfyKit
A mass video player for easy browsing of large video datasets
lihaoyun6 / FlashVSR_plus
Forked from OpenImagingLab/FlashVSRTowards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional de…
A Video Slider Component for Gradio Application
[CVPR 2026] Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny co…
HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation
Official implementation of HYPIR: Harnessing Diffusion-Yielded Score Priors for Image Restoration (SIGGRAPH 2025)
The ultimate training toolkit for finetuning diffusion models
[v0.5.1] FramePack Video App offering multiple generation types: Original, F1, video extension, end frame. Features include: LoRA support, job queueing, advanced timestamped prompts, offline mode, …
FP-Studio / framepack-studio
Forked from lllyasviel/FramePackExpanding FramePack into a multifunction video creation tool
nirvash / FramePack
Forked from lllyasviel/FramePackLets make video diffusion practical!
Metadata-indexer and Viewer for AI-generated images
[CVPR 2025] Learning Flow Fields in Attention for Controllable Person Image Generation
[CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation
deepbeepmeep / YuEGP
Forked from multimodal-art-projection/YuEYuE: Open Full-song Generation Foundation for the GPU Poor
Examples of using the llasa-tts models locally
GPU Poor Version of Hunyuan3D-2
Enhance-A-Video: Better Generated Video for Free
[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition
Flow is a custom node designed to provide a user-friendly interface for ComfyUI.