Stars
The most powerful local music generation model that outperforms most commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices.
🎵 The Ultimate Open Source Suno Alternative - Professional UI for ACE-Step 1.5 AI Music Generation. Free, local, unlimited. Stop paying for Suno!
Retropex / mempool
Forked from mempool/mempoolExplore the Bitcoin ecosystem
Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc
Clean, polished interface for Tencent’s SongGeneration. Create songs from text prompts or reference audio, with batch processing and smart model selection. Minimum Requirement: 10GB of VRAM
Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
HY-Motion model for 3D human motion or 3D character animation generation.
Soprano: Instant, Ultra-Realistic Text-to-Speech
A Foundation Model for Generalist Gaming Agents
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
This tool retrieves tokens for all devices connected to Xiaomi cloud and encryption keys for BLE devices.
Patches for the officially unsupported nvidia-470xx driver to work with the latest Linux kernels.
HunyuanVideo-1.5: A leading lightweight video generation model
[CVPR 2026] Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives
Kandinsky 5.0: A family of diffusion models for Video & Image generation
HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation
A modern, flexible React component and hook for image mask editing.
Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition
Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model
Wan: Open and Advanced Large-Scale Video Generative Models
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
bluetooth mesh chat, IRC vibes