Stars
Native and Compact Structured Latents for 3D Generation
One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer
Sharp Monocular View Synthesis in Less Than a Second
PersonaLive! : Expressive Portrait Image Animation for Live Streaming
NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms
This project is a collection of Docker-based web user interfaces designed to easily run various state-of-the-art generative AI models locally. It simplifies the deployment of these AI tools by pack…
A cross-platform desktop All-in-One assistant tool for Claude Code, Codex & Gemini CLI.
A Claude Code plugin that automatically captures everything Claude does during your coding sessions, compresses it with AI (using Claude's agent-sdk), and injects relevant context back into future …
ComfyUI-InfiniteTalk-MultiImage
Kaleido: Open-sourced multi-subject reference video generation model, enabling controllable, high-fidelity video synthesis from multiple image references.
Offical Implementation of SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations
A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using autoregressive diffusion.
GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning
[Support 0.49.x](Reset Cursor AI MachineID & Bypass Higher Token Limit) Cursor Ai ,自动重置机器ID , 免费升级使用Pro功能: You've reached your trial request limit. / Too many free trial accounts used on this machi…
GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters
Curated list of awesome Cursor Rules .mdc files
This tool is used to copy cursor rule files for initialize any project
GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
A WebUI app for Music-Source-Separation-Training and we packed UVR together!
Repository for training models for music source separation.
AliceNavigator / Music-Source-Separation-Training-GUI
Forked from ZFTurbo/Music-Source-Separation-TrainingMSST-GUI is a Qt5-based inference GUI, designed to provide a convenient and intuitive way to inference (mainly for my own use)
Ovis-Image is a 7B text-to-image model specifically optimized for high-quality text rendering, designed to operate efficiently under stringent computational constraints.
Xray panel supporting multi-protocol multi-user expire day & traffic & IP limit (Vmess, Vless, Trojan, ShadowSocks, Wireguard, Tunnel, Mixed, HTTP)
[[NeurIPS 2025] UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions
MagicTryOn is a video virtual try-on framework based on a large-scale video diffusion Transformer.
[ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds
MAGI-1: Autoregressive Video Generation at Scale