Stars
Offical Implementation of SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations
PersonaLive! : Expressive Portrait Image Animation for Live Streaming
Taming large-scale full-parameter few-step training with self-adversarial flows! 👏🏻
Kandinsky 5.0: A family of diffusion models for Video & Image generation
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Official implementation of "Towards One-Step Causal Video Generation via Adversarial Self-Distillation" (arXiv 2025). A novel framework for efficient and causal video generation using adversarial s…
This is the official repo for the paper "LongCat-Flash-Omni Technical Report"
SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.
rCM: SOTA Diffusion Distillation & Few-Step Video Generation based on sCM/MeanFlow
Official implementation for SSDD Single-Step Diffusion Decoder for Efficient Image Tokenization.
A minimal implementation of DeepMind's Genie world model
LongLive: Real-time Interactive Long Video Generation
MiMo-Audio: Audio Language Models are Few-Shot Learners
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Tongyi Deep Research, the Leading Open-source Deep Research Agent
HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning
Official repository for the UAE paper, unified-GRPO, and unified-Bench
SpatialVID: A Large-Scale Video Dataset with Spatial Annotations
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.
A iterative feedback driven benchmark on LLM's instruction following ability
The codes for Vivid-VR: Distilling Concepts from Text-to-Video Diffusion Transformer for Photorealistic Video Restoration
Official Repository of "OmniTry: Virtual Try-On Anything without Masks"