Starred repositories
We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while achieving up to 6× acceleration in inference speed.
Native and Compact Structured Latents for 3D Generation
Official implementation for "What matters for Representation Alignment: Global Information or Spatial Structure?"
Towards Scalable Pre-training of Visual Tokenizers for Generation
[NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance
Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".
A framework for efficient model inference with omni-modality models
Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion
A collection of papers and projects that train flow matching models and policies via RL.
Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", SIGGRAPH Asia 2024
The official repo for "Vidi: Large Multimodal Models for Video Understanding and Editing"
Code release for Ming-UniVision: Joint Image Understanding and Generation with a Continuous Unified Tokenizer
Project Sekai sticker maker
A series of math-specific large language models of our Qwen2 series.
Official repository for “DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation”
The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…
Official inference repo for FLUX.2 models
Text-box script for 魔法少女的魔女裁判 (Magical Girl Witch Trials); to use it, press Enter to automatically generate the image and send it
HunyuanVideo-1.5: A leading lightweight video generation model