View QuanjianSong's full-sized avatar

Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation"

Python 358 16 Updated Feb 18, 2026

[NeurIPS 2025 D&B🔥] OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation

Jupyter Notebook 194 7 Updated Jan 7, 2026

VideoCoF: Unified Video Editing with Temporal Reasoner

Python 137 6 Updated Feb 11, 2026

EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling

Python 213 6 Updated Feb 3, 2026

📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models.

349 15 Updated Jan 8, 2026

[NeurIPS 2025] The official repository of "Sekai: A Video Dataset towards World Exploration"

Python 258 6 Updated Dec 31, 2025

Vision Bridge Transformer at Scale

Python 139 7 Updated Dec 1, 2025

[ICLR 26 Oral] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling

Python 2,041 165 Updated Jan 19, 2026

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,215 71 Updated Aug 7, 2025

Qwen-Image-Layered: Layered Decomposition for Inherent Editability

Python 1,569 118 Updated Dec 31, 2025

Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"

Python 1,808 203 Updated Jan 30, 2026

[NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Python 572 25 Updated Jan 5, 2026

"MoCA: Mixture-of-Components Attention for Scalable Compositional 3D Generation"

173 7 Updated Dec 9, 2025

A curated list of recent diffusion models for video generation, editing, and various other applications.

5,459 338 Updated Feb 3, 2026

[ICLR 2026] LongLive: Real-time Interactive Long Video Generation

Python 1,057 89 Updated Feb 15, 2026

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 7,738 1,045 Updated Feb 17, 2026

Kandinsky 5.0: A family of diffusion models for Video & Image generation

Python 706 53 Updated Jan 31, 2026

HunyuanVideo-1.5: A leading lightweight video generation model

Python 4,425 220 Updated Feb 12, 2026

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 2,125 143 Updated Dec 8, 2025

My big collection of creative nano-banana use cases! Continuously updated!

3,538 344 Updated Sep 18, 2025

Official Repo for Self-Forcing++ High Quality Long Video Generation

235 4 Updated Oct 13, 2025

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python 2,911 207 Updated Jan 18, 2026

[ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling

Python 429 8 Updated Jan 7, 2026

[Unofficial] RF Inversion implemented for SD3 / SD3.5

Python 13 1 Updated Nov 4, 2024

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 9,601 740 Updated Feb 17, 2026
Python 47 4 Updated Oct 11, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 7,366 430 Updated Feb 10, 2026

A unified inference and post-training framework for accelerated video generation.

Python 3,089 266 Updated Feb 18, 2026

4-steps distilled version of Wan2.2-TI2V-5B

Python 140 9 Updated Jan 26, 2026