An inference-time, plug-and-play method for temporal control in multi-event generation

JavaScript 10 Updated Mar 17, 2026

[ICLR 2026] Official code for "The Quest for Generalizable Motion Generation: Data, Model, and Evaluation"

Python 86 3 Updated Mar 19, 2026

[CVPR 2026] MatAnyone 2: Scaling Video Matting via a Learned Quality Evaluator

Python 542 31 Updated Mar 16, 2026

[CVPR 2026] WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World

Python 192 16 Updated Jan 18, 2026

This is a collection of recent papers on reasoning in video generation models.

143 5 Updated Mar 23, 2026

🔥 An open-source survey of the latest video reasoning tasks, paradigms, and benchmarks.

156 6 Updated Mar 16, 2026
Python 31 1 Updated Dec 17, 2025
Python 22 1 Updated Feb 13, 2026

Code for CineScale, higher-resolution video generation based on Wan

Python 185 2 Updated Aug 25, 2025

[ICIP2025 Spotlight] Efficient and High-Fidelity Image Generation

JavaScript 4 1 Updated Jan 12, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,801 1,793 Updated Mar 17, 2026

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

Python 92 3 Updated Sep 12, 2025

A list of works on video generation towards world models

435 8 Updated Mar 21, 2026

Let's make video diffusion practical!

Python 16,698 1,648 Updated Oct 16, 2025

Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025)

Python 52 2 Updated Jan 14, 2026

[CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation

Python 1,527 104 Updated Mar 4, 2026

A Python package that makes it easy for developers to create AI apps powered by various AI providers.

Python 1,645 205 Updated Apr 8, 2025

[ACL2025 Oral & Award] Evaluate Image/Video Generation like Humans - Fast, Explainable, Flexible

Python 122 5 Updated Aug 10, 2025

Understand Human Behavior to Align True Needs

Python 4,059 393 Updated Aug 13, 2025

Implementation of P+: Extended Textual Conditioning in Text-to-Image Generation

Python 49 1 Updated Mar 26, 2023

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,193 1,098 Updated Nov 18, 2024

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,252 95 Updated Feb 16, 2025

A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems

421 21 Updated Sep 22, 2025

[CSUR] A Survey on Video Diffusion Models

2,283 112 Updated Mar 14, 2026

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Python 6,605 736 Updated Mar 19, 2025

[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models

Python 545 21 Updated Jan 18, 2024

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Python 938 43 Updated Sep 27, 2024

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,545 107 Updated Mar 23, 2026

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

Jupyter Notebook 3,007 201 Updated Mar 9, 2024

FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)

1,896 77 Updated Dec 24, 2024