Skip to content
View arthur-qiu's full-sized avatar

Organizations

@AI-secure

Block or report arthur-qiu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Python 445 17 Updated Dec 19, 2025

Official repository for LTX-Video

Python 8,916 835 Updated Oct 25, 2025

The official implementation of paper “VChain: Chain-of-Visual-Thought for Reasoning in Video Generation”

109 1 Updated Oct 7, 2025

Code for CineScale, higher-resolution video generation based on Wan

Python 180 2 Updated Aug 25, 2025

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 2,981 219 Updated Sep 12, 2025

Inception Score for GANs in Pytorch

Python 661 126 Updated Mar 3, 2020

Pytorch implementation of common image generation metrics.

Python 192 17 Updated Jun 14, 2024

Lets make video diffusion practical!

Python 16,368 1,594 Updated Oct 16, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,956 2,215 Updated Dec 15, 2025

Rectified Rotary Position Embeddings

Python 384 30 Updated May 20, 2024

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 33,607 4,199 Updated Aug 6, 2024

CLIPScore EMNLP code

Python 242 26 Updated Dec 16, 2022

🔥🔥🔥A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.

160 8 Updated Dec 26, 2024

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Python 1,203 49 Updated Jun 8, 2025

[ICCV 2025] Official Implementation for "Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition"

Python 303 29 Updated Jan 9, 2025

[ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation

Python 145 5 Updated Oct 9, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,483 1,154 Updated Nov 21, 2025

A port of muerrilla's sd-webui-Detail-Daemon as a node for ComfyUI, to adjust sigmas that control detail.

Python 890 31 Updated Dec 4, 2025

Quick scripts to calculate CLIP text-image similarity

Python 302 18 Updated Apr 14, 2025

PyTorch - FID calculation with proper image resizing and quantization steps [CVPR 2022]

Python 1,127 78 Updated Aug 2, 2025

High-fidelity performance metrics for generative models in PyTorch

Python 1,156 86 Updated Nov 18, 2025

Let's finetune video generation models!

Python 531 28 Updated Sep 15, 2025

Improved AnimateDiff for ComfyUI and Advanced Sampling Support

Python 3,349 258 Updated Aug 6, 2025

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,258 1,233 Updated Nov 4, 2025

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Python 914 23 Updated Mar 17, 2025

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

HTML 530 30 Updated Apr 4, 2025

[TPAMI 2025] ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis

Python 1,482 55 Updated Dec 13, 2025

High-resolution models for human tasks.

Python 5,251 309 Updated Nov 18, 2024

Official inference repo for FLUX.1 models

Python 24,935 1,829 Updated Jul 31, 2025

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,882 92 Updated Oct 31, 2024
Next