Skip to content
View arthur-qiu's full-sized avatar

Organizations

@AI-secure

Block or report arthur-qiu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
226 results for source starred repositories
Clear filter

[NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Python 558 26 Updated Jan 5, 2026

Official repository for LTX-Video

Python 9,233 866 Updated Jan 5, 2026

[ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation

115 1 Updated Oct 7, 2025

Code for CineScale, higher-resolution video generation based on Wan

Python 183 2 Updated Aug 25, 2025

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 3,120 238 Updated Sep 12, 2025

Inception Score for GANs in Pytorch

Python 664 126 Updated Mar 3, 2020

Pytorch implementation of common image generation metrics.

Python 198 17 Updated Jun 14, 2024

Lets make video diffusion practical!

Python 16,604 1,639 Updated Oct 16, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,288 2,371 Updated Dec 15, 2025

Rectified Rotary Position Embeddings

Python 388 30 Updated May 20, 2024

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 34,198 4,256 Updated Aug 6, 2024

CLIPScore EMNLP code

Python 244 26 Updated Dec 16, 2022

🔥🔥🔥A curated list of papers on recent diffusion-based high-resolution image and video synthesis works.

165 8 Updated Dec 26, 2024

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Python 1,253 51 Updated Jun 8, 2025

[ICCV 2025] Official Implementation for "Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition"

Python 305 29 Updated Jan 9, 2025

[ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation

Python 148 5 Updated Oct 9, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,697 1,189 Updated Nov 21, 2025

A port of muerrilla's sd-webui-Detail-Daemon as a node for ComfyUI, to adjust sigmas that control detail.

Python 929 33 Updated Dec 21, 2025

Quick scripts to calculate CLIP text-image similarity

Python 305 18 Updated Apr 14, 2025

PyTorch - FID calculation with proper image resizing and quantization steps [CVPR 2022]

Python 1,137 79 Updated Aug 2, 2025

High-fidelity performance metrics for generative models in PyTorch

Python 1,172 87 Updated Nov 18, 2025

Let's finetune video generation models!

Python 536 29 Updated Sep 15, 2025

Improved AnimateDiff for ComfyUI and Advanced Sampling Support

Python 3,379 261 Updated Aug 6, 2025

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,405 1,253 Updated Nov 4, 2025

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Python 913 24 Updated Mar 17, 2025

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

HTML 538 30 Updated Apr 4, 2025

[TPAMI 2025] ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis

Python 1,505 54 Updated Dec 13, 2025

High-resolution models for human tasks.

Python 5,276 314 Updated Nov 18, 2024

Official inference repo for FLUX.1 models

Python 25,189 1,851 Updated Jul 31, 2025

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,896 95 Updated Oct 31, 2024
Next