
Organizations

@THUDM


Generate audio-reactive SCAIL pose sequences for character animation without requiring input video tracking.

Python 5 1 Updated Dec 19, 2025

A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using autoregressive diffusion.

Python 147 16 Updated Dec 15, 2025

Official implementation of the paper "Delving into Latent Spectral Biasing of Video VAEs for Superior Diffusability".

Python 21 2 Updated Dec 19, 2025

Official Implementation of SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations

Python 375 14 Updated Dec 19, 2025

Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives

Python 557 104 Updated Nov 26, 2025

Kaleido: Open-sourced multi-subject reference video generation model, enabling controllable, high-fidelity video synthesis from multiple image references.

Python 88 7 Updated Dec 11, 2025

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 2,976 218 Updated Sep 12, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 40,402 7,019 Updated Dec 19, 2025

Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io

Python 314 21 Updated May 17, 2025
Python 20 1 Updated Jul 20, 2025

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,111 63 Updated Aug 7, 2025

Concat-ID: Towards Universal Identity-Preserving Video Synthesis

Python 65 Updated May 7, 2025

[CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer

Python 1,349 174 Updated Mar 13, 2025

[AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Python 354 11 Updated Mar 26, 2025

Official code for MotionBench (CVPR 2025)

Python 61 2 Updated Mar 3, 2025

[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Python 791 44 Updated Aug 30, 2025

Simple ControlNet module for the CogVideoX model.

Jupyter Notebook 176 11 Updated Jan 12, 2025

Helpful tools and examples for working with flex-attention

Python 1,089 67 Updated Dec 18, 2025

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 2,476 295 Updated Dec 19, 2025

[CVPR 2025 Highlight] 3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion

Python 1,015 37 Updated Jul 14, 2025

Next-Token Prediction is All You Need

Python 2,266 91 Updated Nov 19, 2025

Keyframe Interpolation with CogVideoX

Python 139 3 Updated Oct 31, 2024

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

431 23 Updated Mar 8, 2025

[CVPR'25] Tora: Trajectory-oriented Diffusion Transformer for Video Generation

Python 1,223 58 Updated Jul 9, 2025

Scalable and memory-optimized training of diffusion models

Python 1,311 144 Updated Jun 4, 2025

CogView4, CogView3-Plus and CogView3 (ECCV 2024)

Python 1,100 79 Updated Mar 29, 2025

📹 A more flexible framework that can generate videos at any resolution and create videos from images.

Python 1,706 128 Updated Dec 19, 2025

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,257 1,233 Updated Nov 4, 2025