yzy-thu

Follow

🏠

Working from home

yangzy_thu yzy-thu

🏠

Working from home

Follow

21 followers · 7 following

Achievements

Achievements

Organizations

Stars

ckinpdx / ComfyUI-SCAIL-AudioReactive

Generate audio-reactive SCAIL pose sequences for character animation without requiring input video tracking.

Python 5 1 Updated Dec 19, 2025

zai-org / RealVideo

A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using autoregressive diffusion.

Python 147 16 Updated Dec 15, 2025

zai-org / SSVAE

official implementation of the paper "Delving into Latent Spectral Biasing of Video VAEs for Superior Diffusability".

Python 21 2 Updated Dec 19, 2025

zai-org / SCAIL

Offical Implementation of SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations

Python 375 14 Updated Dec 19, 2025

bilibili / Index-anisora

Python 2,300 126 Updated Nov 2, 2025

yihao-meng / HoloCine

Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives

Python 557 104 Updated Nov 26, 2025

zai-org / Kaleido

Kaleido: Open-sourced multi-subject reference video generation model, enabling controllable, high-fidelity video synthesis from multiple image references.

Python 88 7 Updated Dec 11, 2025

guandeh17 / Self-Forcing

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 2,976 218 Updated Sep 12, 2025

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 40,402 7,019 Updated Dec 19, 2025

roboterax / video-prediction-policy

Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io

Python 314 21 Updated May 17, 2025

thu-coai / VPO

Python 20 1 Updated Jul 20, 2025

tianweiy / CausVid

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,111 63 Updated Aug 7, 2025

ML-GSAI / Concat-ID

Concat-ID: Towards Universal Identity-Preserving Video Synthesis

Python 65 Updated May 7, 2025

fudan-generative-vision / hallo3

[CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer

Python 1,349 174 Updated Mar 13, 2025

zai-org / VisionReward

[AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Python 354 11 Updated Mar 26, 2025

zai-org / MotionBench

Official code for MotionBench (CVPR 2025)

Python 61 2 Updated Mar 3, 2025

PKU-YuanGroup / ConsisID

[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Python 791 44 Updated Aug 30, 2025

TheDenk / cogvideox-controlnet

Simple Controlnet module for CogvideoX model.

Jupyter Notebook 176 11 Updated Jan 12, 2025

meta-pytorch / attention-gym

Helpful tools and examples for working with flex-attention

Python 1,089 67 Updated Dec 18, 2025

xdit-project / xDiT

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 2,476 295 Updated Dec 19, 2025

3DTopia / 3DTopia-XL

[CVPR 2025 Highlight] 3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion

Python 1,015 37 Updated Jul 14, 2025

baaivision / Emu3

Next-Token Prediction is All You Need

Python 2,266 91 Updated Nov 19, 2025

feizc / CogvideX-Interpolation

Keyframe Interpolation with CogvideoX

Python 139 3 Updated Oct 31, 2024

facebookresearch / MovieGenBench

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

431 23 Updated Mar 8, 2025

alibaba / Tora

[CVPR'25]Tora: Trajectory-oriented Diffusion Transformer for Video Generation

Python 1,223 58 Updated Jul 9, 2025

huggingface / finetrainers

Scalable and memory-optimized training of diffusion models

Python 1,311 144 Updated Jun 4, 2025

zai-org / CogView4

CogView4, CogView3-Plus and CogView3(ECCV 2024)

Python 1,100 79 Updated Mar 29, 2025

aigc-apps / VideoX-Fun

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,706 128 Updated Dec 19, 2025

kijai / ComfyUI-CogVideoXWrapper

Python 1,537 94 Updated Aug 7, 2025

zai-org / CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,257 1,233 Updated Nov 4, 2025