Skip to content
View ailingzengzzz's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Block or report ailingzengzzz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

LPM 1.0: Video-based Character Performance Model

HTML 120 5 Updated Apr 10, 2026

Krea Realtime 14B. An open-source realtime AI video model.

Python 520 35 Updated Nov 13, 2025

Dataset for paper "OmniMotion-X: Versatile Multimodal Whole-Body Motion Generation"

21 Updated Dec 22, 2025

[NeurIPS 2025] TalkCuts: A Large-Scale Dataset for Multi-Shot Human Speech Video Generation

Python 31 Updated Dec 14, 2025
Python 1,684 193 Updated Nov 15, 2025

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,814 178 Updated Apr 9, 2026
Python 127 5 Updated Aug 10, 2025

Community trainer for Lightricks' LTX Video model 🎬 ⚡️

Python 424 58 Updated Jan 6, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,167 1,846 Updated Mar 17, 2026

We write your reusable computer vision tools. 💜

Python 37,888 3,328 Updated Apr 8, 2026

An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation

Python 1,568 77 Updated Oct 16, 2025

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 4,033 332 Updated Aug 14, 2025

Open-source unified multimodal model

Python 5,797 512 Updated Oct 27, 2025

The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation

Python 39 Updated May 4, 2025

HyperMotion is a pose guided human image animation framework based on a large-scale video diffusion Transformer.

Python 140 10 Updated Mar 10, 2026

TalkingMachines

JavaScript 179 8 Updated Aug 2, 2025

MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models

Python 184 9 Updated Jul 21, 2025

✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,501 181 Updated Mar 28, 2025

[SIGGRAPH 2025] LAM: Large Avatar Model for One-shot Animatable Gaussian Head

Python 956 90 Updated Sep 11, 2025

Generative AI for Character Animation: A Comprehensive Survey of Techniques, Applications, and Future Directions

67 2 Updated May 13, 2025

SkyReels-V2: Infinite-length Film Generative model

Python 6,737 1,413 Updated Jan 29, 2026

[CVPR 2025 Oral] PyTorch re-implementation for Autoregressive Distillation of Diffusion Transformers (ARD).

Python 142 6 Updated Oct 1, 2025

[ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey"

716 44 Updated Nov 11, 2025

Lets make video diffusion practical!

Python 16,730 1,650 Updated Oct 16, 2025

[CVPR25 Oral (Top 3.3%)] Official code for paper "Reconstructing Humans with a Biomechanically Accurate Skeleton".

Python 608 50 Updated Mar 6, 2026

[ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Python 2,589 207 Updated Mar 17, 2026

Versatile Evaluation of Speech and Audio

Python 398 45 Updated Dec 9, 2025

HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo

Python 1,804 191 Updated Apr 7, 2026

Enjoy the magic of Diffusion models!

Python 12,212 1,187 Updated Apr 8, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,779 2,556 Updated Mar 5, 2026
Next