Skip to content
View ailingzengzzz's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Block or report ailingzengzzz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Krea Realtime 14B. An open-source realtime AI video model.

Python 517 34 Updated Nov 13, 2025

Dataset for paper "OmniMotion-X: Versatile Multimodal Whole-Body Motion Generation"

21 Updated Dec 22, 2025

[NeurIPS 2025] TalkCuts: A Large-Scale Dataset for Multi-Shot Human Speech Video Generation

Python 31 Updated Dec 14, 2025
Python 1,676 191 Updated Nov 15, 2025

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,791 171 Updated Apr 3, 2026
Python 127 5 Updated Aug 10, 2025

Community trainer for Lightricks' LTX Video model 🎬 ⚡️

Python 422 58 Updated Jan 6, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,019 1,821 Updated Mar 17, 2026

We write your reusable computer vision tools. 💜

Python 37,462 3,268 Updated Apr 1, 2026

An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation

Python 1,563 76 Updated Oct 16, 2025

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 4,016 331 Updated Aug 14, 2025

Open-source unified multimodal model

Python 5,782 512 Updated Oct 27, 2025

The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation

Python 39 Updated May 4, 2025

HyperMotion is a pose guided human image animation framework based on a large-scale video diffusion Transformer.

Python 140 10 Updated Mar 10, 2026

TalkingMachines

JavaScript 179 8 Updated Aug 2, 2025

MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models

Python 185 9 Updated Jul 21, 2025

✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,500 183 Updated Mar 28, 2025

[SIGGRAPH 2025] LAM: Large Avatar Model for One-shot Animatable Gaussian Head

Python 950 90 Updated Sep 11, 2025

Generative AI for Character Animation: A Comprehensive Survey of Techniques, Applications, and Future Directions

66 2 Updated May 13, 2025

SkyReels-V2: Infinite-length Film Generative model

Python 6,688 1,397 Updated Jan 29, 2026

[CVPR 2025 Oral] PyTorch re-implementation for Autoregressive Distillation of Diffusion Transformers (ARD).

Python 142 6 Updated Oct 1, 2025

[ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey"

714 44 Updated Nov 11, 2025

Lets make video diffusion practical!

Python 16,712 1,651 Updated Oct 16, 2025

[CVPR25 Oral (Top 3.3%)] Official code for paper "Reconstructing Humans with a Biomechanically Accurate Skeleton".

Python 608 50 Updated Mar 6, 2026

[ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Python 2,581 207 Updated Mar 17, 2026

Versatile Evaluation of Speech and Audio

Python 396 46 Updated Dec 9, 2025

HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo

Python 1,804 190 Updated May 20, 2025

Enjoy the magic of Diffusion models!

Python 12,159 1,183 Updated Apr 2, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,735 2,521 Updated Mar 5, 2026

A unified inference and post-training framework for accelerated video generation.

Python 3,339 308 Updated Apr 3, 2026
Next