Skip to content
View ailingzengzzz's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Block or report ailingzengzzz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Krea Realtime 14B. An open-source realtime AI video model.

Python 425 24 Updated Nov 13, 2025

Dataset for paper "OmniMotion-X: Versatile Multimodal Whole-Body Motion Generation"

20 Updated Oct 27, 2025

[NeurIPS 2025] TalkCuts: A Large-Scale Dataset for Multi-Shot Human Speech Video Generation

Python 24 Updated Dec 14, 2025
Python 1,452 152 Updated Nov 15, 2025

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,436 121 Updated Dec 19, 2025
Python 123 5 Updated Aug 10, 2025

Community trainer for Lightricks' LTX Video model 🎬 ⚡️

Python 366 49 Updated Oct 26, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,904 1,502 Updated Dec 17, 2025

We write your reusable computer vision tools. 💜

Python 36,177 3,054 Updated Dec 15, 2025

An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation

Python 1,347 66 Updated Oct 16, 2025

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 3,742 302 Updated Aug 14, 2025

Open-source unified multimodal model

Python 5,479 481 Updated Oct 27, 2025

The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation

Python 38 Updated May 4, 2025

HyperMotion is a pose guided human image animation framework based on a large-scale video diffusion Transformer.

Python 128 8 Updated Jul 14, 2025

TalkingMachines

JavaScript 174 8 Updated Aug 2, 2025

MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models

Python 180 8 Updated Jul 21, 2025

✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,462 180 Updated Mar 28, 2025

[SIGGRAPH 2025] LAM: Large Avatar Model for One-shot Animatable Gaussian Head

Python 881 79 Updated Sep 11, 2025

Generative AI for Character Animation: A Comprehensive Survey of Techniques, Applications, and Future Directions

58 2 Updated May 13, 2025

SkyReels-V2: Infinite-length Film Generative model

Python 5,208 851 Updated Aug 11, 2025

[CVPR 2025 Oral] PyTorch re-implementation for Autoregressive Distillation of Diffusion Transformers (ARD).

Python 138 6 Updated Oct 1, 2025

[ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey"

603 38 Updated Nov 11, 2025

Lets make video diffusion practical!

Python 16,357 1,592 Updated Oct 16, 2025

[CVPR25 Oral (Top 3.3%)] Official code for paper "Reconstructing Humans with a Biomechanically Accurate Skeleton".

Python 585 47 Updated Aug 17, 2025

[ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Python 2,507 196 Updated Jul 15, 2025

Versatile Evaluation of Speech and Audio

Python 365 46 Updated Dec 9, 2025

HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo

Python 1,753 178 Updated May 20, 2025

Enjoy the magic of Diffusion models!

Python 11,174 1,054 Updated Dec 19, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,954 2,209 Updated Dec 15, 2025

A unified inference and post-training framework for accelerated video generation.

Python 2,834 226 Updated Dec 19, 2025
Next