An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 4,048 333 Updated Aug 14, 2025

ByteDance-Seed / Bagel

Open-source unified multimodal model

Python 5,798 513 Updated Oct 27, 2025

LanDiff / LanDiff

The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation

Python 39 Updated May 4, 2025

vivoCameraResearch / Hyper-Motion

HyperMotion is a pose guided human image animation framework based on a large-scale video diffusion Transformer.

Python 140 10 Updated Mar 10, 2026

aaxwaz / TalkingMachines

TalkingMachines

JavaScript 179 8 Updated Aug 2, 2025

ZulutionAI / MoviiGen1.1

MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models

Python 184 9 Updated Jul 21, 2025

VITA-MLLM / VITA

✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,502 181 Updated Mar 28, 2025

aigc3d / LAM

[SIGGRAPH 2025] LAM: Large Avatar Model for One-shot Animatable Gaussian Head

Python 956 90 Updated Sep 11, 2025

llm-lab-org / Generative-AI-for-Character-Animation-Survey

Generative AI for Character Animation: A Comprehensive Survey of Techniques, Applications, and Future Directions

67 2 Updated May 13, 2025

SkyworkAI / SkyReels-V2

SkyReels-V2: Infinite-length Film Generative model

Python 6,743 1,416 Updated Jan 29, 2026

alsdudrla10 / ARD

[CVPR 2025 Oral] PyTorch re-implementation for Autoregressive Distillation of Diffusion Transformers (ARD).

Python 142 6 Updated Oct 1, 2025

mayuelala / Awesome-Controllable-Video-Generation

[ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey"

716 44 Updated Nov 11, 2025

lllyasviel / FramePack

Let's make video diffusion practical!

Python 16,734 1,650 Updated Oct 16, 2025

IsshikiHugh / HSMR

[CVPR25 Oral (Top 3.3%)] Official code for paper "Reconstructing Humans with a Biomechanically Accurate Skeleton".

Python 609 50 Updated Mar 6, 2026

aigc3d / LHM

[ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Python 2,589 207 Updated Mar 17, 2026

wavlab-speech / versa

Versatile Evaluation of Speech and Audio

Python 398 45 Updated Dec 9, 2025

Tencent-Hunyuan / HunyuanVideo-I2V

HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo

Python 1,805 191 Updated Apr 7, 2026

modelscope / DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python 12,213 1,188 Updated Apr 8, 2026

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,791 2,573 Updated Mar 5, 2026

Ailing Zeng ailingzengzzz

Starred repositories

large-performance-model / large-performance-model.github.io

krea-ai / realtime-video

GuoweiXu368 / OmniMocap-X

UMass-Embodied-AGI / TalkCuts

character-ai / Ovi

ByteDance-Seed / VeOmni

Kai-46 / KnapFormer

Lightricks / LTX-Video-Trainer

Wan-Video / Wan2.2

roboflow / supervision

XueZeyue / DanceGRPO

modelscope / ClearerVoice-Studio