Stars
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
A pipeline for fine-tuning VITS for fast speaker-adaptation TTS and many-to-many voice conversion
Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (full-featured DragGAN implementation with an online demo and local deployment for trial use; code and models are fully open source; supports Windows, macOS, and Linux)
SkyReels-V2: Infinite-length Film Generative model
[ECCV 2024] IDM-VTON: Improving Diffusion Models for Authentic Virtual Try-on in the Wild
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
[ECCV 2024] Code for DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
[ICLR 2025] Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
MS-Agent: Lightweight Framework for Empowering Agents with Autonomous Exploration in Complex Task Scenarios
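Frame interpolation software with better results and lower VRAM usage, 10-25 times faster than DAIN; includes frame-decimation processing to remove the stuttering feel of anime
Zero-shot voice conversion and singing voice conversion, with real-time support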
ACE-Step: A Step Towards Music Generation Foundation Model
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
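🚀 One-click deployment (offline all-in-one package included)! Based on ChatTTS, with support for streaming output, random voice-timbre sampling, long-audio generation, and multi-role reading. Simple and easy to use, with no complicated installation.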
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
An AI agent that beats the classic game "Snake".
[SIGGRAPH Asia 2025] DreamO: A Unified Framework for Image Customization
SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement
[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters in total), 2) Parameter-Efficient Training (49.57M trainable parameters) …
Powerful & Easy-to-Use Video Face Swapping and Editing Software
[IJCAI 2024] Generate different roles for GPTs to form a collaborative entity for complex tasks.
Password brute-force dictionaries in three types: keyboard-pattern combinations, pinyin, and mixed letters and digits
Synchronized Translation for Videos. Video dubbing
Convert your videos to densepose and use it on MagicAnimate
An autoagentic AGI that is self-evolving and modular.
Repository for most of the code from my YouTube channel