Stars
Stable Diffusion web UI
A feature-rich command-line audio/video downloader
Command-line program to download videos from YouTube.com and other video sites
real time face swap and one-click video deepfake with only a single image
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
No fortress, purely open ground. OpenManus is Coming.
The original local LLM interface. Text, vision, tool-calling, training, and more. 100% offline.
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
State-of-the-art 2D and 3D Face Analysis Project
SoftVC VITS Singing Voice Conversion
Generative Models by Stability AI
Image-to-Image Translation in PyTorch
DeepFaceLab is the leading software for creating deepfakes.
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.
Lets make video diffusion practical!
Wan: Open and Advanced Large-Scale Video Generative Models
Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用
"RAG-Anything: All-in-One RAG Framework"
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)