Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Stable Diffusion web UI
A feature-rich command-line audio/video downloader
Robust Speech Recognition via Large-Scale Weak Supervision
real time face swap and one-click video deepfake with only a single image
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
A generative speech model for daily dialogue.
Official Code for DragGAN (SIGGRAPH 2023)
Instant voice cloning by MIT and MyShell. Audio foundation model.
Real-time face swap for PC streaming or video calls
Industry leading face manipulation platform
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
DeepFaceLab is the leading software for creating deepfakes.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Python bindings for FFmpeg - with complex filtering support
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)