Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Command-line program to download videos from YouTube.com and other video sites
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Real-time face swap and one-click video deepfake with only a single image
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
A latent text-to-image diffusion model
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.
Interact with your documents using the power of GPT, 100% privately, no data leaks
The simplest, fastest repository for training/finetuning medium-sized GPTs.
No fortress, purely open ground. OpenManus is Coming.
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
Generate high-definition short videos with one click using large AI models.
Port of OpenAI's Whisper model in C/C++
The original local LLM interface. Text, vision, tool-calling, training, and more. 100% offline.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Making large AI models cheaper, faster and more accessible
🔊 Text-Prompted Generative Audio Model
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Instant voice cloning by MIT and MyShell. Audio foundation model.
Official Code for DragGAN (SIGGRAPH 2023)
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
Easily train a good voice conversion (VC) model with 10 minutes of voice data or less!
An open-source, extensible AI agent that goes beyond code suggestions: install, execute, edit, and test with any LLM