Starred repositories
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Stable Diffusion web UI
A feature-rich command-line audio/video downloader
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Robust Speech Recognition via Large-Scale Weak Supervision
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini, and open-source models.
Python tool for converting files and office documents to Markdown.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
A high-throughput and memory-efficient inference and serving engine for LLMs
A natural language interface for computers
One minute of voice data can also be used to train a good TTS model! (few-shot voice cloning)
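Free, open-source, offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Includes built-in libraries for multiple languages.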
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
A generative speech model for daily dialogue.
A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows
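[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents that fully preserves the layout; supports services such as Google, DeepL, Ollama, and OpenAI; provides CLI, GUI, MCP, Docker, and Zotero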
ChatDev 2.0: Dev All through LLM-powered Multi-Agent Collaboration
An AI skill that provides design intelligence for building professional UI/UX across multiple platforms
Generative Models by Stability AI
An autonomous agent that conducts deep research on any data using any LLM provider.
Official inference repo for FLUX.1 models
An open-source RAG-based tool for chatting with your documents.
⚡️HivisionIDPhotos: a lightweight and efficient AI tool and algorithm for producing ID photos.
CLI tool for configuring and monitoring Claude Code
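🤖 A DIY multimodal AI chatbot | 🚀 Quick integration with chat platforms such as WeChat, QQ, and Telegram | 🦈 Supports DeepSeek, Grok, Claude, Ollama, Gemini, and OpenAI | Workflow system, web search, AI image generation, persona tuning, virtual maid, voice chat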