Starred repositories
Stable Diffusion web UI
Command-line program to download videos from YouTube.com and other video sites
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Robust Speech Recognition via Large-Scale Weak Supervision
Python tool for converting files and office documents to Markdown.
Real-time face swap and one-click video deepfake with only a single image
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Get your documents ready for gen AI
Making large AI models cheaper, faster and more accessible
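Free, open-source, offline OCR software. Supports screenshot capture and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning and generating QR codes. Includes built-in multilingual language libraries.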
Instant voice cloning by MIT and MyShell. Audio foundation model.
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
TradingAgents: Multi-Agents LLM Financial Trading Framework
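微舆 (Weiyu): a multi-agent public opinion analysis assistant that anyone can use. It breaks through information cocoons, restores the true picture of public opinion, predicts future trends, and supports decision-making. Implemented from scratch, with no dependency on any framework.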
A simple Python Pydantic model for Honkai: Star Rail parsed data from the Mihomo API.
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
DeepSeek Coder: Let the Code Write Itself
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
Xiaomi Home Integration for Home Assistant
Intelligent automation and multi-agent orchestration for Claude Code
Stable Diffusion with Core ML on Apple Silicon
Experience macOS just like before
🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …
Run Windows Subsystem For Android on your Windows 10 and Windows 11 PC using prebuilt binaries with Google Play Store (MindTheGapps) and/or Magisk or KernelSU (root solutions) built in.
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
A research prototype of a human-centered web agent