Starred repositories
Stable Diffusion web UI
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal tasks, for both inference and training.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Real-time face swap and one-click video deepfake with only a single image
💫 Toolkit to help you get started with Spec-Driven Development
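Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.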
A high-throughput and memory-efficient inference and serving engine for LLMs
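A practical interactive interface for GPT, GLM, and other large language models, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…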
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
scikit-learn: machine learning in Python
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Clone a voice in 5 seconds to generate arbitrary speech in real-time
One minute of voice data is enough to train a good TTS model (few-shot voice cloning)
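⭐ AI-driven public opinion & trend monitor with multi-platform aggregation, RSS, and smart alerts. 🎯 Say goodbye to information overload: your AI public-opinion monitoring assistant and trending-topic filter. Aggregates trending topics from multiple platforms plus RSS subscriptions, with precise keyword filtering. AI-powered news filtering, AI translation, and AI analysis briefings pushed straight to your phone; also supports integration with the MCP architecture…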
The original local LLM interface. Text, vision, tool-calling, training. UI + API, 100% offline and private.
The highest-scoring AI memory system ever benchmarked. And it's free.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
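Open-source, free, offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning or generating QR codes. Includes built-in libraries for multiple languages.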
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
Official Code for DragGAN (SIGGRAPH 2023)
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.