Stars
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
A curated list of awesome Machine Learning frameworks, libraries and software.
LlamaIndex is the leading document agent and OCR platform
The Big List of Naughty Strings is a list of strings which have a high probability of causing issues when used as user-input data.
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
CowAgent是基于大模型的超级AI助理,能主动思考和任务规划、访问操作系统和外部资源、创造和执行Skills、拥有长期记忆并不断成长,比OpenClaw更轻量和便捷。同时支持微信、飞书、钉钉、企微、QQ、公众号、网页等接入,可选择OpenAI/Claude/Gemini/DeepSeek/ Qwen/GLM/Kimi/LinkAI,能处理文本、语音、图片和文件,可快速搭建个人AI助理和企…
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
Instant voice cloning by MIT and MyShell. Audio foundation model.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
DSPy: The framework for programming—not prompting—language models
A book-in-progress about the Linux kernel and its insides.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials,…
Official inference repo for FLUX.1 models
🚀 One-stop solution for creating your AI twin from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. 从聊天记录创造…
A collaborative note taking, wiki and documentation platform that scales. Built with Django and React.
SMSBoom - Deprecate: Due to judicial reasons, the repository has been suspended!
Open Source DeepWiki: AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories. Join the discord: https://discord.gg/gMwThUMeme
Deduplicating archiver with compression and authenticated encryption.
PaddleFormers is an easy-to-use library of pre-trained large language model zoo based on PaddlePaddle.
Modular visual interface for GDB in Python
A Terminal Client for MySQL with AutoCompletion and Syntax Highlighting.
A framework to enable multimodal models to operate a computer.