Starred repositories
Fast Rust library for PDF inspection, classification, and text extraction. Intelligently detects scanned vs text-based PDFs to enable smart routing decisions.
🚀2.3x faster than MinIO for 4KB object payloads. RustFS is an open-source, S3-compatible high-performance object storage system supporting migration and coexistence with other S3-compatible platfor…
Supercharge Your LLM Application Evaluations 🚀
The open-source managed agents platform. Turn coding agents into real teammates — assign tasks, track progress, compound skills.
Convert PDF to markdown + JSON quickly with high accuracy
A fast, helpful, and open-source document parser
AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, GitHub Copilot CLI, OpenClaw, Factory Droid, Trae, Google Antigravity). Turn any folder of code, docs, papers, images, o…
[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
⭕️ AstroWind: A free template using Astro 5 and Tailwind CSS. Astro starter theme.
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if you like it!
Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Pyt…
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …
Document engine for AI. Reason, don't vector. ⭐ Star this repo if you find it useful.
The repo is finally unlocked. enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.
Rust library and CLI tool for OCR (extracting text from images)
Arena based tree 🌲 structure by using indices instead of reference counted pointers
Open-source context retrieval layer for AI agents
A rich text editor React component for markdown
TypeScript AI AI Function Calling Framework enhanced by compiler skills.
Vectorless, Reasoning-Based Retrieval-Augmented Generation (RAG)
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
Beginner, advanced, expert level Rust training material
The GitButler version control client, backed by Git, powered by Tauri/Rust/Svelte