-
infiniflow
- Shanghai
Lists (3)
Sort Name ascending (A-Z)
Stars
The Next-Gen Database for AI—an infrastructure designed for data and AI. As the MySQL of the AI era.
The fastest Office document library for Python, Rust, Go, JS/TS, C# and WASM. DOCX, XLSX, PPTX, DOC, XLS, PPT. Up to 100× faster than python-docx/openpyxl/python-pptx. 100% pass rate on valid Offic…
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.
Adaptive Chunking: automatically select the best chunking method per document for RAG. Accepted at LREC 2026.
Fastest enterprise AI gateway (50x faster than LiteLLM) with adaptive load balancer, cluster mode, guardrails, 1000+ models support & <100 µs overhead at 5k RPS.
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Ragflow-Plus 是 Ragflow 的二次开发版本,使其更为简洁实用
🚀 Next Gen Multi-tenant AI One-Stop Solution. Builtin Admin & Billing System. Enterprise-Grade Unified LLM Gateway Support for 200+ Models And 35+ Providers, Load Balacing w/ Priority-base Routing,…
vsag is a vector indexing library used for similarity search.
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
[ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark
Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍
Make your JSON data collaborative and version-controlled with CRDTs
精选了10K+项目,包括机器学习、深度学习、NLP、GNN、推荐系统、生物医药、机器视觉、前后端开发等内容。Selected more than 10k+ projects, including machine learning, deep learning, NLP, GNN, recommendation system, biomedicine, machine vision, etc.…
Pebblo enables developers to safely load data and promote their Gen AI app to deployment
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
Embedded relational database and native Rust data API.
Lightweight, asynchronous based on LSM Leveled Compaction KV database
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
A hybrid thread / fiber task scheduler written in C++ 11
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-re…
ByConity is an open source cloud data warehouse
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.
😎 Awesome list of tools and projects with the awesome LangChain framework
Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.