Stars
Official Astra distribution: documentation and release binaries for enterprise users.
Secure memory management for AI Agents • Ensures data integrity • Reduces hallucinations • Maintains consistent long-term context
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
21 Lessons, Get Started Building with Generative AI
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.
AI-native HTAP database with Git-for-Data and built-in vector search, serving as the data and memory backbone for intelligent agents and applications.
PerconaFT is a high-performance, transactional key-value store
Verified, concurrent, crash-safe transaction system
A composable and fully extensible C++ execution engine library for data management systems.
DuckDB is an analytical in-process SQL database management system
Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!
Apache Doris is an easy-to-use, high performance and unified analytics database.
Self-Driving Database Management System from Carnegie Mellon University
🥑 ArangoDB is a native multi-model database with flexible data models for documents, graphs, and key-values. Build high performance applications using a convenient SQL-like query language or JavaSc…
A list of learning materials to understand databases internals
Rich is a Python library for rich text and beautiful formatting in the terminal.
Python 3 library to aid coding with VeChain, eg. Wallets/Tx/Sign/Verify.
Distributed Task Queue (development branch)
Python library that makes it easy for data scientists to create charts.
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Debugger capable of attaching to and injecting code into python processes.