Stars
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
21 Lessons, Get Started Building with Generative AI
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.
PerconaFT is a high-performance, transactional key-value store
Verified, concurrent, crash-safe transaction system
A composable and fully extensible C++ execution engine library for data management systems.
DuckDB is an analytical in-process SQL database management system
Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!
Apache Doris is an easy-to-use, high performance and unified analytics database.
Self-Driving Database Management System from Carnegie Mellon University
🥑 ArangoDB is a native multi-model database with flexible data models for documents, graphs, and key-values. Build high performance applications using a convenient SQL-like query language or JavaSc…
A list of learning materials to understand database internals
Rich is a Python library for rich text and beautiful formatting in the terminal.
Python 3 library to aid coding with VeChain, e.g. wallets, transactions, signing, and verification.
Python library that makes it easy for data scientists to create charts.
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, etc. It also comes with Hadoop support built in.
Debugger capable of attaching to and injecting code into python processes.
nanomsg-next-generation -- light-weight brokerless messaging