Starred repositories
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
DuckDB is an analytical in-process SQL database management system
A composable and fully extensible C++ execution engine library for data management systems.
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
LangChain4j is an idiomatic, open-source Java library for building LLM-powered applications on the JVM. It offers a unified API over popular LLM providers and vector stores, and makes implementing …
Pocket Flow: Codebase to Tutorial
A library that provides an embeddable, persistent key-value store for fast storage.
GoogleTest - Google Testing and Mocking Framework
Apache Spark - A unified analytics engine for large-scale data processing
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.
A software library of stochastic streaming algorithms, a.k.a. sketches.
Apache Kafka® running on Kubernetes
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Programming framework for writing and deploying cloud applications.
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
Chia blockchain python implementation (full node, farmer, harvester, timelord, and wallet)
An advanced guide which might benefit you a lot 🎉 . 人生进阶指南 离谱的人生 离谱的英语学习指南/英语学习教程/英语学习/学英语
The Internals of Spark on Kubernetes
🧑💻 Full ZIO 2 Stack: A sample IM that uses zio, zio-redis, zio-actors, zio-schema, zio-streams, zio-crypto, circe, tapir, akka-http,redis4cats.
A java agent to generate method mappings to use with the linux `perf` tool