Stars
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
Free MLOps course from DataTalks.Club
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
A simple, high-throughput file client for mounting an Amazon S3 bucket as a local file system.
Pluggable in-process caching engine to build and scale high-performance services
FUSE implementation in Java using Java Native Runtime (JNR)
Streaming data platform. Real-time stream processing, low-latency serving, and Iceberg table management.
Probably the fastest coroutine lib in the world!
A library that allows you to easily mock out tests based on AWS infrastructure.
Prometheus documentation: content and static site generator
The gflags package contains a C++ library that implements commandline flags processing. It includes built-in support for standard types such as string and the ability to define flags in the source …
A collection of modern C++ libraries, including coro_http, coro_rpc, compile-time reflection, struct_pack, struct_json, struct_xml, struct_pb, easylog, async_simple, etc.
brpc is an industrial-grade RPC framework written in C++, often used in high-performance systems such as Search, Storage, Machine learning, Advertisement, Recommendation, etc. "brpc" mea…
A high performance and generic framework for distributed DNN training
mimalloc is a compact general purpose allocator with excellent performance.
Curve is a sandbox project hosted by the CNCF Foundation. It's cloud-native, high-performance, and easy to operate. Curve is an open-source distributed storage system for block and shared file stor…
Netperf is a benchmark that can be used to measure the performance of many different types of networking. It provides tests for both unidirectional throughput and end-to-end latency.
Neon: Serverless Postgres. We separated storage and compute to offer autoscaling, code-like database branching, and scale to zero.
A list of learning materials for understanding database internals
Kata Containers is an open source project and community working to build a standard implementation of lightweight Virtual Machines (VMs) that feel and perform like containers, but provide the workl…
A full-featured file system for online data storage
lakeFS - Data version control for your data lake | Git for data