Stars
A tool to benchmark L (loading) workloads within ETL workloads
An extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Linux Foundation.
Apache Kafka® compatible broker with S3, PostgreSQL, SQLite, Apache Iceberg and Delta Lake
Upserts, Deletes And Incremental Processing on Big Data.
A composable and fully extensible C++ execution engine library for data management systems.
Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.
Flyway by Redgate • Database Migrations Made Easy.
A fully-featured AWS Athena database driver (+ athenareader https://github.com/uber/athenadriver/tree/master/athenareader)
An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.
💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline
Collect, aggregate, and visualize a data ecosystem's metadata
High Performance Inter-Thread Messaging Library
Alibaba Java Diagnostic Tool Arthas
vinothchandar / hudi
Forked from apache/hudi
Spark Library for Hadoop Upserts And Incrementals
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
A Fast Key-Value Storage Engine Based on Hierarchical B+-Tree Trie
A collection of inspiring resources related to engineering management and tech leadership
Spinnaker is an open source, multi-cloud continuous delivery platform for releasing software changes with high velocity and confidence.
YugabyteDB - the cloud native distributed SQL database for mission-critical applications.
Ceph is a distributed object, block, and file storage platform
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊
Native ZooKeeper client for Go. This project is no longer maintained. Please use https://github.com/go-zookeeper/zk instead.
rockset / rocksdb-cloud
Forked from facebook/rocksdb
A library that provides an embeddable, persistent key-value store for fast storage optimized for AWS
Efficient reliable UDP unicast, UDP multicast, and IPC message transport
A collection of C++ HTTP libraries including an easy to use HTTP server.