Stars
Skills for Real Engineers. Straight from my .claude directory.
Production-grade engineering skills for AI coding agents.
π2.3x faster than MinIO for 4KB object payloads. RustFS is an open-source, S3-compatible high-performance object storage system supporting migration and coexistence with other S3-compatible platforβ¦
A Java library for generating Time-Sorted Unique Identifiers (TSID).
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
a curated list of awesome streaming frameworks, applications, etc
hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE
Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
A curated list of awesome big data frameworks, ressources and other awesomeness.
A curated list of data engineering tools for software developers
π Awesome lists about all kinds of interesting topics
π Cube Core is open-source semantic layer for AI, BI and embedded analytics
A GPU-powered real-time analytics storage and query engine.
A load balancer / proxy / gateway for prestodb
Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.
Kerberos and Hadoop: The Madness beyond the Gate
ππ‘ The Powerful Component to Display and Edit Data. Experience the Ultimate Data Transformer!
ππ Beautiful chart for data visualization.
ππ Markdown WYSIWYG Editor. GFM Standard + Chart & UML Extensible.