Stars
Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.
BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data ever…
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …
Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.
Uniffle is a high performance, general purpose Remote Shuffle Service.
Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
Apache Doris is an easy-to-use, high performance and unified analytics database.
这是我自己的Flink中文社区翻译稿存储仓库,用于提供给需要朋友进行二次创作。同时提供Flink一些课外的相关知识文档供大家学习
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.
📋分享我订阅的一些 Blog 和 Newsletter,通过 Github Actions,每天自动同步我 Feedly 上的订阅源,✅ 代表能正常订阅,❌ 代表暂无法订阅(对于无法订阅的 feed,支持 Telegram Bot、Email、Server酱等推送工具提醒更新)
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
Remote shuffle service for Apache Spark to store shuffle data on remote servers.
Submarine is Cloud Native Machine Learning Platform.
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Upserts, Deletes And Incremental Processing on Big Data.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
This is a library for SQL optimizing/rewriting including Materialized View rewrite
JSqlParser parses an SQL statement and translate it into a hierarchy of Java classes. The generated hierarchy can be navigated using the Visitor Pattern
A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources
An ad hoc query service based on the spark sql engine.(基于spark sql引擎的即席查询服务)
基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join,支持原生flink SQL所有的语法
Alibaba Java Diagnostic Tool Arthas/Alibaba Java诊断利器Arthas
JVM Profiler Sending Metrics to Kafka, Console Output or Custom Reporter