Stars
Event Driven Orchestration & Scheduling Platform for Mission Critical Applications
AI-driven database tool and SQL client, The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, DB2, SQLite, H2, ClickHouse, and more.
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Apache Druid: a high performance real-time analytics database.
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …
Alluxio, data orchestration for analytics and machine learning in the cloud
Flink CDC is a streaming data integration tool
Apache Pinot - A realtime distributed OLAP datastore
Open, Multi-modal Catalog for Data & AI
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
Database Subsetting and Relational Data Browsing Tool.
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
Collect, aggregate, and visualize a data ecosystem's metadata
Apache Atlas - Open Metadata Management and Governance capabilities across the Hadoop platform and beyond
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
OFD Reader & Writer 开源的OFD处理库,支持文档生成、数字签名、文档保护、文档合并、转换、导出等功能,文档格式遵循《GB/T 33190-2016 电子文件存储与交换格式版式文档》。
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.
Ontop is a platform to query relational databases as Virtual RDF Knowledge Graphs using SPARQL
Open Control Plane for Tables in Data Lakehouse