Stars
Drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.
Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…
AI-Native & Cloud-Native FS: A high-performance file semantic layer for cloud object storage, integrated with high-speed cache
The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query processing
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
Uniffle is a high performance, general purpose Remote Shuffle Service.
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Apache Spark - A unified analytics engine for large-scale data processing
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apache Kyuubi
Alibaba Java Coding Guidelines pmd implements and IDE plugin
The Context Platform for your Data and AI Stack
hsweb (haʊs wɛb) 是一个基于spring-boot 2.x开发 ,首个使用全响应式编程的企业级后台管理系统基础项目。
An easy to use, self-service open BI reporting and BI dashboard platform.
High Performance Inter-Thread Messaging Library
Benchmark comparing serialization libraries on the JVM
A Spring Framework based, pragmatic style JavaEE application reference architecture.
Ctrip Hadoop Job Scheduling System derived from https://github.com/alibaba/zeus