Stars
Apache Spark - A unified analytics engine for large-scale data processing
A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.
A fault tolerant, protocol-agnostic RPC system
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
The leader in Customer Data Infrastructure
Cassovary is a simple big graph processing library for the JVM
Distributed State Management for Serverless
Large off-heap arrays and mmap files for Scala and Java
Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce.
Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange