Stars
Apache Spark - A unified analytics engine for large-scale data processing
Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3
A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.
CMAK is a tool for managing Apache Kafka clusters
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Byzer (formerly MLSQL): A low-code open-source programming language for data pipelines, analytics, and AI.
Build highly concurrent, distributed, and resilient message-driven applications using Java/Scala