Starred repositories
Apache Spark - A unified analytics engine for large-scale data processing
A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.
CMAK is a tool for managing Apache Kafka clusters
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Tiny Scala high-performance, async web framework, inspired by Sinatra
[PROJECT IS NO LONGER MAINTAINED] Code examples that show how to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization fo…
XML data source for Spark SQL and DataFrames
A small web app to monitor the progress of Kafka consumers and their lag with respect to the log.
thunderain-project / StreamSQL
Forked from apache/spark. Mirror of Apache Spark
Sample Spark Streaming application for secure consumption from Kafka