Starred repositories
Apache Spark - A unified analytics engine for large-scale data processing
Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3
Apache OpenWhisk is an open source serverless cloud platform
This repository contains the development code for sparkMeasure, an Apache Spark performance analysis and troubleshooting library. It simplifies collecting, aggregating, and exporting Spark task/sta…
Project for James' Apache Spark with Scala course
A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster.
SeaTunnel plugin development examples.
A sample project showing how to run a Spark Streaming app with Kafka in Docker