SPARK
Capture the logical plan from Spark (SQL)
Tracks data lineage in Spark. The team aims to improve column-level relationship tracking during logical plan analysis.
Notes on the design and implementation of Apache Spark
Qubole Sparklens, a tool for performance tuning of Apache Spark
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
SQL parser written using Scala's parser combinator library
A macro library for working with Spark SQL in a type-safe way.
Extensible rules engine for custom DataFrame / Dataset validation
Examples for High Performance Spark