Lists (16)
Sort Name ascending (A-Z)
Starred repositories
8
stars
written in Scala
Clear filter
Apache Spark - A unified analytics engine for large-scale data processing
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Data Lineage Tracking And Visualization Solution
gaecoli / spark
Forked from apache/sparkApache Spark - A unified analytics engine for large-scale data processing