- San Francisco, CA
Stars
5
stars
written in Scala
Clear filter
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
An open-source toolkit for large-scale genomic analysis
A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations of possible data sources. Multiple execution modes in multipl…