-
Supa Lab
- Pretoria, South Africa
-
06:38
(UTC +02:00) - https://supalab.github.io
Stars
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
StAEDI - Streaming API for EDI: Java library featuring a reader/parser, writer/generator, and validation