Broadway is a distributed actor-based processing server optimized for high-speed data/file ingestion
Updated Mar 29, 2016 - Scala
Out-of-the-box scheduling, logging, monitoring, and data governance for your Scala ETL jobs.
Enron Email ETL
A standalone ETL tool to generate advanced features for your Machine Learning projects
An opinionated way to structure ETL pipelines with a heavy focus on reusability and testing
A project that uses Spark to transform *.n3 files into CSV files
Mole is a Spark library that supports designing and implementing ETL work in a configuration file, so that general changes (such as source/sink locations and transformations) do not require repackaging and redeploying the submitted jar.