etl
Here are 79 public repositories matching this topic...
A Rule-Based discount calculating engine for a retail store. The engine reads transaction data, applies various discount rules based on the product, date, and other criteria, and writes the results back to a database.
-
Updated
May 13, 2024 - Scala
Data Brewery is an ETL (Extract-Transform-Load) program that connect to many data sources (cloud services, databases, ...) and manage data warehouse workflow.
-
Updated
Jan 21, 2021 - Scala
Let's be honest - most data pipeline frameworks treat types as suggestions. Config files are strings. Schemas are "validated" at runtime. Data quality is an afterthought. So, let's do differently
-
Updated
Nov 10, 2025 - Scala
data pipeline using the lambda architecture is created for the unification and consolidation of real-time customer web events, weblogs, and profile data into a hive warehouse for adhoc analysis.
-
Updated
Oct 4, 2020 - Scala
Sample notebooks on Azure Databricks for ETL
-
Updated
May 20, 2023 - Scala
A project to use spark for transform the *.n3 file to CSV file
-
Updated
Jan 8, 2019 - Scala
Repository for playing with spark
-
Updated
Oct 13, 2020 - Scala
This Scala ETL pipeline processes transaction CSV files by applying discounts and calculating final prices using pure functional programming principles. The transformed data is then loaded into an Oracle database, demonstrating end-to-end data processing with type safety and immutability.
-
Updated
May 17, 2025 - Scala
Improve this page
Add a description, image, and links to the etl topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the etl topic, visit your repo's landing page and select "manage topics."