Senior Data Engineer (Databricks, Spark, Data Lakehouse, Data Modeling, AWS, Azure, Python, R)
9 stars written in Python
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Data Engineering Practice Problems
Examples of Databricks Asset Bundles
Databricks Implementation of the TPC-DI Specification using Traditional Notebooks and/or Delta Live Tables
A production-ready PySpark project template with medallion architecture, Python packaging, unit tests, integration tests, CI/CD automation, Databricks Asset Bundles, and DQX data quality framework.
This repo is a collection of tools to deploy, manage, and operate a Databricks-based Lakehouse.