Stars
A production-ready PySpark project template with medallion architecture, Python packaging, unit tests, integration tests, CI/CD automation, Databricks Asset Bundles, and DQX data quality framework.
21 Lessons, Get Started Building with Generative AI
A curated list of awesome Recommender System (Books, Conferences, Researchers, Papers, Github Repositories, Useful Sites, Youtube Videos)
Example of project using Databricks Asset Bundle
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
This is a repo with links to everything you'd ever want to learn about data engineering
Examples of Databricks Asset Bundles
Examples of using Terraform to deploy Databricks resources
Databricks Implementation of the TPC-DI Specification using Traditional Notebooks and/or Delta Live Tables
This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.
Data Engineering Practice Problems