Stars
Databricks Implementation of the TPC-DI Specification using Traditional Notebooks and/or Delta Live Tables
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
21 Lessons, Get Started Building with Generative AI
Readymade evaluators for your LLM apps
This is a repo with links to everything you'd ever want to learn about data engineering
Examples of Databricks Asset Bundles
Examples of using Terraform to deploy Databricks resources
A production-ready PySpark project template with medallion architecture, Python packaging, unit tests, integration tests, CI/CD automation, Databricks Asset Bundles, and DQX data quality framework.
This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.
Data Engineering Practice Problems
Example of project using Databricks Asset Bundle
A curated list of awesome Recommender System (Books, Conferences, Researchers, Papers, Github Repositories, Useful Sites, Youtube Videos)