Skip to content
View andre-salvati's full-sized avatar

Block or report andre-salvati

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
16 results for source starred repositories
Clear filter

A production-ready PySpark project template with medallion architecture, Python packaging, unit tests, integration tests, CI/CD automation, Databricks Asset Bundles, and DQX data quality framework.

Python 47 23 Updated Feb 3, 2026

21 Lessons, Get Started Building with Generative AI

Jupyter Notebook 106,093 56,847 Updated Feb 3, 2026

Readymade evaluators for your LLM apps

Python 894 81 Updated Dec 18, 2025

A curated list of awesome Recommender System (Books, Conferences, Researchers, Papers, Github Repositories, Useful Sites, Youtube Videos)

1,438 211 Updated Feb 13, 2022

Example of project using Databricks Asset Bundle

Jupyter Notebook 42 32 Updated Aug 6, 2024

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

Python 59,277 1,586 Updated Feb 5, 2026

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 39,620 7,601 Updated Dec 15, 2025

A guide for leading a data (engineering) team

64 5 Updated May 7, 2024

Databricks SDK for Python (Beta)

Python 518 184 Updated Feb 4, 2026
Python 217 148 Updated Feb 2, 2026

Examples of Databricks Asset Bundles

Python 282 110 Updated Feb 2, 2026

Examples of using Terraform to deploy Databricks resources

HCL 307 206 Updated Feb 4, 2026

Databricks Implementation of the TPC-DI Specification using Traditional Notebooks and/or Delta Live Tables

Python 93 287 Updated Dec 22, 2025

This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.

Python 46 24 Updated Jan 27, 2025

Data Engineering Practice Problems

Python 2,513 784 Updated Jan 8, 2025