Build software better, together

microsoft / data-factory-testing-framework

A stand-alone test framework that allows to write unit tests for Data Factory pipelines on Microsoft Fabric, Azure Data Factory and Azure Synapse Analytics.

fabric test unit-tests testing-framework azure-data-factory data-factory functional-tests azure-synapse microsoft-fabric

Updated Jan 26, 2026
Python

airscholar / FootballDataEngineering

Star

An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Azure Data Lake. Other processing takes place on Azure Data Factory, Azure Synapse and Tableau.

data-engineering dataengineering azure-data-factory apache-airflow azure-databricks azure-synapse-analytics azure-data-lake-gen2

Updated Oct 2, 2023
Python

alexanderbean / E2E-Data-Engineering-in-Azure

Star

End-to-end ETL pipeline in the Microsoft Azure cloud - (Jun '24 - Jul '24)

azure power-bi data-engineering azure-data-factory etl-pipeline azure-databricks azure-synapse-analytics

Updated Aug 9, 2024
Python

KChand1 / covid19-data-engineering

Star

power-bi pyspark databricks azure-data-factory etl-pipeline azure-sql azure-data-engineering data-engineering-project adls-gen2 covid19-analytics

Updated Apr 14, 2026
Python

dishadas168 / demand-forecasting-ebay

Star

A demand forecasting pipeline deployed on Azure and AWS

aws aws-lambda azure aws-s3 azure-functions aws-rds demand-forecasting azure-sql-database azure-data-factory price-prediction aws-step-functions aws-glue azure-ml azure-databricks azure-pipelines ebay-search aws-eventbridge aws-quicksight

Updated Nov 8, 2023
Python

areed1192 / trading-system

Sponsor

Star

A python application that leverages Microsoft Azure to build a fully functional trading system.

python azure azure-data-factory iex-api azure-blob-storage azure-key-vault azure-sql td-ameritrade-api azure-rbac

Updated May 23, 2023
Python

sanketrs / implementation-of-modern-data-engineering-architecture-with-fabric_analytics

Star

Building a next-generation hybrid data pipeline architecture that combines the power of Microsoft Fabric, Azure Cloud, and Power BI. This pipeline is engineered to tackle the challenges of real-time data ingestion, multi-layered processing, and analytics, delivering business-critical insights.

data-science etl azure data-visualization data-warehouse data-engineering data-analytics etl-framework azure-data-factory big-data-analytics cloud-dataflow etl-pipeline big-data-projects data-engineering-pipeline bi-analytics data-pipeline-monitoring cloud-data-warehouse azure-fabric data-engineering-project

Updated Dec 29, 2024
Python

SAMRAT47 / Project-1-End-to-End-Spotify-Data-Engineering-with-DABs-DLT

Star

Azure End To End Data Engineering Project | Azure Data Factory | Azure Databricks | Azure SQL DB | PySpark | Big Data. It is a in depth Data Engineering project using powerful tools like Azure Data Factory, Azure SQL DB, Azure Databricks, Unity Catalog, Delta Live Tables, Spark Streaming, PySpark, Databricks Asset Bundles, GitHub, and more.

pyspark data-engineering databricks azure-data-factory etl-pipeline delta-live-tables

Updated Jan 12, 2026
Python

giufalcao / Formula-1

Star

A data pipeline project build on databricks and azure to demostrate lifecycle of a cloud data project.

python azure jupyter-notebook data-visualization pyspark databricks data-pipeline azure-data-factory azure-databricks

Updated Jan 5, 2024
Python

jashshah-dev / Large-Language-Models-for-Medical-Data-Extraction

Star

Using LLMS to extract medical Data for Doctors and Hospitals

azure-data-factory azure-machine-learning-studio rag llm prompt-engineering azure-prompt-flow

Updated Jan 1, 2024
Python

AfonsoFeliciano / Imersao-Azure-Big-Data

Star

Treinamento Azure Data Factory + Azure Databricks + Azure Analysis Services + Power BI

python sql big-data pipeline adb azure dataflow dax adf databricks m aas azure-data-factory analysis-services pbi engenharia-de-dados

Updated Jan 24, 2022
Python

sfrechette / adfv2-datapipeline

Star

Azure Data Factory v2 datapipeline sample code snippets...

spark pyspark data-processing data-pipeline azure-data-factory

Updated Mar 9, 2018
Python

xuf-95 / azure_data_factory_etl_json_parse

Star

Parse Azure Data Factoty ETL pipeline to json

azure-data-factory azure-pipelines

Updated Aug 14, 2025
Python

msaleh1888 / azure-ml-customer-segmentation

Star

Production-grade customer segmentation pipeline built on Azure (Blob Storage, Data Factory, Azure ML, Batch Endpoint). Includes end-to-end data engineering, feature engineering, K-Means model training, and scalable batch inference.

python machine-learning azure clustering data-engineering infrastructure-as-code kmeans azure-machine-learning azure-data-factory customer-segmentation azure-blob-storage azure-ml mlops ml-pipeline production-ml cloud-ml batch-inference end-to-end-ml

Updated Nov 26, 2025
Python

srimantapal205 / Subject-Wise-Question---Answer

Star

This branch focuses on building Data Engineering Interview Question and Answer

python sql spark adb snowflake data-warehouse pyspark data-engineering project-management adf data-modeling datamodel data-engineer adls azure-data-factory azure-databricks azure-devops azure-data-lake-storage azure-delta-lake

Updated Jan 10, 2026
Python

sainikhilp / ridestream-data-pipeline

Star

RideStream is a scalable Azure-based lakehouse project designed for ride-hailing analytics. It combines batch ingestion from HTTP/internal sources with streaming booking events from Event Hubs, processes them in Databricks, and delivers a clean analytics model through a silver-layer OBT and a gold-layer star schema.

eventhubs azure-storage batch-processing databricks azure-data-factory declarative-pipeline delta-live-tables medallion-architecture streaming-tables

Updated Mar 23, 2026
Python

AmeeJoshi-MCA / Spotify-EndToEnd-Azure-Data-Engineering-Pipeline

Star

End-to-end Azure Data Engineering project using ADF for incremental ingestion, Databricks (DLT) for Medallion Architecture, and Delta Lake for CDC (SCD Type 1). Managed via Databricks Asset Bundles (DABs) for professional CI/CD. Focuses on real-time streaming, scalability, and Star Schema modeling.

data-engineering cdc azure-data-factory azure-databricks delta-lake cloud-data-platform delta-live-tables unity-catalog azure-data-engineering medallion-architecture databricks-asset-bundles end-to-end-data-pipelines adls-gen2

Updated Jan 29, 2026
Python

AbdulrehmanGit / F1-Data-Engineering-Pipeline-with-Azure-Databricks

Star

This project demonstrates a comprehensive data engineering pipeline capable of transforming vast amounts of historical F1 data into valuable insights. By leveraging modern cloud-based technologies and automating key processes, the project is designed to be both scalable and adaptable to future requirements.

python cloud sql azure power-bi pyspark data-lake databricks historical-data azure-data-factory azure-databricks delta-lake adlsgen2 medallion-architecture

Updated Aug 26, 2024
Python

rheaacharya77 / ETL-Olympics

Star

ETL pipeline tailored for Olympics data

python azure azure-sql-database azure-data-lake azure-data-factory etl-pipeline

Updated Jun 12, 2024
Python

khoinguyen19k8 / formula1

Star

Data pipeline that processes Formula1 data with Azure Databricks, DeltaLake, and Azure Data Factory

azure databricks azure-data-factory delta-lake

Updated Jul 14, 2023
Python

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

azure-data-factory

Here are 53 public repositories matching this topic...

microsoft / data-factory-testing-framework

airscholar / FootballDataEngineering

alexanderbean / E2E-Data-Engineering-in-Azure

KChand1 / covid19-data-engineering

dishadas168 / demand-forecasting-ebay

areed1192 / trading-system

sanketrs / implementation-of-modern-data-engineering-architecture-with-fabric_analytics

SAMRAT47 / Project-1-End-to-End-Spotify-Data-Engineering-with-DABs-DLT

giufalcao / Formula-1

jashshah-dev / Large-Language-Models-for-Medical-Data-Extraction

AfonsoFeliciano / Imersao-Azure-Big-Data

sfrechette / adfv2-datapipeline

xuf-95 / azure_data_factory_etl_json_parse

msaleh1888 / azure-ml-customer-segmentation

srimantapal205 / Subject-Wise-Question---Answer

sainikhilp / ridestream-data-pipeline

AmeeJoshi-MCA / Spotify-EndToEnd-Azure-Data-Engineering-Pipeline

AbdulrehmanGit / F1-Data-Engineering-Pipeline-with-Azure-Databricks

rheaacharya77 / ETL-Olympics

khoinguyen19k8 / formula1

Improve this page

Add this topic to your repo