Skip to content
View ZakariaAlz's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report ZakariaAlz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Jupyter Notebook 1 Updated Apr 7, 2024

Azure Data Engineering Roadmap

4 Updated Apr 25, 2024

Roadmap for Data Engineering

Java 242 30 Updated Jun 20, 2024

Data Engineer Roadmap for 2024

158 23 Updated Dec 10, 2025

Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼

Jupyter Notebook 40,853 8,160 Updated May 3, 2026

The Data Engineering Cookbook

Python 15,088 2,712 Updated Jan 17, 2026

Roadmap to becoming a data engineer in 2021

12,752 1,343 Updated Jan 25, 2022

A list of useful resources to learn Data Engineering from scratch

3,991 569 Updated Jun 19, 2024

Implementing best practices for PySpark ETL jobs and applications.

Python 2,101 803 Updated Jan 1, 2023

The best place to learn data engineering. Built and maintained by the data engineering community.

CSS 1,935 234 Updated Apr 6, 2026

Data Engineering Practice Problems

Python 2,681 825 Updated Jan 8, 2025

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Scala 3,616 583 Updated May 14, 2026

This project has been crafted during my second year within CESI School of Engineering

PHP 1 Updated Apr 2, 2024
PHP 1 Updated Apr 4, 2024
Jupyter Notebook 1 Updated Apr 1, 2024

Hey there! 👋 This was my very first data engineering project, built with my good friend Aymen Benniou—who we lost last year. Prayers for him. 🙏 We set up Kafka to stream customer feedback, used Fli…

Java 1 Updated Jul 26, 2025
Jupyter Notebook 1 Updated Mar 1, 2025

This Repository represents my Portfolio, feel free to check it out, I'm looking forward to discuss some opportunities :D

1 Updated May 14, 2026

Big Data Internship – Hadoop + Spark environment for analyzing telecom data (CDR).

Jupyter Notebook 2 1 Updated Jul 14, 2025

Data Engineering with Spark and Delta Lake

TSQL 106 78 Updated Jan 18, 2023

An orchestration platform for the development, production, and observation of data assets.

Python 15,507 2,123 Updated May 14, 2026

Full stack data engineering tools and infrastructure set-up

Python 58 20 Updated Feb 13, 2021

Practical Data Engineering: A Hands-On Real-Estate Project Guide

Jupyter Notebook 800 133 Updated Mar 10, 2026

Data Engineering Project: Extracting music video metrics of Twice using YouTube API, AWS, and Tableau

Python 32 10 Updated Nov 21, 2023

This is a public repository to go over all the LLM-driven data engineering concepts.

Python 1,151 229 Updated Oct 26, 2024

Get All Github achievements

1,304 37 Updated Dec 6, 2025
Next