Skip to content
View TawfikYasser's full-sized avatar
🐿️
squirrel
🐿️
squirrel

Organizations

@EddieHubCommunity @MakeContributions

Block or report TawfikYasser

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Flower detection using yolov5 trained on custom dataset.

Jupyter Notebook 2 Updated Aug 11, 2024

All materials you need for Federated Learning: blogs, videos, papers, and softwares, etc.

Shell 725 97 Updated Nov 16, 2025

A tool for creating and running Linux containers using lightweight virtual machines on a Mac. It is written in Swift, and optimized for Apple silicon.

Swift 22,920 566 Updated Dec 22, 2025

Official Dockerfile for Delta Lake

Jupyter Notebook 58 30 Updated Sep 19, 2025

The resources of the preparation course for Databricks Data Engineer Associate certification exam

Python 552 887 Updated Dec 12, 2025

This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.

211 113 Updated Aug 11, 2024

Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize.

Python 9 2 Updated Sep 27, 2024

محتوى تقني متميز في مختلف مجالات هندسة البرمجيات عن طريق تبسيط المفاهيم البرمجية المعقدة بشكل سلس وباستخدام صور توضيحية مذهلة

Python 679 31 Updated Sep 1, 2025

End-to-end data pipeline that ingests, processes, and stores data. It uses Apache Airflow to schedule scripts that fetch data from an API, sends the data to Kafka, and processes it with Spark befor…

Python 20 9 Updated Jul 26, 2024

Apache Airflow advanced functionalities examples

Python 21 3 Updated Mar 22, 2024

This repository is build in association with our position paper on "Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers". As a part of this release we share the informati…

321 26 Updated Jan 10, 2022

Source code for alfy.me

CSS 4 1 Updated Dec 15, 2025

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 39,089 7,508 Updated Dec 15, 2025

A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.

Python 79 166 Updated Aug 21, 2023

Fully dockerized Data Warehouse (DWH) using Airflow, dbt, PostgreSQL and dashboard using redash

Jupyter Notebook 25 4 Updated Nov 12, 2022

Code for "Efficient Data Processing in Spark" Course

Python 353 72 Updated Oct 16, 2025

This project demonstrates how to use Apache Airflow to submit jobs to Apache spark cluster in different programming laguages using Python, Scala and Java as an example.

Java 46 28 Updated Mar 14, 2024

used Airflow, Postgres, Kafka, Spark, and Cassandra, and GitHub Actions to establish an end-to-end data pipeline

Python 29 7 Updated Oct 25, 2023

📃 hire me! resume built with jekyll and hosted on https://insanj.github.io/resume/

HTML 7 1 Updated Aug 6, 2025

This project implements an ELT (Extract - Load - Transform) data pipeline with the goodreads dataset, using dagster (orchestration), spark (calculation) and dbt (transformation)

Jupyter Notebook 42 7 Updated Apr 22, 2023

Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.

Java 65 21 Updated Sep 26, 2023

A real-time reddit data streaming pipeline for sentiment analysis of various subreddits

HCL 140 19 Updated Aug 23, 2023

Learn PySpark from Basics to Advanced. Checkout the YouTube Series : [PySpark - Zero to Hero]

Jupyter Notebook 112 178 Updated Sep 7, 2025

Code for Data Pipelines with Apache Airflow

Python 811 403 Updated Aug 15, 2024

Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra

Python 143 62 Updated Jul 27, 2023

Educational project on how to build an ETL (Extract, Transform, Load) data pipeline, orchestrated with Airflow.

Python 343 54 Updated Jan 12, 2022
Next