Starred repositories
Resources for the preparation course for the Databricks Data Engineer Associate certification exam
PySpark-Tutorial provides basic algorithms using PySpark
An analytics engineering sandbox focusing on real estate prices in Cook County, IL
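📘 "Intermediate Python" (Chinese version)
Say goodbye to dull tutorials: a collection of practical little Python examples. More quality Python tutorials at https://ai-jupyter.com
An open-source Python project, "The Road to Self-Taught Programming": step-by-step tutorials covering an AI lab, curated videos, data structures, study guides, hands-on machine learning, hands-on deep learning, web scraping, big-tech interview experiences, programmer life, and resource sharing.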
Solutions to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift, Data Lake with Spark, and Data Pipeline with Airflow.
Data visualization, customer segmentation, CLV, and next-purchase prediction
Free MLOps course from DataTalks.Club
My ETL as well as the logging.
This repository will contain my day-to-day progress toward becoming a data engineer
Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼
This repository is for assessing my own learning and growth as a Data Engineer on a daily basis for the next 100 days
A practice project that analyzes a Twitter data set obtained through the Twitter developer API, using Apache Flume, Hive, and HDFS
A new repository to capture work related to the DLME ETL pipeline and to set up Airflow
Data pipeline project that extracts data from an Azure Database for PostgreSQL server and a CSV file from Azure Blob Storage, then loads it into its final destination on a Postgres database server.
This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging the data, filling the data warehouse, and running checks on…
Python data repo: Jupyter notebook, Python scripts, and data.
Interactive roadmaps, guides and other educational content to help developers grow in their careers.
A Hadoop cluster based on Docker, including Hive and Spark.
New Repo: https://github.com/byzer-org/kolo-lang
pyspark🍒🥭 is delicious, just eat it!😋😋
Summary of and link to finalized projects that were reviewed and approved by Udacity in order to meet the requirements for obtaining the Data Analyst Nanodegree: