Skip to content
View kjenq's full-sized avatar

Block or report kjenq

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 39,043 7,492 Updated Dec 15, 2025

Bootstrap Kubernetes the hard way. No scripts.

46,736 15,362 Updated Apr 10, 2025

12 Weeks, 24 Lessons, AI for All!

Jupyter Notebook 44,496 8,899 Updated Dec 21, 2025

Code and walkthrough labs to set up serverless applications for Wild Rydes workshops

JavaScript 4,261 2,641 Updated Jul 29, 2024

Using Git and GitHub with R, Rstudio, and R Markdown

TeX 613 341 Updated Jan 31, 2025

Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable.

Scala 169 9 Updated Feb 10, 2024

Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

Jupyter Notebook 2,337 167 Updated Dec 6, 2025

⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io

Python 2,264 251 Updated Dec 19, 2025

Terraform enables you to safely and predictably create, change, and improve infrastructure. It is a source-available tool that codifies APIs into declarative configuration files that can be shared …

Go 47,315 10,140 Updated Dec 19, 2025

A list of useful resources to learn Data Engineering from scratch

3,922 561 Updated Jun 19, 2024

A curated list of data engineering tools for software developers

8,102 1,406 Updated Nov 30, 2025

Roadmap to becoming a data engineer in 2021

12,730 1,363 Updated Jan 25, 2022

The Data Engineering Cookbook

Python 14,864 2,684 Updated Oct 6, 2025

Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼

Jupyter Notebook 33,997 7,203 Updated Dec 19, 2025

Self-serve BI to 10x your data team ⚡️

TypeScript 5,409 653 Updated Dec 19, 2025

Projects and exercises for the latest Deep Learning ND program https://www.udacity.com/course/deep-learning-nanodegree--nd101

Jupyter Notebook 5,456 5,347 Updated Jun 27, 2023

A lightning fast JavaScript grid/spreadsheet

JavaScript 7,145 1,951 Updated Feb 26, 2023

Courseware setup and information for instructors

Ruby 250 201 Updated Nov 7, 2025

Documentation behind the model used to analyse companies in Simply Wall St

1,604 426 Updated Nov 12, 2024

🎸 A statistical analysis of every artist on Triple J Unearthed

Python 1 Updated Aug 7, 2018

🏫 nodeschool internet web page

JavaScript 939 466 Updated Jun 25, 2024

freeCodeCamp.org's open-source codebase and curriculum. Learn math, programming, and computer science for free.

TypeScript 435,042 42,882 Updated Dec 21, 2025