Stars
This is a repo with links to everything you'd ever want to learn about data engineering
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
Source code for Big Data: Principles and best practices of scalable realtime data systems
π Awesome Data Catalogs and Observability Platforms.
Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here ππΌ
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
self-hosted, Netflix-like app made for streaming
π Path to a free self-taught education in Computer Science!
Solution Architecture links, articles, books, video lessons, etc.
Import arbitrary code from Stack Overflow as Python modules.
Roadmap to becoming a data engineer in 2021
Self hosted FLOSS fitness/workout, nutrition and weight tracker
List of Computer Science courses with video lectures.
Files for Udemy Course on Algorithms and Data Structures
Machine learning Fantasy Premier League team
π₯ Awesome list of resources on Web Development.
Udacity Nanodegree and Course Downloader
Projects done in the Data Engineering Nanodegree by Udacity.com
Career tips for Software Engineers and Recruiters
Always know what to expect from your data.
leetcode problems I solved to prepare for my Google interview.