Stars
🎓 Path to a free self-taught education in Computer Science!
This is a repo with links to everything you'd ever want to learn about data engineering
Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼
D2 is a modern diagram scripting language that turns text to diagrams.
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
An open-source screen recorder built with web technology
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.
Free MLOps course from DataTalks.Club
700+ Pure CSS, SVG & Figma UI Icons, 6000+ glyphs, patterns, colors and layouts.
https://huyenchip.com/ml-interviews-book/
Preswald is a WASM packager for Python-based interactive data apps: bundle full complex data workflows, particularly visualizations, into single files, runnable completely in-browser, using Pyodide…
A non-validating SQL parser module for Python
Hosting read-only SQLite databases on static file hosts like GitHub Pages
Mesa is an open-source Python library for agent-based modeling, ideal for simulating complex systems and exploring emergent behaviors.
Causal Inference for the Brave and True. A light-hearted yet rigorous approach to learning about impact estimation and causality.
Python interactive dashboards for learning data science
A collection of research papers on decision, classification and regression trees with implementations.
👩‍🏫 Advanced NLP with spaCy: A free online course
🦆 A curated list of awesome DuckDB resources
nannyml: post-deployment data science in Python
A list of publicly available datasets with real-time data maintained by the team at bytewax.io
R & stats illustrations by @allison_horst
A guide for technical professionals looking to start consulting
An end-to-end GoodReads data pipeline for building a data lake, data warehouse, and analytics platform.