Stars
WebAssembly powered code blocks and exercises for both the R and Python languages in Quarto documents
Open-source scientific and technical publishing system built on Pandoc.
Demos to implement your Databricks Lakehouse
Column-wise type annotations for pyspark DataFrames
Code for setting up a local Spark development environment
Create web-based user interfaces with Python. The nice way.
Code for DE101 book at https://de101.startdataengineering.com/
High performance, self-hosted, newsletter and mailing list manager with a modern dashboard. Single binary app.
Advanced Spark SQL for Data Engineers
aider is AI pair programming in your terminal
Code for extracting data from API with Python
How to quickly deliver data to business users?
The Web framework for perfectionists with deadlines.
Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.
Step by step instructions to create a production-ready data pipeline
PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster
🐍 Quick reference guide to common patterns & functions in PySpark.
JupyterLab computational environment.
Simple ETL demonstrated with literate programming
The fastest way to create an HTML app
Repository for Data Engineering Interview Series