Skip to content
View josephmachado's full-sized avatar
:octocat:
Working
:octocat:
Working

Block or report josephmachado

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

WebAssembly powered code blocks and exercises for both the R and Python languages in Quarto documents

TypeScript 234 32 Updated Aug 19, 2025

A Swiss Army knife for developers.

C# 30,630 1,693 Updated Oct 26, 2025

Open-source scientific and technical publishing system built on Pandoc.

JavaScript 5,129 394 Updated Dec 24, 2025

Python Testing for Databricks

Python 102 10 Updated Oct 17, 2025

Demos to implement your Databricks Lakehouse

HTML 388 148 Updated Dec 9, 2025

Column-wise type annotations for pyspark DataFrames

Python 95 16 Updated Dec 24, 2025

Code for setting up a local Spark development environment

Python 10 33 Updated Aug 23, 2025

Create web-based user interfaces with Python. The nice way.

Python 14,967 893 Updated Dec 24, 2025

Code for DE101 book at https://de101.startdataengineering.com/

HTML 72 429 Updated Dec 6, 2025

A curated list of awesome jq tools and resources.

918 44 Updated Aug 31, 2025

High performance, self-hosted, newsletter and mailing list manager with a modern dashboard. Single binary app.

Go 18,494 1,858 Updated Dec 20, 2025

my scripts!

Shell 494 37 Updated Nov 6, 2025

Advanced Spark SQL for Data Engineers

Jupyter Notebook 12 4 Updated Jun 21, 2025

SQL tips and tricks

SQL 2,261 99 Updated Nov 23, 2025

aider is AI pair programming in your terminal

Python 39,185 3,759 Updated Dec 18, 2025

Code for extracting data from API with Python

Jupyter Notebook 3 Updated Apr 23, 2025

How to quickly deliver data to business users?

Jupyter Notebook 7 3 Updated Sep 2, 2025

Repo to show how to modularize messy SQL

Python 10 Updated Feb 7, 2025

Code for using dbt seed cross projects

2 1 Updated Jan 13, 2025

The Web framework for perfectionists with deadlines.

Python 86,250 33,402 Updated Dec 24, 2025

Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.

Scala 534 138 Updated Mar 16, 2022
Jupyter Notebook 3 1 Updated Oct 11, 2024
Dockerfile 15 1 Updated Dec 11, 2023

Step by step instructions to create a production-ready data pipeline

Jupyter Notebook 58 13 Updated Dec 23, 2024

PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster

Python 483 235 Updated Oct 15, 2024

🐍 Quick reference guide to common patterns & functions in PySpark.

640 200 Updated Feb 21, 2023

JupyterLab computational environment.

TypeScript 14,949 3,854 Updated Dec 24, 2025

Simple ETL demonstrated with literate programming

Python 7 Updated Aug 20, 2024

The fastest way to create an HTML app

Jupyter Notebook 6,752 289 Updated Dec 22, 2025

Repository for Data Engineering Interview Series

Jupyter Notebook 34 3 Updated Oct 17, 2024
Next