zeotuan
🏠 Working from home


Apache Kylin

Java 3,767 1,512 Updated Mar 13, 2026

Collect, aggregate, and visualize a data ecosystem's metadata

Java 15 2 Updated Apr 1, 2026

Apache Spark Native DataSource for Safetensors

Scala 6 1 Updated Feb 24, 2026

Technically-oriented PDF Collection (Papers, Specs, Decks, Manuals, etc)

HTML 9,634 1,813 Updated Dec 25, 2025

Microservice pattern demos (Saga, EventSourcing, CQRS...) running on .NET Aspire

C# 325 126 Updated Apr 3, 2026

Drop-in replacement for Apache Spark UI

TypeScript 442 52 Updated Apr 14, 2026
TypeScript 9 Updated Aug 6, 2025

Data Engineer Roadmap for 2024

153 23 Updated Dec 10, 2025

VSCode theme for all Jetbrains IDEs

Kotlin 1,148 33 Updated Feb 20, 2026

Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow running custom code on the executors as they are initialized. This also allows extending the Spark metrics syst…

Scala 95 14 Updated May 9, 2025

The Scala 3 compiler, also known as Dotty.

Scala 6,228 1,152 Updated Apr 14, 2026

SparkConnect Server plugin and protobuf messages for the Amazon Deequ Data Quality Engine.

Python 26 6 Updated Feb 22, 2025

Delta Lake helper methods. No Spark dependency.

Python 22 7 Updated Jan 19, 2026
Python 9 2 Updated Sep 20, 2025

Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)

Scala 455 77 Updated Apr 2, 2026

Delta Lake and filesystem helper methods

Scala 50 11 Updated Feb 29, 2024

Apache DataFusion SQL Query Engine

Rust 8,605 2,039 Updated Apr 14, 2026

Brings H3 - Hexagonal hierarchical geospatial indexing system support to Apache Spark SQL

Scala 7 4 Updated Jan 28, 2025

A collection of learning resources for curious software engineers

Python 50,739 3,966 Updated Apr 14, 2026

The Open Source Data Science Masters

26,080 6,140 Updated Dec 3, 2023

Code, exercises, answers, and hints to go along with the book "Functional Programming in Scala"

Scala 5,810 3,027 Updated Dec 11, 2024

Jargon from the functional programming world in simple terms!

18,639 1,007 Updated Oct 17, 2023

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 8,763 1,419 Updated Jan 28, 2026

Interactive roadmaps, guides and other educational content to help developers grow in their careers.

TypeScript 352,895 43,915 Updated Apr 14, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 56,646 9,689 Updated Nov 12, 2025

A curated list of awesome Machine Learning frameworks, libraries and software.

Python 72,178 15,402 Updated Apr 12, 2026

Workshop Pragmatic Introduction to Category Theory

Scala 130 27 Updated Jun 19, 2021

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 40,972 7,784 Updated Apr 2, 2026

Wonderful reusable code from Twitter

Scala 2,724 572 Updated Dec 8, 2025