Stars
ilum-cloud / marquez
Forked from MarquezProject/marquezCollect, aggregate, and visualize a data ecosystem's metadata
Apache Spark Native DataSource for Safetensors
Technically-oriented PDF Collection (Papers, Specs, Decks, Manuals, etc)
Microservice pattern demos (Saga, EventSourcing, CQRS...) running on .NET Aspire
Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are initialized. This also allows extending the Spark metrics syst…
SparkConnect Server plugin and protobuf messages for the Amazon Deequ Data Quality Engine.
Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)
Brings H3 - Hexagonal hierarchical geospatial indexing system support to Apache Spark SQL
A collection of learning resources for curious software engineers
Code, exercises, answers, and hints to go along with the book "Functional Programming in Scala"
Jargon from the functional programming world in simple terms!
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…
Interactive roadmaps, guides and other educational content to help developers grow in their careers.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
A curated list of awesome Machine Learning frameworks, libraries and software.
Workshop Pragmatic Introduction to Category Theory
This is a repo with links to everything you'd ever want to learn about data engineering