Skip to content
View MaxGekk's full-sized avatar
🚸
Taking care
🚸
Taking care

Organizations

@apache @databricks

Block or report MaxGekk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Apache Spark Connect Client for Swift

Swift 30 7 Updated Mar 21, 2026

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala 2,311 988 Updated Mar 20, 2026

A tool to get better debug info on spark's memory usage

Scala 42 15 Updated Aug 21, 2019

Tips for developing Apache Spark, especially in IntelliJ IDEA

3 1 Updated Jan 24, 2020

Command line history manager for bash

C++ 29 3 Updated Mar 11, 2023

All the things about TPC-DS in Apache Spark

Scala 110 44 Updated Jun 15, 2023
Jupyter Notebook 7 2 Updated Aug 23, 2021

Task Metrics Explorer

Scala 14 9 Updated Apr 2, 2019

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while control…

Python 24,905 5,460 Updated Mar 22, 2026

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,637 2,025 Updated Mar 22, 2026

Code that'll help you kickstart a personal website that showcases your work as a software developer.

HTML 7,602 6,589 Updated Dec 21, 2023

Spark Structured Streaming State Tools

Scala 34 8 Updated Jul 3, 2020

Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks

C++ 23,466 1,228 Updated Mar 20, 2026

Koalas: pandas API on Apache Spark

Python 3,373 364 Updated Mar 20, 2024

A lightweight library to inject LLVM bitcode into JVMs

C++ 88 7 Updated Dec 9, 2019

Log analyser / visualiser for Java HotSpot JIT compiler. Inspect inlining decisions, hot methods, bytecode, and assembly. View results in the JavaFX user interface.

Java 3,275 460 Updated Dec 28, 2025

Run spark calculations from Ammonite

Scala 117 17 Updated Feb 20, 2026

Spark SQL index for Parquet tables

Scala 134 35 Updated May 6, 2021

A scala library for interacting with the slack api and real time messaging interface

Scala 191 104 Updated Aug 28, 2024

Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive

Scala 187 33 Updated Oct 15, 2025

Qubole Sparklens tool for performance tuning Apache Spark

Scala 590 143 Updated Jun 26, 2024

This repository contains the development code for sparkMeasure, an Apache Spark performance analysis and troubleshooting library. It simplifies collecting, aggregating, and exporting Spark task/sta…

Scala 818 159 Updated Mar 4, 2026

Schema Registry integration for Apache Spark

Scala 40 17 Updated Nov 16, 2022

Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol

Scala 34 19 Updated Sep 8, 2022

Example project showing how to use Hive UDFs in Apache Spark

Scala 55 22 Updated Apr 23, 2019

Scala library for .netrc files

Scala 2 1 Updated Feb 1, 2018

Simple jdbc client for Apache Spark

Scala 7 1 Updated Dec 16, 2017

Apache Spark - A unified analytics engine for large-scale data processing

Scala 43,013 29,130 Updated Mar 21, 2026

Apache Kafka - A distributed event streaming platform

Java 32,207 15,060 Updated Mar 21, 2026
Next