-
Databricks Inc.
- Belgrade, Serbia
- https://linkedin.com/in/maxgekk/
Stars
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
A tool to get better debug info on spark's memory usage
Tips for developing Apache Spark, especially in IntelliJ IDEA
All the things about TPC-DS in Apache Spark
The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while control…
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Code that'll help you kickstart a personal website that showcases your work as a software developer.
Spark Structured Streaming State Tools
Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
A lightweight library to inject LLVM bitcode into JVMs
Log analyser / visualiser for Java HotSpot JIT compiler. Inspect inlining decisions, hot methods, bytecode, and assembly. View results in the JavaFX user interface.
Run spark calculations from Ammonite
A scala library for interacting with the slack api and real time messaging interface
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Qubole Sparklens tool for performance tuning Apache Spark
This repository contains the development code for sparkMeasure, an Apache Spark performance analysis and troubleshooting library. It simplifies collecting, aggregating, and exporting Spark task/sta…
Schema Registry integration for Apache Spark
Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol
Example project showing how to use Hive UDFs in Apache Spark
Apache Spark - A unified analytics engine for large-scale data processing
Apache Kafka - A distributed event streaming platform