11 stars written in Scala (starred by TJX2014)

Apache Spark - A unified analytics engine for large-scale data processing

Scala | 42,512 stars | 28,973 forks | Updated Dec 18, 2025

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala | 8,470 stars | 1,971 forks | Updated Dec 18, 2025

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala | 2,284 stars | 971 forks | Updated Dec 18, 2025

Byzer (formerly MLSQL): A low-code, open-source programming language for data pipelines, analytics, and AI.

Scala | 1,849 stars | 548 forks | Updated May 29, 2024

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala | 1,487 stars | 552 forks | Updated Dec 17, 2025

Apache DataFusion Comet Spark Accelerator

Scala | 1,086 stars | 258 forks | Updated Dec 17, 2025

Spark RAPIDS plugin: accelerate Apache Spark with GPUs

Scala | 951 stars | 267 forks | Updated Dec 17, 2025

This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination…

Scala | 803 stars | 159 forks | Updated Nov 6, 2025

An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.

Scala | 430 stars | 116 forks | Updated Jan 14, 2022

All the things about TPC-DS in Apache Spark

Scala | 108 stars | 43 forks | Updated Jun 15, 2023