Skip to content
View arjunsk's full-sized avatar
:octocat:
Learning!
:octocat:
Learning!

Organizations

@dborchard @iarjunsk @dsorchard @csorchard

Block or report arjunsk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

31 results for source starred repositories written in Scala
Clear filter

Source code for the X Recommendation Algorithm

Scala 72,923 13,270 Updated Sep 8, 2025

Apache Spark - A unified analytics engine for large-scale data processing

Scala 43,038 29,135 Updated Mar 25, 2026

A fault tolerant, protocol-agnostic RPC system

Scala 8,873 1,440 Updated Feb 2, 2026

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,644 2,027 Updated Mar 25, 2026

A machine learning package built for humans.

Scala 4,802 565 Updated Nov 6, 2025

Fault tolerant job scheduler for Mesos which handles dependencies and ISO8601 based schedules

Scala 4,383 521 Updated Jun 29, 2022

ZIO — A type-safe, composable library for async and concurrent programming in Scala

Scala 4,364 1,465 Updated Mar 25, 2026

Spark: The Definitive Guide's Code Repository

Scala 3,113 2,897 Updated Aug 26, 2020

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala 2,313 988 Updated Mar 23, 2026

Apache Spark to Apache Cassandra connector

Scala 1,950 926 Updated Apr 29, 2025

Base classes to use when writing tests with Spark

Scala 1,549 355 Updated Mar 23, 2026

Protocol buffer compiler for Scala.

Scala 1,334 295 Updated Mar 24, 2026

Apache DataFusion Comet Spark Accelerator

Scala 1,156 295 Updated Mar 24, 2026

Livy is an open source REST interface for interacting with Apache Spark from anywhere

Scala 1,007 312 Updated Oct 5, 2022

Development in Shark has been ended.

Scala 993 323 Updated Aug 11, 2015

TiSpark is built for running Apache Spark on top of TiDB/TiKV

Scala 893 252 Updated Mar 17, 2026

Essential Spark extensions and helper methods ✨😲

Scala 766 150 Updated Sep 14, 2025

Ergo protocol description & reference client implementation

Scala 517 194 Updated Mar 24, 2026

Unified SQL Analytics Engine Based on SparkSQL

Scala 211 58 Updated Dec 5, 2022

Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange

Scala 131 29 Updated Dec 19, 2024

Nebula-Algorithm is a Spark Application based on GraphX, which enables state of art Graph Algorithms to run on top of NebulaGraph and write back results to NebulaGraph.

Scala 81 40 Updated Aug 19, 2024

sql interface for solr cloud

Scala 40 15 Updated Sep 16, 2022

Flink Examples

Scala 38 27 Updated Apr 27, 2016

A Scala implementation of the geohashing algorithm

Scala 18 14 Updated Sep 27, 2023

A library for reading data from Amzon S3 with optimised listing using Amazon SQS using Spark SQL Streaming ( or Structured streaming).

Scala 18 12 Updated Apr 20, 2024

Some AWS EMR examples

Scala 16 4 Updated Jan 18, 2018

Scala utility to send mail

Scala 14 7 Updated May 4, 2020

Pub/Sub built on top of FoundationDB

Scala 13 3 Updated Aug 13, 2024

GridDB connector for Apache Spark

Scala 4 4 Updated Dec 26, 2022
Next