Skip to content
View arjunsk's full-sized avatar
:octocat:
Learning!
:octocat:
Learning!

Organizations

@dborchard @iarjunsk @dsorchard @csorchard

Block or report arjunsk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

33 stars written in Scala
Clear filter

Source code for the X Recommendation Algorithm

Scala 67,743 12,621 Updated Sep 8, 2025

Apache Spark - A unified analytics engine for large-scale data processing

Scala 42,261 28,926 Updated Nov 9, 2025

A fault tolerant, protocol-agnostic RPC system

Scala 8,853 1,448 Updated Oct 15, 2025

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,387 1,948 Updated Nov 8, 2025

A machine learning package built for humans.

Scala 4,801 564 Updated Nov 6, 2025

Fault tolerant job scheduler for Mesos which handles dependencies and ISO8601 based schedules

Scala 4,385 523 Updated Jun 29, 2022

ZIO — A type-safe, composable library for async and concurrent programming in Scala

Scala 4,300 1,369 Updated Nov 9, 2025

A distributed, fault-tolerant graph database

Scala 3,332 253 Updated Mar 16, 2017

Spark: The Definitive Guide's Code Repository

Scala 3,054 2,876 Updated Aug 26, 2020

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala 2,269 971 Updated Nov 8, 2025

Apache Spark to Apache Cassandra connector

Scala 1,947 929 Updated Apr 29, 2025

Base classes to use when writing tests with Spark

Scala 1,545 355 Updated Oct 27, 2025

Protocol buffer compiler for Scala.

Scala 1,328 293 Updated Nov 6, 2025

Apache DataFusion Comet Spark Accelerator

Scala 1,066 249 Updated Nov 9, 2025

Livy is an open source REST interface for interacting with Apache Spark from anywhere

Scala 1,007 313 Updated Oct 5, 2022

Development in Shark has been ended.

Scala 994 325 Updated Aug 11, 2015

TiSpark is built for running Apache Spark on top of TiDB/TiKV

Scala 889 251 Updated Jul 12, 2025

Essential Spark extensions and helper methods ✨😲

Scala 764 152 Updated Sep 14, 2025

A simplified, lightweight ETL Framework based on Apache Spark

Scala 587 158 Updated Jan 24, 2024

Ergo protocol description & reference client implementation

Scala 508 178 Updated Nov 6, 2025

Unified SQL Analytics Engine Based on SparkSQL

Scala 212 58 Updated Dec 5, 2022

Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange

Scala 129 29 Updated Dec 19, 2024

Nebula-Algorithm is a Spark Application based on GraphX, which enables state of art Graph Algorithms to run on top of NebulaGraph and write back results to NebulaGraph.

Scala 76 41 Updated Aug 19, 2024

sql interface for solr cloud

Scala 40 15 Updated Sep 16, 2022

Flink Examples

Scala 38 27 Updated Apr 27, 2016

A Scala implementation of the geohashing algorithm

Scala 18 14 Updated Sep 27, 2023

A library for reading data from Amzon S3 with optimised listing using Amazon SQS using Spark SQL Streaming ( or Structured streaming).

Scala 17 12 Updated Apr 20, 2024

Some AWS EMR examples

Scala 16 4 Updated Jan 18, 2018

Scala utility to send mail

Scala 14 7 Updated May 4, 2020

Pub/Sub built on top of FoundationDB

Scala 14 3 Updated Aug 13, 2024
Next