arjunsk

Learning!

Arjun Sunil Kumar arjunsk

Learning!

"It always seems impossible until it's done." - Nelson Mandela

520 followers · 1.9k following

@aws
Sunnyvale, CA
11:55 (UTC -08:00)
in/arjunsk15

Achievements

x3 x2

Achievements

x3 x2

Highlights

Developer Program Member

Organizations

Lists (1)

Sort

db-to-learn

Starred repositories

33 stars written in Scala

Clear filter

twitter / the-algorithm

Source code for the X Recommendation Algorithm

Scala 67,743 12,621 Updated Sep 8, 2025

apache / spark

Apache Spark - A unified analytics engine for large-scale data processing

Scala 42,261 28,926 Updated Nov 9, 2025

twitter / finagle

A fault tolerant, protocol-agnostic RPC system

Scala 8,853 1,448 Updated Oct 15, 2025

delta-io / delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,387 1,948 Updated Nov 8, 2025

airbnb / aerosolve

A machine learning package built for humans.

Scala 4,801 564 Updated Nov 6, 2025

mesos / chronos

Fault tolerant job scheduler for Mesos which handles dependencies and ISO8601 based schedules

Scala 4,385 523 Updated Jun 29, 2022

zio / zio

ZIO — A type-safe, composable library for async and concurrent programming in Scala

Scala 4,300 1,369 Updated Nov 9, 2025

twitter-archive / flockdb

A distributed, fault-tolerant graph database

Scala 3,332 253 Updated Mar 16, 2017

databricks / Spark-The-Definitive-Guide

Spark: The Definitive Guide's Code Repository

Scala 3,054 2,876 Updated Aug 26, 2020

apache / kyuubi

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala 2,269 971 Updated Nov 8, 2025

apache / cassandra-spark-connector

Apache Spark to Apache Cassandra connector

Scala 1,947 929 Updated Apr 29, 2025

holdenk / spark-testing-base

Base classes to use when writing tests with Spark

Scala 1,545 355 Updated Oct 27, 2025

scalapb / ScalaPB

Protocol buffer compiler for Scala.

Scala 1,328 293 Updated Nov 6, 2025

apache / datafusion-comet

Apache DataFusion Comet Spark Accelerator

Scala 1,066 249 Updated Nov 9, 2025

cloudera / livy

Livy is an open source REST interface for interacting with Apache Spark from anywhere

Scala 1,007 313 Updated Oct 5, 2022

amplab / shark

Development in Shark has been ended.

Scala 994 325 Updated Aug 11, 2015

pingcap / tispark

TiSpark is built for running Apache Spark on top of TiDB/TiKV

Scala 889 251 Updated Jul 12, 2025

mrpowers-io / spark-daria

Essential Spark extensions and helper methods ✨😲

Scala 764 152 Updated Sep 14, 2025

YotpoLtd / metorikku

A simplified, lightweight ETL Framework based on Apache Spark

Scala 587 158 Updated Jan 24, 2024

ergoplatform / ergo

Ergo protocol description & reference client implementation

Scala 508 178 Updated Nov 6, 2025

Qihoo360 / XSQL

Unified SQL Analytics Engine Based on SparkSQL

Scala 212 58 Updated Dec 5, 2022

MemVerge / splash

Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange

Scala 129 29 Updated Dec 19, 2024

vesoft-inc / nebula-algorithm

Nebula-Algorithm is a Spark Application based on GraphX, which enables state of art Graph Algorithms to run on top of NebulaGraph and write back results to NebulaGraph.

Scala 76 41 Updated Aug 19, 2024