Skip to content
View rsohlot's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report rsohlot

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

69 stars written in Scala
Clear filter

Source code for the X Recommendation Algorithm

Scala 67,968 12,644 Updated Sep 8, 2025

Apache Spark - A unified analytics engine for large-scale data processing

Scala 42,507 28,973 Updated Dec 17, 2025

♞ lichess.org: the forever free, adless and open source chess server ♞

Scala 17,467 2,519 Updated Dec 17, 2025

CMAK is a tool for managing Apache Kafka clusters

Scala 11,941 2,500 Updated Aug 2, 2023

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,468 1,971 Updated Dec 16, 2025

Open-source high-performance RISC-V processor

Scala 6,792 850 Updated Dec 17, 2025

Simple and Distributed Machine Learning

Scala 5,191 854 Updated Dec 15, 2025

sbt, the interactive build tool

Scala 4,875 957 Updated Dec 17, 2025

State of the Art Natural Language Processing

Scala 4,083 736 Updated Dec 16, 2025

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Scala 3,555 574 Updated Nov 4, 2025

Spark: The Definitive Guide's Code Repository

Scala 3,069 2,878 Updated Aug 26, 2020

REST job server for Apache Spark

Scala 2,846 985 Updated Jul 8, 2025

A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagement. A friendly Learn-to-Rank engine

Scala 2,376 106 Updated Sep 24, 2025

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala 2,285 971 Updated Dec 17, 2025

TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning

Scala 2,270 401 Updated Sep 29, 2023

Scala language server with rich IDE features 🚀

Scala 2,262 395 Updated Dec 17, 2025

Apache Spark to Apache Cassandra connector

Scala 1,946 929 Updated Apr 29, 2025

Base classes to use when writing tests with Spark

Scala 1,545 354 Updated Nov 21, 2025

Build highly concurrent, distributed, and resilient message-driven applications using Java/Scala

Scala 1,505 185 Updated Dec 17, 2025

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala 1,488 552 Updated Dec 17, 2025

command line options parsing for Scala

Scala 1,447 161 Updated Sep 6, 2025

High performance data store solution

Scala 1,444 704 Updated Nov 10, 2025

Mirror of Apache griffin

Scala 1,175 589 Updated Aug 3, 2025

Source files for SiFive's Freedom platforms

Scala 1,133 285 Updated Jul 17, 2021

Apache DataFusion Comet Spark Accelerator

Scala 1,086 258 Updated Dec 17, 2025

CSV Data Source for Apache Spark 1.x

Scala 1,057 440 Updated Dec 13, 2018

Spark RAPIDS plugin - accelerate Apache Spark with GPUs

Scala 951 266 Updated Dec 17, 2025

Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.

Scala 937 619 Updated Dec 16, 2025

CPU and GPU-accelerated Machine Learning Library

Scala 918 170 Updated Oct 4, 2022

An open protocol for secure data sharing

Scala 906 215 Updated Dec 10, 2025
Next