Skip to content
View nhat416's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report nhat416

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

47 stars written in Scala
Clear filter

Apache Spark - A unified analytics engine for large-scale data processing

Scala 42,507 28,973 Updated Dec 17, 2025

♞ lichess.org: the forever free, adless and open source chess server ♞

Scala 17,467 2,520 Updated Dec 17, 2025

Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3

Scala 14,430 3,103 Updated Dec 12, 2025

The Community Maintained High Velocity Web Framework For Java and Scala.

Scala 12,612 4,073 Updated Dec 17, 2025

PredictionIO, a machine learning server for developers and ML engineers.

Scala 12,535 1,918 Updated Jan 9, 2021

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,468 1,971 Updated Dec 16, 2025

The leader in Customer Data Infrastructure

Scala 6,985 1,187 Updated Jun 4, 2025

Apache OpenWhisk is an open source serverless cloud platform

Scala 6,737 1,175 Updated Dec 8, 2025

ZIO — A type-safe, composable library for async and concurrent programming in Scala

Scala 4,321 1,379 Updated Dec 17, 2025

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Scala 3,555 574 Updated Nov 4, 2025

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala 2,285 971 Updated Dec 17, 2025

Deploy über-JARs. Restart processes. (port of codahale/assembly-sbt)

Scala 1,961 224 Updated Sep 29, 2025

This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

Scala 1,362 782 Updated Jan 28, 2025

Apache DataFusion Comet Spark Accelerator

Scala 1,086 258 Updated Dec 17, 2025

A simple-build-tool (sbt) plugin/processor for creating IntelliJ IDEA project files

Scala 1,068 147 Updated Dec 27, 2017

CSV Data Source for Apache Spark 1.x

Scala 1,057 440 Updated Dec 13, 2018

Chronon is a data platform for serving for AI/ML applications.

Scala 950 86 Updated Dec 10, 2025

The software used to extract structured data from Wikipedia

Scala 915 291 Updated Nov 6, 2025

The code examples used in Programming Scala, 2nd and 3rd Editions (O'Reilly)

Scala 651 405 Updated Dec 13, 2025

Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)

Scala 452 78 Updated Aug 8, 2025

A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.

Scala 346 57 Updated May 31, 2024

The official repository for the Rock the JVM Scala 2 for beginners course

Scala 339 285 Updated Apr 25, 2024

Mirror of Apache Bahir

Scala 335 193 Updated Jul 7, 2023

The official repository for the Rock the JVM Spark Essentials with Scala course

Scala 278 368 Updated Sep 10, 2025

Snowflake Data Source for Apache Spark.

Scala 230 106 Updated Dec 17, 2025

Serving AI/ML models in the open standard formats PMML and ONNX with both HTTP (REST API) and gRPC endpoints

Scala 164 31 Updated Dec 17, 2025

The Scala 2 version (old) of the Advanced Scala course

Scala 164 163 Updated Sep 24, 2023

Spark in Action, 2nd edition - chapter 1 - Introduction

Scala 107 70 Updated Apr 21, 2023

Automated data quality suggestions and analysis with Deequ on AWS Glue

Scala 90 23 Updated Dec 29, 2022

Using Hadoop with Scala

Scala 70 30 Updated Oct 5, 2013
Next