satoshihirose

satoshihirose satoshihirose

Data Engineering Guy

19 followers · 61 following

Achievements

Starred repositories

42 results for source starred repositories written in Scala

Clear filter

apache / spark

Apache Spark - A unified analytics engine for large-scale data processing

Scala 42,235 28,920 Updated Nov 6, 2025

gitbucket / gitbucket

A Git platform powered by Scala with easy installation, high extensibility & GitHub API compatibility

Scala 9,313 1,262 Updated Nov 1, 2025

snowplow / snowplow

The leader in Customer Data Infrastructure

Scala 6,964 1,190 Updated Jun 4, 2025

airbnb / aerosolve

A machine learning package built for humans.

Scala 4,801 565 Updated Nov 6, 2025

awslabs / deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Scala 3,535 573 Updated Nov 4, 2025

scalanlp / breeze

Breeze is/was a numerical processing library for Scala.

Scala 3,456 694 Updated Oct 4, 2025

softwaremill / elasticmq

In-memory message queue with an Amazon SQS-compatible interface. Runs stand-alone or embedded.

Scala 2,758 203 Updated Nov 2, 2025

twitter / util

Wonderful reusable code from Twitter

Scala 2,722 577 Updated Oct 15, 2025

spotify / scio

A Scala API for Apache Beam and Google Cloud Dataflow.

Scala 2,611 527 Updated Oct 28, 2025

http4s / http4s

A minimal, idiomatic Scala interface for HTTP

Scala 2,599 807 Updated Oct 30, 2025

zio / zio-quill

Compile-time Language Integrated Queries for Scala

Scala 2,167 349 Updated Nov 6, 2025

finagle / finch

Scala combinator library for building Finagle HTTP services

Scala 1,605 221 Updated Sep 14, 2025

ThoughtWorksInc / Binding.scala

Reactive data-binding for Scala

Scala 1,584 102 Updated Nov 2, 2025

tumblr / colossus

I/O and Microservice library for Scala

Scala 1,134 97 Updated Aug 14, 2021

mpeltonen / sbt-idea

A simple-build-tool (sbt) plugin/processor for creating IntelliJ IDEA project files

Scala 1,070 148 Updated Dec 27, 2017

databricks / spark-csv

CSV Data Source for Apache Spark 1.x

Scala 1,057 441 Updated Dec 13, 2018

databricks / tensorframes

[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark

Scala 747 161 Updated Jul 30, 2024

skinny-framework / skinny-framework

🚝 "Scala on Rails" - A full-stack web app framework for rapid development in Scala

Scala 737 68 Updated Dec 14, 2022

scallop / scallop

a simple Scala CLI parsing library

Scala 680 58 Updated Aug 19, 2025

wvlet / airframe

Essential Building Blocks for Scala

Scala 659 71 Updated Nov 5, 2025

databricks / spark-redshift

Redshift data source for Apache Spark

Scala 608 349 Updated Aug 10, 2023

spotify / featran

A Scala feature transformation library for data science and machine learning

Scala 469 69 Updated Feb 7, 2025

lihaoyi / autowire

Macros for simple/safe RPCs between Scala applications, including ScalaJS/ScalaJVM

Scala 380 48 Updated Jan 4, 2022

aws / sagemaker-spark

A Spark library for Amazon SageMaker.

Scala 300 130 Updated Mar 8, 2025

dlwh / puck

Puck is a lightning-fast parser for natural languages using GPUs

Scala 248 29 Updated Nov 1, 2014

FaKod / neo4j-scala

Scala wrapper for Neo4j Graph Database

Scala 220 71 Updated Jun 1, 2017

SandroGrzicic / ScalaBuff

the scala protocol buffers (protobuf) compiler

Scala 219 76 Updated Aug 18, 2017

pablosmedina / ckite

CKite - A JVM implementation of the Raft distributed consensus algorithm written in Scala

Scala 214 26 Updated Jan 8, 2019

debasishg / scala-redis-nb

Implementation of a non blocking Redis client in Scala using Akka IO

Scala 201 38 Updated Jun 3, 2018

intuit / superglue

Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs and reports.

Scala 159 38 Updated Dec 10, 2022

Vue.js

X (Twitter)

Scala

Python

Publishing

Parsing

Natural language processing

Machine learning

Jupyter Notebook

Java

See all starred topics