-
primeNumber
- TOKYO
- https://twitter.com/satoshihirose
Starred repositories
Apache Spark - A unified analytics engine for large-scale data processing
A Git platform powered by Scala with easy installation, high extensibility & GitHub API compatibility
The leader in Customer Data Infrastructure
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Breeze is/was a numerical processing library for Scala.
In-memory message queue with an Amazon SQS-compatible interface. Runs stand-alone or embedded.
A Scala API for Apache Beam and Google Cloud Dataflow.
Compile-time Language Integrated Queries for Scala
Scala combinator library for building Finagle HTTP services
A simple-build-tool (sbt) plugin/processor for creating IntelliJ IDEA project files
[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark
🚝 "Scala on Rails" - A full-stack web app framework for rapid development in Scala
Redshift data source for Apache Spark
A Scala feature transformation library for data science and machine learning
Macros for simple/safe RPCs between Scala applications, including ScalaJS/ScalaJVM
Puck is a lightning-fast parser for natural languages using GPUs
the scala protocol buffers (protobuf) compiler
CKite - A JVM implementation of the Raft distributed consensus algorithm written in Scala
Implementation of a non blocking Redis client in Scala using Akka IO
Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs and reports.