- Paris / France / Europe
Stars
Apache Spark - A unified analytics engine for large-scale data processing
Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3
A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Apache OpenWhisk is an open source serverless cloud platform
State of the Art Natural Language Processing
REST job server for Apache Spark
Base classes to use when writing tests with Spark
An experimental library for Functional Reactive Programming in Scala
TensorFlow API for the Scala Programming Language
RxScala – Reactive Extensions for Scala – a library for composing asynchronous and event-based programs using observable sequences
This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination…
[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark
Stanford CoreNLP wrapper for Apache Spark
Production-ready K-Means clustering for Apache Spark with pluggable Bregman divergences (KL, Itakura-Saito, L1, etc). 6 algorithms, 740 tests, cross-version persistence. Drop-in replacement for MLl…
A simple FRP library and a web UI framework built on it
Machine Learning over Twitter's stream. Using Apache Spark, Web Server and Lightning Graph server.
A neural network library which trained by Spark RDD instances.