Highlights
- Pro
Lists (6)
Sort Name ascending (A-Z)
Stars
PredictionIO, a machine learning server for developers and ML engineers.
GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion.
Zero-cost, compile-time, type-safe dependency injection library.
GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs
Essential Spark extensions and helper methods ✨😲
A tool for catching binary incompatibility in Scala
Morpheus brings the leading graph query language, Cypher, onto the leading distributed processing platform, Spark.
Neo4j Connector for Apache Spark, which provides bi-directional read/write access to Neo4j from Spark, using the Spark DataSource APIs
An open-source toolkit for large-scale genomic analysis
A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scalable training and ONNX export for easy cross-platform infere…
Coordinated (etcd, ...) cluster construction for dynamic (cloud, containers) environments
Provides GPU awareness to Spark, Contact: @kmadhugit and @kiszk
RTree2D is a 2D immutable R-tree for ultra-fast nearest and intersection queries in plane and spherical coordinates
Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are initialized. This also allows extending the Spark metrics syst…
LocationTech SFCurve is a Scala library for the creation, transformation, and querying of space-filling curves
A framework for writing Spark 2.x applications in a pretty way
A framework for Spatio-Temporal Data Analytics on Spark
Project template for Play Framework 2.x demonstrating subprojects (useful for SaaS setup)
Akka Streams example of how to interleave Sources with priorities