Stars
Apache Spark - A unified analytics engine for large-scale data processing
Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3
PredictionIO, a machine learning server for developers and ML engineers.
Deploy and manage containers (including Docker) on top of Apache Mesos at scale.
Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.
Plugin for sbt to create Eclipse project definitions
Connect Spark to HBase for reading and writing data with ease
Examples for Spark Training in chinahadoop.cn
PredictionIO / template-scala-parallel-universal-recommendation
Forked from pferrel/template-scala-parallel-universal-recommendationPredictiionIO Template for Universal Recommender
PredictionIO E-Commerce Recommendation Engine Template (Scala-based parallelized engine)
DirectKafka examples for Spark Streaming : 1. with checkpointing 2. Custom offset management
Natural Language Processing algorithm including TextClassification, sentiment analysis, TextRank, LDA and so on
Spark SQL External HBase Snapshot Source
gaoyangkuanglong / marathon
Forked from d2iq-archive/marathonDeploy and manage containers (including Docker) on top of Apache Mesos at scale.