code snippets to write Apache Spark applications using Java
-
Updated
Oct 15, 2015 - Java
Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
code snippets to write Apache Spark applications using Java
Investigating the trade-offs of low latency responses over quality when applying machine learning algorithms over lambda architecture.
Apache Spark Basics - Java Examples
Apache Spark Streaming - Java Examples
Apache Spark Machine Learning - Java Examples
Natural Language Processing - Java Example
Implementing an extensible lambda architecture
A library having Java and Scala examples for Spark 2.x
Java based Convolutional Neural Network package running on Apache Spark framework
Java based Convolutional Neural Network package running on Apache Spark framework
Provides a scaffold to easily build a cluster to query the data from ESA's Gaia satellite. Gaia is an ambitious mission to chart a three-dimensional map of our Galaxy, the Milky Way. Gaia will provide unprecedented positional and radial velocity measurements with the accuracies needed to produce a stereoscopic and kinematic census of about one b…
Real-time Log Analyzer using Apache Flume, Spark Streaming and HDFS written in Scala and Java
Sorting 1TB of data using Hadoop Map Reduce, Apache Spark and custom Java solution
Mirror of Apache Beam
This is an activator project for showcasing integration of Kafka 0.10 with Spark Streaming.
Created by Matei Zaharia
Released May 26, 2014