Template for Spark Data Science Projects
-
Updated
Oct 21, 2017 - Makefile
Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
Template for Spark Data Science Projects
An image for running Scala Jupyter notebooks and Apache Spark in the cloud on OpenShift
Personal notes and lab solutions for the Data Engineer Handbook Bootcamp
Created by Matei Zaharia
Released May 26, 2014