Gallery of Apache Zeppelin notebooks using Enth-Spark-AI.
-
Updated
Jun 15, 2017
Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
Gallery of Apache Zeppelin notebooks using Enth-Spark-AI.
An image for running Scala Jupyter notebooks and Apache Spark in the cloud on OpenShift
Big Data Management related Zeppelin notebooks
python rdd notebook in apache spark
A repository for ipython notebook backup
pyspark notebook with movie lens dataset
text analysis ipython notebooks for text analysis
Sample notebooks on Azure Databricks for ETL
Notebooks for Python and Spark for Big Data
Heart disease classification with data mining(Zeppelin Notebook)
Unix scripts & jupyter notebooks for COMP47470 Big Data Programming
Apache Spark cluster connected to a Jupyter Notebook instance
Pyspark and Spark [ My Notes and all practise Notebook ]
Exercise files and notebooks for learning Apache Spark DataFrames and SQL
PySpark word count and OpenCV motion detection implemented in a single Colab notebook.
Juypiter Notebooks to demonstrate the Lambda Architecture with Kafka Streams and Apache Spark
2019 Canadian Federal Election: Calculating the results using Apache Spark (Databricks notebook in Scala)
Simulating a consultancy project for Repsol, the repository contains both the code notebook and the analysis.
A hands-on, progressive learning path for mastering PySpark with 5 modules, Jupyter notebooks, and comprehensive code quality tools
Created by Matei Zaharia
Released May 26, 2014