Install Spark, Kafka, Cassandra, Zookeeper
Updated Feb 20, 2017 · Python
Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
Implemented parallel and distributed algorithms using OpenMP, Apache Spark and NVIDIA CUDA
An engineering process for data science and big data processing
A content based movie recommendation system
Big data analysis of a 'shared-world' cloud application.
Repository for my software development and data science portfolio.
Notebooks for Python and Spark for Big Data
Analysis of weather data records from 1985-01-01 to 2014-12-31 for weather stations in Nebraska, Iowa, Illinois, Indiana, and Ohio.
You can do a lot of things with Apache Spark; this project works with a static file to build a batch ETL system.
This project was completed as a part of the "Advanced Big Data" course at Nile University.
A Twitter stream-processing pipeline with ingestion, processing, storage, and visualization.
Final challenge developed for the Porto Digital Residency at A3Data, where we had the opportunity to build a Data Lake with Bronze, Silver, and Gold layers for creating dashboards and analyses.
A coursework-style project from my Master's studies in Machine Learning on Big Data (University of East London), implementing distributed word embeddings and K-Means topic clustering on a large-scale news dataset using PySpark, and extending the trained models to a real-time Structured Streaming pipeline.
Created by Matei Zaharia
Released May 26, 2014