apache-spark

Apache Spark is an open-source, distributed, general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.

Here are 2,126 public repositories matching this topic...

s5745623 / Wiki_Search_Engine

sainat / SparkEnvInstall

adityamuralidaran / ParallelDistributedProcessing

manojmallela / fun-spark

vindeolal / spark-ignite-example

gyleodhis / apacheSpark-Recommender-System

manoj2411 / spark-playground

hcvazquez / ht-engineering

Teanlouise / shared-world-data

helioribeiro / helioribeiro.github.io

venkataganya / weather_analysis

iamirmasoud / pyspark_basics

avcaliani / aws-app

emsalcengiz / filtering-process

borgettas / apache-spark-docker

mervat-khaled / ETL-Apache-Spark-NYC-Taxi-Data

HalaKhalifa / twitter-stream-pipeline

muhib20 / ETL-Process-with-Apache-Spark

xavierruth / DesafioFinal_Residencia_PortoDigital_VacinaCovid

DrFarouk / word2vec-streaming-topic-clustering
