Install Spark, Kafka, Cassandra, Zookeeper
Apache Spark is an open-source, distributed, general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance. Created by Matei Zaharia, it was first released on May 26, 2014.
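A minimal sketch of that programming model, assuming only a local `pip install pyspark`; the aggregation below is parallelized across partitions without any explicit thread or process management:

```python
from pyspark.sql import SparkSession, functions as F

# "local[*]" runs Spark locally using all available CPU cores.
spark = SparkSession.builder.master("local[*]").appName("parallelism-demo").getOrCreate()

# One million rows, automatically split into partitions.
df = spark.range(1_000_000).withColumn("bucket", F.col("id") % 3)

# The aggregation runs per partition first, then merges the partial results.
df.groupBy("bucket").agg(F.count("*").alias("n"), F.avg("id").alias("mean")).show()

spark.stop()
```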
Analysis of weather data records from 1985-01-01 to 2014-12-31 for weather stations in Nebraska, Iowa, Illinois, Indiana, or Ohio.
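A hedged sketch of the kind of query involved, assuming the records sit in a CSV with hypothetical `state`, `date`, and `tmax` columns:

```python
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("weather").getOrCreate()

# Hypothetical input layout: station_id, state, date (yyyy-MM-dd), tmax, tmin.
weather = spark.read.csv("weather_records.csv", header=True, inferSchema=True)

# Keep only the five states and the 30-year window from the description.
midwest = weather.filter(
    F.col("state").isin("NE", "IA", "IL", "IN", "OH")
    & F.col("date").between("1985-01-01", "2014-12-31")
)
midwest.groupBy("state").agg(F.avg("tmax").alias("avg_tmax")).show()
```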
This project links a MongoDB cluster and a Kafka cluster to a standalone PySpark cluster, all running locally.
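A rough sketch of one leg of that wiring, assuming the MongoDB Spark connector (10.x option names) and the Spark Kafka package are on the classpath; the database, collection, and topic names are made up:

```python
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("mongo-kafka-bridge").getOrCreate()

# Read a collection through the MongoDB Spark connector.
docs = (spark.read.format("mongodb")
        .option("connection.uri", "mongodb://localhost:27017")
        .option("database", "appdb")
        .option("collection", "events")
        .load())

# Serialize each row as JSON and publish it to a local Kafka topic.
(docs.select(F.to_json(F.struct(*docs.columns)).alias("value"))
 .write.format("kafka")
 .option("kafka.bootstrap.servers", "localhost:9092")
 .option("topic", "mongo_events")
 .save())
```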
You can do a lot of things with Apache Spark. What I've done here is work with a static file and build a batch ETL system.
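A minimal sketch of such a batch ETL job, with a hypothetical input path and column names:

```python
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("batch-etl").getOrCreate()

# Extract: read a static CSV file (path and columns are assumptions).
raw = spark.read.csv("events.csv", header=True, inferSchema=True)

# Transform: drop malformed rows and derive a date column.
clean = (raw.dropna(subset=["user_id", "timestamp"])
            .withColumn("event_date", F.to_date("timestamp")))

# Load: write partitioned Parquet for downstream consumers.
clean.write.mode("overwrite").partitionBy("event_date").parquet("output/events")

spark.stop()
```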
Simulates a real-time Smart City data pipeline with Kafka, Apache Spark, and S3. Streams and processes vehicle, GPS, weather, traffic, and emergency data with Dockerized components and Parquet storage for efficient, scalable data engineering.
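One plausible slice of that architecture as a sketch: a Structured Streaming query from Kafka to Parquet on S3 (topic, bucket, and paths are hypothetical; `s3a://` URLs assume hadoop-aws is configured):

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("smart-city").getOrCreate()

# Read vehicle events from a Kafka topic as an unbounded stream.
vehicles = (spark.readStream.format("kafka")
            .option("kafka.bootstrap.servers", "localhost:9092")
            .option("subscribe", "vehicle_data")
            .load())

# Persist the raw stream as Parquet on S3; the checkpoint directory
# lets the query recover its progress after a restart.
query = (vehicles.selectExpr("CAST(value AS STRING) AS json")
         .writeStream.format("parquet")
         .option("path", "s3a://smart-city-bucket/vehicles/")
         .option("checkpointLocation", "s3a://smart-city-bucket/checkpoints/vehicles/")
         .start())
query.awaitTermination()
```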
A coursework-style project from my Master's studies in Machine Learning on Big Data (University of East London), implementing distributed word embeddings and K-Means topic clustering on a large-scale news dataset using PySpark, and extending the trained models to a real-time Structured Streaming pipeline.
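A toy-scale sketch of the batch half of that pipeline (the corpus here is three made-up headlines; real settings would need a larger `vectorSize` and corpus):

```python
from pyspark.sql import SparkSession
from pyspark.ml.feature import Tokenizer, Word2Vec
from pyspark.ml.clustering import KMeans

spark = SparkSession.builder.appName("news-topics").getOrCreate()

# Hypothetical corpus: one news article per row in a "text" column.
news = spark.createDataFrame([
    ("stocks rally as markets rebound",),
    ("team wins championship final",),
    ("new vaccine trial shows promise",),
], ["text"])

tokens = Tokenizer(inputCol="text", outputCol="words").transform(news)

# Train word embeddings; each document is averaged into a single vector.
w2v = Word2Vec(vectorSize=50, minCount=1, inputCol="words", outputCol="features")
vectors = w2v.fit(tokens).transform(tokens)

# Cluster the document vectors into topics.
model = KMeans(k=3, seed=42, featuresCol="features").fit(vectors)
model.transform(vectors).select("text", "prediction").show(truncate=False)
```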
Perform sentiment analysis on the Yelp dataset with Apache Spark.
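A minimal sketch of one common approach (not necessarily this repo's): TF-IDF features feeding a logistic regression classifier, with a few made-up reviews standing in for the Yelp data:

```python
from pyspark.sql import SparkSession
from pyspark.ml import Pipeline
from pyspark.ml.feature import Tokenizer, HashingTF, IDF
from pyspark.ml.classification import LogisticRegression

spark = SparkSession.builder.appName("yelp-sentiment").getOrCreate()

# Hypothetical labeled reviews: 1.0 = positive, 0.0 = negative.
reviews = spark.createDataFrame([
    ("great food and friendly staff", 1.0),
    ("terrible service, never again", 0.0),
    ("loved the atmosphere", 1.0),
    ("cold and bland dishes", 0.0),
], ["text", "label"])

pipeline = Pipeline(stages=[
    Tokenizer(inputCol="text", outputCol="words"),
    HashingTF(inputCol="words", outputCol="tf", numFeatures=1 << 16),
    IDF(inputCol="tf", outputCol="features"),
    LogisticRegression(maxIter=20),
])
model = pipeline.fit(reviews)
model.transform(reviews).select("text", "prediction").show(truncate=False)
```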
An End-to-End Real-time Data Pipeline using Debezium (CDC) to stream changes from PostgreSQL to Kafka, processed by Apache Spark (Structured Streaming), and sunk into ClickHouse for analytics. Orchestrated by Airflow and fully containerized with Docker Compose.
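A sketch of the Spark stage under stated assumptions: a simplified Debezium envelope, a made-up topic name, and a ClickHouse JDBC sink (requires the ClickHouse JDBC driver on the classpath; the driver class and URL are assumptions):

```python
from pyspark.sql import SparkSession, functions as F
from pyspark.sql.types import StructType, StructField, StringType, IntegerType

spark = SparkSession.builder.appName("cdc-to-clickhouse").getOrCreate()

# Debezium change events arrive as JSON on a Kafka topic.
changes = (spark.readStream.format("kafka")
           .option("kafka.bootstrap.servers", "localhost:9092")
           .option("subscribe", "pg.public.orders")
           .load())

# Simplified Debezium envelope: "after" holds the new row state.
after = StructType([StructField("id", IntegerType()),
                    StructField("status", StringType())])
envelope = StructType([StructField("after", after),
                       StructField("op", StringType())])

rows = (changes
        .select(F.from_json(F.col("value").cast("string"), envelope).alias("e"))
        .select("e.after.*", "e.op"))

# Write each micro-batch to ClickHouse over JDBC.
def write_batch(batch_df, batch_id):
    (batch_df.write.format("jdbc")
     .option("url", "jdbc:clickhouse://localhost:8123/analytics")
     .option("dbtable", "orders")
     .option("driver", "com.clickhouse.jdbc.ClickHouseDriver")
     .mode("append").save())

query = (rows.writeStream.foreachBatch(write_batch)
         .option("checkpointLocation", "/tmp/cdc-ckpt").start())
query.awaitTermination()
```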
This project automates the extraction of university course details (e.g., schedules, professors, course codes) from text files using regex patterns and a spaCy NLP model, processes them with PySpark, and loads the structured data into Snowflake for easy querying. The entire pipeline is containerized with Docker.
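A sketch of just the regex-extraction step in PySpark (the patterns, sample lines, and column names are invented; the spaCy and Snowflake stages are omitted):

```python
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("course-extract").getOrCreate()

# Hypothetical raw lines from a course catalog text file.
raw = spark.createDataFrame([
    ("CS101 Intro to Programming Mon 09:00 Prof. Smith",),
    ("MATH220 Linear Algebra Wed 14:00 Prof. Jones",),
], ["line"])

# Pull course code, day, and time out of each line with regexp_extract.
courses = raw.select(
    F.regexp_extract("line", r"^([A-Z]+\d+)", 1).alias("course_code"),
    F.regexp_extract("line", r"\b(Mon|Tue|Wed|Thu|Fri)\b", 1).alias("day"),
    F.regexp_extract("line", r"(\d{2}:\d{2})", 1).alias("time"),
)
courses.show(truncate=False)

# Loading into Snowflake would typically go through the Spark-Snowflake
# connector ("net.snowflake.spark.snowflake") with account credentials.
```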
Real-time streaming data analysis pipeline that integrates Apache Spark's streaming library to read records from a Kafka topic.
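A minimal Structured Streaming sketch of that read path, assuming JSON payloads with a hypothetical schema and topic name:

```python
from pyspark.sql import SparkSession, functions as F
from pyspark.sql.types import StructType, StructField, StringType, DoubleType

spark = SparkSession.builder.appName("kafka-stream").getOrCreate()

# Hypothetical JSON payload schema for records on the topic.
schema = StructType([StructField("sensor_id", StringType()),
                     StructField("reading", DoubleType())])

records = (spark.readStream.format("kafka")
           .option("kafka.bootstrap.servers", "localhost:9092")
           .option("subscribe", "readings")  # topic name is an assumption
           .option("startingOffsets", "latest")
           .load()
           .select(F.from_json(F.col("value").cast("string"), schema).alias("r"))
           .select("r.*"))

# Print each micro-batch to stdout for inspection.
query = records.writeStream.format("console").outputMode("append").start()
query.awaitTermination()
```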
A forecasting project built on Apache Spark and implemented with a Naive Bayes classifier.
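A toy sketch of Naive Bayes in Spark ML with invented weather-style features (the model requires non-negative feature values, which these [0, 1] inputs satisfy):

```python
from pyspark.sql import SparkSession
from pyspark.ml.classification import NaiveBayes
from pyspark.ml.feature import VectorAssembler

spark = SparkSession.builder.appName("nb-forecast").getOrCreate()

# Hypothetical features: humidity, pressure -> label 1.0 = rain.
data = spark.createDataFrame([
    (0.9, 0.2, 1.0),
    (0.3, 0.8, 0.0),
    (0.8, 0.3, 1.0),
    (0.2, 0.9, 0.0),
], ["humidity", "pressure", "label"])

features = VectorAssembler(inputCols=["humidity", "pressure"],
                           outputCol="features").transform(data)

model = NaiveBayes(smoothing=1.0).fit(features)
model.transform(features).select("label", "prediction").show()
```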
Scalable Book Recommender System - Apache Spark MLlib
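The standard MLlib tool for this is ALS collaborative filtering; a minimal sketch with made-up explicit ratings:

```python
from pyspark.sql import SparkSession
from pyspark.ml.recommendation import ALS

spark = SparkSession.builder.appName("book-recs").getOrCreate()

# Hypothetical explicit ratings: (user_id, book_id, rating).
ratings = spark.createDataFrame([
    (1, 10, 5.0), (1, 20, 3.0),
    (2, 10, 4.0), (2, 30, 5.0),
    (3, 20, 2.0), (3, 30, 4.0),
], ["user_id", "book_id", "rating"])

# Matrix factorization via alternating least squares.
als = ALS(userCol="user_id", itemCol="book_id", ratingCol="rating",
          rank=8, maxIter=10, regParam=0.1, coldStartStrategy="drop")
model = als.fit(ratings)

# Top-3 book recommendations per user.
model.recommendForAllUsers(3).show(truncate=False)
```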
Pinterest's experiment analytics data pipeline which runs thousands of experiments per day and crunches billions of datapoints to provide valuable insights to improve the product.
ML model deployment app I contributed to via the MLH Fellowship.
A scalable marketing analytics pipeline built with Apache Spark and Delta Lake, designed to process, transform, and export data for advanced business insights.
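A sketch of the Delta Lake side of such a pipeline, assuming the delta-spark package is installed (the session extensions below are what Delta requires; table paths and columns are invented):

```python
from pyspark.sql import SparkSession, functions as F

spark = (SparkSession.builder.appName("marketing-delta")
         .config("spark.sql.extensions", "io.delta.sql.DeltaSparkSessionExtension")
         .config("spark.sql.catalog.spark_catalog",
                 "org.apache.spark.sql.delta.catalog.DeltaCatalog")
         .getOrCreate())

# Hypothetical campaign events.
events = spark.createDataFrame([
    ("c1", "click"), ("c1", "impression"), ("c2", "click"),
], ["campaign_id", "event"])

# Write to a Delta table; the transaction log gives ACID guarantees.
events.write.format("delta").mode("overwrite").save("/tmp/delta/campaign_events")

# Read it back and aggregate for export.
(spark.read.format("delta").load("/tmp/delta/campaign_events")
 .groupBy("campaign_id").agg(F.count("*").alias("n_events"))
 .show())
```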
Homework for the Big Data Computation course: uses Apache Spark and the MapReduce algorithm to extract information from a dataset.
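The canonical MapReduce pattern on a Spark RDD, shown with an inline toy dataset: map emits (key, 1) pairs and reduceByKey aggregates them per key across partitions.

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("mapreduce-hw").getOrCreate()
sc = spark.sparkContext

# Map phase: one (word, 1) pair per token; reduce phase: sum per word.
lines = sc.parallelize(["to be or not to be", "to be is to do"])
counts = (lines.flatMap(lambda line: line.split())
               .map(lambda word: (word, 1))
               .reduceByKey(lambda a, b: a + b))
print(sorted(counts.collect()))
```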