hadoop
Here are 3,661 public repositories matching this topic...
Apache Spark - From installation to performing awesome operations in Apache Spark Stack
Updated May 8, 2017 - Python
Updated Apr 15, 2017 - Python
CAB: power-Capping aware resource manager for Approximate Big data processing
Updated Mar 12, 2018 - Java
Mirror of Apache SystemML (Incubating)
Updated May 11, 2018 - Java
Sequential and parallel analysis of Wikipedia page-view logs to identify page-view trends and derive the overall average page views per day, top trending topics, etc.
Updated Mar 7, 2018 - Java
Calculating Wikipedia PageRank with MapReduce while handling dead ends and spider traps
Updated Aug 29, 2017 - Java
Big Data & Cloud Computing project for recommendation, cluster analysis, and data visualization with Hadoop and Spark, deployed in an auto-scaling cloud environment, YouTube link:
Updated Jan 5, 2018 - TypeScript
Big Data Analysis on NYC-Subway Data using Hadoop MapReduce Technique
Updated Jun 16, 2018 - Jupyter Notebook
Automated setup of HDFS, YARN, and a Metastore in single-cluster mode with a single command
Updated Apr 27, 2020 - Kotlin