hadoop
Here are 1,131 public repositories matching this topic...
-
Updated
Feb 2, 2017 - Java
An application that uses Hadoop to analyze Twitter users, ranking them with the PageRank algorithm. The dataset is large (roughly 24 GB), so it is convenient to process it concurrently across many nodes.
-
Updated
Dec 3, 2016 - Java
-
Updated
Jul 22, 2023 - Java
Working on Apache Hadoop's MapReduce and Apache Spark
-
Updated
Dec 5, 2019 - Java
Bandwidth measurements for uploading and downloading files of different sizes.
-
Updated
Jun 30, 2020 - Java
A model classifying word pairs by their semantic similarity, using AWS, Hadoop and WEKA
-
Updated
May 13, 2021 - Java
A toy search engine for the Wikipedia corpus, using an inverted index built with Hadoop MapReduce
-
Updated
Dec 17, 2021 - Java
This repository is a hands-on exploration of the Hadoop architecture, the MapReduce programming model, and data processing with both Java and Python. It uses a Docker-based Hadoop cluster (with YARN, HDFS, and MapReduce) so you can quickly run, test, and extend the exercises without manual setup.
-
Updated
Aug 15, 2025 - Java