Updated Dec 21, 2017 - Scala
hadoop
Here are 139 public repositories matching this topic...
STM data enrichment: Extract, Transform, Load (ETL)
Updated Jun 11, 2021 - Scala
Import and process HDF5 files on Spark with Hadoop
Updated Jun 20, 2022 - Scala
A distributed computing project for large-scale graph matching using cloud computing technologies. Users can import two directed graphs and analyze the differences between them.
Updated Oct 25, 2023 - Scala
Spark benchmark suite for evaluating cluster configurations and comparing performance with other big data frameworks.
Updated May 26, 2017 - Scala
Updates PubMed articles on HDFS daily using a Spark cluster
Updated Nov 18, 2022 - Scala
Data Algorithms for Apache Spark RDD and Hadoop MapReduce in Scala and Java
Updated Aug 25, 2022 - Scala
This project is a data processing application built with Apache Spark and Scala. It is designed to efficiently process, analyze, and transform large datasets of people data. It leverages Spark's distributed computing capabilities to handle scalable data ingestion, cleaning, and reporting. Shell scripts are included for Hadoop deployment.
Updated Jun 11, 2025 - Scala
The scope of this project is to calculate daily revenue from retail products.
Updated May 28, 2020 - Scala
This Big Data project obtains data on vehicle theft in the city of São Paulo and consolidates it into counts and a heat map to show the areas with a higher incidence of this type of crime. All of it can be deployed on AWS resources.
Updated Apr 21, 2023 - Scala
Demo of Spark 2.2 with the Elasticsearch Hadoop connector
Updated Jan 11, 2023 - Scala