-
Updated
Dec 21, 2017 - Scala
hadoop
Here are 140 public repositories matching this topic...
This Big Data project consists of obtaining data on vehicle theft in the city of São Paulo and consolidating it in a counting and heat map, in order to show areas with a higher index of this type of crime. All applicable in AWS Resources.
-
Updated
Apr 21, 2023 - Scala
Scope of this project is to calculate Daily Revenue from retail products
-
Updated
May 28, 2020 - Scala
Spark Benchmark suite to evaluate cluster configuration and compare the performance with other big data frameworks.
-
Updated
May 26, 2017 - Scala
Demoing Spark 2.2 and Elasticsearch Hadoop connector
-
Updated
Jan 11, 2023 - Scala
A skeleton to generate a Spark job project in Scala with local distributed environment for development, example at (https://github.com/s3ni0r/spark-app-example)
-
Updated
Sep 11, 2019 - Scala
Batch ETL data pipeline built on HDP 3.0 to process daily sales and business data to procedure power Bi reports. Automated the pipelines using Airflow.
-
Updated
Dec 29, 2021 - Scala
🌟Spark Ceph Connector: Implementation of Hadoop Filesystem API for Ceph
-
Updated
Aug 25, 2020 - Scala
some codes are created when learning BigData
-
Updated
Jun 29, 2022 - Scala
Optimal distributed data deduplication and supervised learning pipeline using Apache Spark
-
Updated
Aug 19, 2020 - Scala
Semester assignment for ECE NTUA 3189 Advanced Topics in Database Systems
-
Updated
Feb 5, 2023 - Scala
Improve this page
Add a description, image, and links to the hadoop topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the hadoop topic, visit your repo's landing page and select "manage topics."