-
Updated
Dec 21, 2017 - Scala
hadoop
Here are 140 public repositories matching this topic...
Spark Benchmark suite to evaluate cluster configuration and compare the performance with other big data frameworks.
-
Updated
May 26, 2017 - Scala
Demoing Spark 2.2 and Elasticsearch Hadoop connector
-
Updated
Jan 11, 2023 - Scala
This Big Data project consists of obtaining data on vehicle theft in the city of São Paulo and consolidating it in a counting and heat map, in order to show areas with a higher index of this type of crime. All applicable in AWS Resources.
-
Updated
Apr 21, 2023 - Scala
Scope of this project is to calculate Daily Revenue from retail products
-
Updated
May 28, 2020 - Scala
Update PubMed articles daily on HDFS by using Spark Cluster
-
Updated
Nov 18, 2022 - Scala
A skeleton to generate a Spark job project in Scala with local distributed environment for development, example at (https://github.com/s3ni0r/spark-app-example)
-
Updated
Sep 11, 2019 - Scala
Batch ETL data pipeline built on HDP 3.0 to process daily sales and business data to procedure power Bi reports. Automated the pipelines using Airflow.
-
Updated
Dec 29, 2021 - Scala
In this project, we are going to build a Bicycle sharing demand prediction service using Apache Spark and Scala. I have created a two spark application one for model generation and another for model demand prediction.
-
Updated
Jan 8, 2021 - Scala
🌟Spark Ceph Connector: Implementation of Hadoop Filesystem API for Ceph
-
Updated
Aug 25, 2020 - Scala
some codes are created when learning BigData
-
Updated
Jun 29, 2022 - Scala
Improve this page
Add a description, image, and links to the hadoop topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the hadoop topic, visit your repo's landing page and select "manage topics."