Django app that collects data from parquet files, provides the data in json format, and then consumes and returns the data.
-
Updated
Jun 15, 2021 - HTML
Django app that collects data from parquet files, provides the data in json format, and then consumes and returns the data.
INTEGRATING MULTIPLE TECHNOLOGIES INTO ONE PLACE.
Automation Scripts | Ansible
My personal website
This repo contains tools to set up and run a Hadoop Cluster. Initially for AWS EC2, but contains also a local version set up (home with 3 Ubuntu computers)
This repository contains the assignments and project work done in the course -Engineering-of-Big-Data-systems-INFO7350
This java EE project uses K-Means algorithm in Hadoop to predict an individual's income
Hadoop component installation involves setting up HDFS for storage, YARN for resource management, and MapReduce for processing, creating a scalable big data platform.
Add a description, image, and links to the hadoop topic page so that developers can more easily learn about it.
To associate your repository with the hadoop topic, visit your repo's landing page and select "manage topics."