Geo-spatial data analysis on data collected from a taxi-cab company.
-
Updated
Sep 2, 2021 - Scala
Geo-spatial data analysis on data collected from a taxi-cab company.
Set of tasks solved in Big Data Algorithms course
YouTube video analysis based on datasets on Kaggle
Bus Delays Analysis is a big data analytics project designed to do ETL and analyze bus delays using Scala, Apache Spark, and HDFS.
Spark Bootstrapping analysis. This project involved doing computational analysis one loads of data in specific industries and getting insights on how long a candidate is likely to stay in a certain industry over time.
Using Scala for big data computations for basic tasks
Turing Data Engineering Challenge
GameTuner BigQuery Loader is application that loads enriched event to BigQuery
GameTuner Scala Stream Collector is project for collecting raw events from tracker
Implementation of simple Bloom Filter
SparkSQL analysis on groceries and medication prices from Wal-mart and Competitors to deduce empirical fact on best cost-effective grocery store
I used big data tools (Hive, SparkRDDs, and Spark SQL). I solved challenging big data processing tasks by finding highly efficient solutions. Experienced processing four different types of real data: Standard multi-attribute data (video game sales data), Time series data (Twitter feed), Bag of words data, A News aggregation corpus.
SANSA RDF Library
Build a large data-intensive application using real-world data to show interactive visualizations of the evolution of temperatures over time all over the world.
Performance of Aircraft in the US from 1987 to 2008.
The U.S. Department of Transportation's (DOT) Bureau of Transportation Statistics tracks the on-time performance of domestic flights operated by large air carriers. Summary information on the number of on-time, delayed, canceled, and diverted flights is published in DOT's monthly Air Travel Consumer Report and in this dataset of 2015 flight dela…
(Semester 4) Big Data Analytics - End Semester Project
Add a description, image, and links to the big-data-analytics topic page so that developers can more easily learn about it.
To associate your repository with the big-data-analytics topic, visit your repo's landing page and select "manage topics."