Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real world tasks.
-
Updated
Dec 18, 2023 - Java
Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real world tasks.
Visual, interactive queries against big databases
Eskimo is a state of the art Big Data Infrastructure and Management Web Console to build, manage and operate Big Data 2.0 Analytics clusters on Kubernetes. This is the git repository of Eskimo Community Edition.
Real-time visual analytics for soccer matches, leveraging Apache Flink, Apache Kafka and the Elastic stack. Solution to DEBS 2013 Grand Challenge. Coursework in Systems and Architectures for Big Data 2016/2017.
Workflow management system for the automated and distributed analysis of large-scale experimental data.
A suite of benchmark applications for distributed data stream processing systems
SUTD 2021 50.043 Database and Big Data Systems Code Dump
Understanding Big Data Analytics by using Map Reduce for performing various tasks like Blooms Filter, Frequent Itemset, KMeans, Matrix Multiplication, Finding Maximum Temperature, Finding Word Count, and Analyzing Electricity Consumption
💾 Welcome to the Big Data Analytics Repository! 📚✨ Immerse yourself in a carefully curated reservoir of knowledge on Big Data Analytics. 🌐💡 Explore the intricacies of deriving insights from vast datasets and navigating powerful analytics tools. 🚀🔍
《DNA元基催化与肽计算》 在进化计算中, 软件函数文件进行 DNA 语义元基索引编码的 PDE 新陈代谢优化方式, 是一种有效的进化方式.
A lossy counting algorithm implemented to determine the top trending hashtags using the Twitter API to get a continuous stream of tweets.
The Project and workaround repository to generate a producer stream to kafka cluster, consume and then process it.
Reservoir Sampling for Group-By Queries in Flink Platform. Answering effectively Single Aggregate.
Performing various product review analysis on Amazon dataset using Apache Spark and MongoDB
Real-time social media analytics application that monitors posts and users popularity, leveraging Apache Flink. Research work accepted to the 10th ACM International Conference on Distributed and Event-Based Systems (DEBS 2015).
Analysis on Amazon Health care Products using Big Data Technologies using HDFS , Map/Reduce , Hive , PIG
Hadoop-MapReduces jobs to analyse and process a large number of tweets for the Rio 2016 Olympics.
Big data analytics using Hadoop on GDELT global news dataset.
Implementing parallel processing techniques for efficient handling of Big Data through practical activities.
Add a description, image, and links to the big-data-analytics topic page so that developers can more easily learn about it.
To associate your repository with the big-data-analytics topic, visit your repo's landing page and select "manage topics."