big-data-analytics

Here are 58 public repositories matching this topic...

ICT-BDA / EasyML

Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real world tasks.

machine-learning learning-platform big-data-analytics machine-learning-studio machine-learning-platform

Updated Dec 18, 2023
Java

ingef / conquery

Star

Visual, interactive queries against big databases

java big-data big-data-analytics

Updated Dec 15, 2025
Java

Eskimo is a state of the art Big Data Infrastructure and Management Web Console to build, manage and operate Big Data 2.0 Analytics clusters on Kubernetes. This is the git repository of Eskimo Community Edition.

Updated Sep 14, 2023
Java

braineering / socstream

Star

Real-time visual analytics for soccer matches, leveraging Apache Flink, Apache Kafka and the Elastic stack. Solution to DEBS 2013 Grand Challenge. Coursework in Systems and Architectures for Big Data 2016/2017.

elasticsearch real-time kibana elasticstack soccer apache-flink apache-kafka visual-analytics big-data-analytics data-stream-processing

Updated Jul 25, 2017
Java

klugem / watchdog

Star

Workflow management system for the automated and distributed analysis of large-scale experimental data.

bioinformatics bioinformatics-pipeline rna-seq-analysis workflow-management-system cluster-computing big-data-analytics

Updated Oct 3, 2024
Java

garystafford / dataproc-java-demo

Star

Demonstration of Google Cloud Dataproc for running Spark jobs with Java

java google spark gcp big-data-analytics dataproc

Updated Dec 17, 2018
Java

GMAP / DSPBench

Star

A suite of benchmark applications for distributed data stream processing systems

big-data apache-spark storm data-stream bigdata evaluation stream-processing spark-streaming apache-storm apache-flink experiments big-data-analytics

Updated Aug 17, 2025
Java

jamestiotio / dbsys

Sponsor

Star

SUTD 2021 50.043 Database and Big Data Systems Code Dump

Updated May 17, 2022
Java

parshva45 / Big-Data-Analytics

Star

Understanding Big Data Analytics by using Map Reduce for performing various tasks like Blooms Filter, Frequent Itemset, KMeans, Matrix Multiplication, Finding Maximum Temperature, Finding Word Count, and Analyzing Electricity Consumption

bloom-filter word-cloud map-reduce matrix-multiplication kmeans-clustering hadoop-mapreduce frequent-itemsets big-data-analytics maximum-temperature electricity-consumption-analysis

Updated Mar 8, 2018
Java

Dare-marvel / Big-Data-Analytics--BDA--

Star

💾 Welcome to the Big Data Analytics Repository! 📚✨ Immerse yourself in a carefully curated reservoir of knowledge on Big Data Analytics. 🌐💡 Explore the intricacies of deriving insights from vast datasets and navigating powerful analytics tools. 🚀🔍

big-data hadoop tableau case-study big-data-analytics mapreduce-java tableau-dashboards walmart-case-study

Updated Feb 5, 2025
Java

yaoguangluo / ChromosomeDNA

Star

《DNA元基催化与肽计算》在进化计算中, 软件函数文件进行 DNA 语义元基索引编码的 PDE 新陈代谢优化方式, 是一种有效的进化方式.

search-engine data-science database prediction dnn plsql dna vision sorting-algorithms shell-script metabolism catalyst word-segmentation big-data-analytics nerotechnology etl-pipeline vpcs-rest dataswap

Updated Oct 1, 2025
Java

mehrotrasan16 / TwitterAnalyser_StormBoi

Star

A lossy counting algorithm implemented to determine the top trending hashtags using the Twitter API to get a continuous stream of tweets.

java twitter-streaming-api apache-storm streaming-algorithms big-data-analytics lossy-counting samples-tweets

Updated Nov 24, 2023
Java

JKhan01 / kafka-spark-stream

Star

The Project and workaround repository to generate a producer stream to kafka cluster, consume and then process it.

big-data apache-spark maven pyspark apache-kafka big-data-analytics

Updated Nov 4, 2021
Java

vvittis / FlinkSampling

Star

Reservoir Sampling for Group-By Queries in Flink Platform. Answering effectively Single Aggregate.

java topic stratum apache-flink sampling reservoir-sampling streaming-data big-data-analytics group-by big-data-processing streaming-tuples

Updated Aug 12, 2023
Java

nikhilsu / Product-review-analysis-Spark-MongoDB

Star

Performing various product review analysis on Amazon dataset using Apache Spark and MongoDB

spark apache-spark mongodb aws-s3 spark-clusters spark-sql big-data-analytics aws-emr-clusters

Updated Oct 17, 2018
Java

braineering / sostream

Star

Real-time social media analytics application that monitors posts and users popularity, leveraging Apache Flink. Research work accepted to the 10th ACM International Conference on Distributed and Event-Based Systems (DEBS 2015).

real-time social-network apache-flink social-network-analysis big-data-analytics data-stream-processing