MapReduce, Spark, Java, and Scala for Data Algorithms Book
-
Updated
Oct 14, 2024 - Java
MapReduce, Spark, Java, and Scala for Data Algorithms Book
Kafka Workers is a client library which unifies records consuming from Kafka and processing them by user-defined WorkerTasks.
implementation of partitioning mechanism on Apache Kafka and asynchronous communication between Vert.x microservices
BBoxDB is a scalable, highly available, and distributed data store for multi-dimensional big data. The software supports operations like multi-dimensional range queries and spatial joins. In addition, data streams are supported.
CDAP Plugins for Sinks that allow you to specify a list of fields, and leverage the values as partitions in the dataset.
Test partitioning in PostgreSQL 10 using YCSB
Spring batch common components for partitioned jobs
Custom AEMO MMS Data Model CSV reader for Apache Spark
Java Utilities
In this project i have implemented the hadoop pipeline using sqoop for ingestion,hive for sumaarising and implementing the warehosue logics and MYSQL as an DB for validationa and storage.The entire thing was automated using the script and with help of bash commands we made it each and every incident is logged properly
Spring batch job as Spring cloud task
Spring batch job as Spring Rest service
A tiny embedded Java-engine for extremely fast partitioned immutable-after-construction databases
A partitioning algorithm for OWL
Implemented a Kafka-design distributed system by designing publisher, consumer and broker.
Command line tool to extract partitions and files from Atari disk images.
Simulation for memory management algorithms (First-fit, Best-fit, Worst-Fit)
Add a description, image, and links to the partitioning topic page so that developers can more easily learn about it.
To associate your repository with the partitioning topic, visit your repo's landing page and select "manage topics."