partitioning

Here are 27 public repositories matching this topic...

mahmoudparsian / data-algorithms-book

MapReduce, Spark, Java, and Scala for Data Algorithms Book

python java machine-learning scala apache-spark distributed-computing design-patterns pyspark mapreduce reducers partitioning hadoop-mapreduce distributed-algorithms mappers data-algorithms apache-hadoop

Updated Oct 14, 2024
Java

RTBHOUSE / kafka-workers

Kafka Workers is a client library that unifies consuming records from Kafka and processing them with user-defined WorkerTasks.

kafka parallel multithreading stream-processing kafka-consumer partitioning backpressure kafka-workers parallel-consumer

Updated Sep 14, 2022
Java

piomin / sample-vertx-kafka-messaging

Implementation of a partitioning mechanism on Apache Kafka and asynchronous communication between Vert.x microservices

kafka vertx partitioning-algorithms partition apache-kafka partitioning message-broker vertx-kafka

Updated Nov 1, 2025
Java

BBoxDB is a scalable, highly available, and distributed data store for multi-dimensional big data. The software supports operations like multi-dimensional range queries and spatial joins. In addition, data streams are supported.

sstables nosql storage-engine bigdata gis storage-manager spatial-data partitioning nosql-database range-query key-value-database multi-dimensional multidimensional-data data-streams datastream key-value-store spatial-join distributed-storage-manager multi-dimensional-data

Updated Oct 21, 2025
Java

MarcialRosales / rabbitmq-partitioning-with-cloud-stream

spring spring-cloud partitioning spring-cloud-stream rabbitmq-consumer spring-boot-2 spring-cloud-stream-rabbitmq

Updated Sep 11, 2020
Java

data-integrations / dynamic-partitioner

CDAP Plugins for Sinks that allow you to specify a list of fields, and leverage the values as partitions in the dataset.

partitioning cdap cdap-plugin cask-marketplace fileset cdap-dataset

Updated Apr 25, 2019
Java

funbringer / pg10_vs_pathman_ycsb

Test partitioning in PostgreSQL 10 using YCSB

benchmark postgresql partitioning pathman

Updated Sep 20, 2017
Java

officiallysingh / spring-batch-commons

Spring Batch common components for partitioned jobs

spring-boot job fault-tolerance scalability partitioning spring-batch spring-batch-jobs

Updated Mar 18, 2025
Java

niftimus / SparkMMS

Custom AEMO MMS Data Model CSV reader for Apache Spark

java spark pyspark mms electricity partitioning aemo datasourcev2

Updated Jun 10, 2024
Java

chrisgleissner / jutil

Java Utilities

java utility csv sql protobuf log jdbc table pretty-print partitioning

Updated Jun 14, 2023
Java

SubhashMurugesan / Project_1-_Loading_Online_Event_Hits_using_Sqoop_to_Hive_via_Shell_Script

In this project, I implemented a Hadoop pipeline using Sqoop for ingestion, Hive for summarizing and implementing the warehouse logic, and MySQL as a database for validation and storage. The whole pipeline is automated with a shell script, and Bash commands ensure that every incident is logged properly.

sql hive hadoop reconciliation bigdata partitioning sqoop bucketing