Benchmarks for data processing systems: Pathway, Spark, Flink, Kafka Streams
-
Updated
Nov 17, 2025 - Python
Benchmarks for data processing systems: Pathway, Spark, Flink, Kafka Streams
Text counter application in Flask
News trend analysis using Hadoop in a virtualized CentOS environment
This repository contains solutions to common mapper and reducer problems in Hadoop using Python
MapReduce is a programming model and an associated implementation for processing and generating large data sets. Users specify a map function that processes a key/value pair to generate a set of intermediate key/value pairs, and a reduce function that merges all intermediate values associated with the same intermediate key. Many real-world tasks…
A framework to run map reduce program. Implemented based on map reduce paper
This is the simple Django introduction project. how it works and the simple features of Django in the backend.
A Python word counter module to quickly count number of words in a sentence
A list of some python programs related to the topics such as lists, string, file, fibonnaci series, prime numbers , even odd numbers , divisors of a number and many more.
Big Data Processing with Hadoop
A simple tool to perform a basic topic analysis on Facebook-post comments
Add a description, image, and links to the wordcount topic page so that developers can more easily learn about it.
To associate your repository with the wordcount topic, visit your repo's landing page and select "manage topics."