KillrWeather is a reference application (work in progress) showing how to easily integrate streaming and batch data processing with Apache Spark Streaming, Apache Cassandra, Apache Kafka and Akka f…

Scala 1,183 394 Updated Jan 5, 2017

databricks / spark-csv

CSV Data Source for Apache Spark 1.x

Scala 1,057 441 Updated Dec 13, 2018

bigdatagenomics / adam

ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.

Scala 1,039 316 Updated Jul 12, 2025

TIBCOSoftware / snappydata

Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster

Scala 1,036 199 Updated Nov 21, 2022

twosigma / flint

A Time Series Library for Apache Spark

Scala 1,021 183 Updated Jul 3, 2020

h2oai / sparkling-water

Sparkling Water provides H2O functionality inside a Spark cluster

Scala 977 360 Updated Nov 5, 2025

apache / incubator-livy

Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.

Scala 932 614 Updated Nov 9, 2025

nscala-time / nscala-time

A new Scala wrapper for Joda Time based on scala-time

Scala 869 78 Updated Nov 4, 2025

LucaCanali / sparkMeasure

This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination…

Scala 795 160 Updated Nov 6, 2025

RayRoestenburg / akka-in-action

Accompanying source code for Akka in Action

Scala 745 418 Updated Aug 19, 2022

marcus-drake / sbt-docker

Create Docker images directly from sbt

Scala 735 111 Updated Dec 12, 2024

sameeragarwal / blinkdb

BlinkDB: Sub-Second Approximate Queries on Very Large Data.

Scala 660 120 Updated Feb 6, 2014

databricks / reference-apps

Spark reference applications

Scala 652 339 Updated Oct 3, 2024

Miguel Peralvo MiguelPeralvo

Stars

apache / spark

akka / akka-core

apache / predictionio

yahoo / CMAK

snowplow / snowplow

fpinscala / fpinscala

JohnSnowLabs / spark-nlp

awslabs / deequ

databricks / Spark-The-Definitive-Guide

spark-jobserver / spark-jobserver

twitter / algebird

apache / cassandra-spark-connector

typelevel / spire

holdenk / spark-testing-base

sryza / aas

combust / mleap

vkostyukov / scalacaster

killrweather / killrweather