Skip to content
View tromika's full-sized avatar

Block or report tromika

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
27 stars written in Scala
Clear filter

Apache Spark - A unified analytics engine for large-scale data processing

Scala 42,234 28,919 Updated Nov 6, 2025

PredictionIO, a machine learning server for developers and ML engineers.

Scala 12,530 1,920 Updated Jan 9, 2021

The leader in Customer Data Infrastructure

Scala 6,964 1,190 Updated Jun 4, 2025

A machine learning package built for humans.

Scala 4,801 565 Updated Nov 6, 2025

Breeze is/was a numerical processing library for Scala.

Scala 3,456 694 Updated Oct 4, 2025

Abstract Algebra for Scala

Scala 2,299 347 Updated Aug 21, 2025

Apache Spark to Apache Cassandra connector

Scala 1,947 929 Updated Apr 29, 2025

Distributed Prometheus time series database

Scala 1,457 237 Updated Nov 6, 2025

Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster

Scala 1,036 199 Updated Nov 21, 2022

Sparkling Water provides H2O functionality inside Spark cluster

Scala 977 360 Updated Nov 5, 2025

A connector for Spark that allows reading and writing to/from Redis cluster

Scala 944 368 Updated Oct 22, 2024

Mirror of Apache Toree (Incubating)

Scala 748 228 Updated Oct 21, 2025

Apache Spark enhanced with native Kubernetes scheduler back-end: NOTE this repository is being ARCHIVED as all new development for the kubernetes scheduler back-end is now on https://github.com/apa…

Scala 613 117 Updated Jan 8, 2020

Avro Data Source for Apache Spark

Scala 539 306 Updated Dec 19, 2018

Real Time Analytics and Data Pipelines based on Spark Streaming

Scala 528 196 Updated Oct 24, 2019

Simplifying robust end-to-end machine learning on Apache Spark.

Scala 474 116 Updated Apr 18, 2017

MySQL binary log consumer with the ability to act on changed rows and publish changes to different systems with emphasis on Apache Kafka.

Scala 428 81 Updated Feb 22, 2023

Apache Kafka on Apache Mesos

Scala 413 140 Updated May 3, 2018

Spark library for easy MongoDB access

Scala 307 95 Updated Aug 30, 2016

Coral is a real-time analytics and data science platform. It transforms streaming events and extract patterns from data via RESTful APIs. Built on Scala, Akka, Cassandra and Spray.

Scala 147 22 Updated Sep 5, 2019

An example of using Avro and Parquet in Spark SQL

Scala 60 27 Updated Nov 16, 2015

Enabling Spark Optimization through Cross-stack Monitoring and Visualization

Scala 47 11 Updated Aug 23, 2017

Apache Spark AWS Lambda Executor (SAMBA)

Scala 44 18 Updated Jul 3, 2018

Apache Spark OpenCPU Executor (ROSE)

Scala 26 23 Updated Jun 16, 2018

Power BI API adapter for Apache Spark (deprecated)

Scala 26 10 Updated Nov 15, 2017

This project contains the code to translate between Apache Spark and SFrame.

Scala 20 20 Updated Jul 13, 2016
Scala 19 7 Updated Jul 11, 2023