- UK or HU
- http://tamas.szuromi.me
- @tamas__szuromi
Stars
Apache Spark - A unified analytics engine for large-scale data processing
PredictionIO, a machine learning server for developers and ML engineers.
The leader in Customer Data Infrastructure
Breeze is/was a numerical processing library for Scala.
Apache Spark to Apache Cassandra connector
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
Sparkling Water provides H2O functionality inside Spark cluster
A connector for Spark that allows reading and writing to/from Redis cluster
apache-spark-on-k8s / spark
Forked from apache/sparkApache Spark enhanced with native Kubernetes scheduler back-end: NOTE this repository is being ARCHIVED as all new development for the kubernetes scheduler back-end is now on https://github.com/apa…
Real Time Analytics and Data Pipelines based on Spark Streaming
Simplifying robust end-to-end machine learning on Apache Spark.
MySQL binary log consumer with the ability to act on changed rows and publish changes to different systems with emphasis on Apache Kafka.
Coral is a real-time analytics and data science platform. It transforms streaming events and extract patterns from data via RESTful APIs. Built on Scala, Akka, Cassandra and Spray.
An example of using Avro and Parquet in Spark SQL
Enabling Spark Optimization through Cross-stack Monitoring and Visualization
Apache Spark AWS Lambda Executor (SAMBA)
Apache Spark OpenCPU Executor (ROSE)
Power BI API adapter for Apache Spark (deprecated)
This project contains the code to translate between Apache Spark and SFrame.