Skip to content
View tromika's full-sized avatar

Block or report tromika

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
23 stars written in Java
Clear filter

The Metadata Platform for your Data and AI Stack

Java 11,182 3,253 Updated Nov 5, 2025

Apache Cassandra®

Java 9,469 3,790 Updated Nov 5, 2025

Flyway by Redgate • Database Migrations Made Easy.

Java 9,238 1,576 Updated Oct 23, 2025

Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.

Java 6,575 2,832 Updated Nov 3, 2025

Apache Pinot - A realtime distributed OLAP datastore

Java 5,938 1,425 Updated Nov 5, 2025

Apache Hive

Java 5,861 4,775 Updated Nov 5, 2025

Maxwell's daemon, a mysql-to-json kafka producer

Java 4,194 1,028 Updated Oct 27, 2025

Source-agnostic distributed change data capture system

Java 3,670 740 Updated Sep 28, 2023

Please visit https://github.com/h2oai/h2o-3 for latest H2O

Java 2,258 556 Updated Oct 24, 2024

Secor is a service implementing Kafka log persistence

Java 1,853 536 Updated Oct 20, 2025

ZooKeeper co-process for instance monitoring, backup/recovery, cleanup and visualization.

Java 1,677 445 Updated Sep 12, 2019

Wasabi A/B Testing service is an open source project that is no longer under active development or being supported

Java 1,137 236 Updated May 26, 2023

📈 Collect customer event data from your apps. (Note that this project only includes the API collector, not the visualization platform)

Java 796 102 Updated Nov 13, 2021

Mirror of Apache Pig

Java 686 447 Updated Sep 15, 2025

Realtime analytics, this includes the core components of Pulsar pipeline.

Java 652 124 Updated Nov 6, 2015

High Performance Kafka Connector for Spark Streaming.Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper. No Data-loss. No dependency on HDFS and WAL. In-built PID r…

Java 633 317 Updated Feb 26, 2022

Library and tools for advanced feature engineering

Java 568 109 Updated Dec 16, 2020

This code base is retained for historical interest only, please visit Apache Incubator Repo for latest one

Java 561 225 Updated Oct 5, 2022

Powered by Spark Streaming & Siddhi

Java 317 84 Updated Feb 11, 2020

[DEPRECATED] This project is deprecated. It will be archived on December 1, 2017.

Java 147 52 Updated Sep 20, 2016

Pig on Apache Spark

Java 82 26 Updated Mar 23, 2015

Apache Hadoop HDFS Data Node Scheduler

Java 13 7 Updated Jun 4, 2016

Data Science Subject repository

Java 1 2 Updated Dec 2, 2013