Skip to content
View jimdowling's full-sized avatar

Block or report jimdowling

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
37 stars written in Java
Clear filter

🔎 Open source distributed and RESTful search engine.

Java 12,683 2,496 Updated Apr 2, 2026

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …

Java 11,536 2,382 Updated Apr 2, 2026

Apache Iceberg

Java 8,687 3,121 Updated Apr 2, 2026

Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.

Java 6,612 2,819 Updated Apr 2, 2026

Upserts, Deletes And Incremental Processing on Big Data.

Java 6,128 2,473 Updated Apr 2, 2026

Apache Kafka® running on Kubernetes

Java 5,758 1,472 Updated Apr 2, 2026

LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.

Java 3,226 415 Updated Apr 1, 2026

Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides great value to Kafka users by simplifying the operation of …

Java 3,011 646 Updated Nov 6, 2025

An Open Standard for lineage metadata collection

Java 2,385 446 Updated Apr 2, 2026

Official code repository for GATK versions 4 and up

Java 1,927 624 Updated Apr 2, 2026

Apache Polaris, the interoperable, open source catalog for Apache Iceberg

Java 1,889 415 Updated Apr 2, 2026

Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark

Java 1,370 845 Updated Aug 22, 2023

Hopsworks - Data-Intensive AI platform with a Feature Store

Java 1,290 156 Updated Feb 10, 2025

Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.

Java 1,173 199 Updated Apr 1, 2026

Payara Server is an open source middleware platform that supports reliable and secure deployments of Java EE (Jakarta EE) and MicroProfile applications in any environment: on premise, in the cloud …

Java 912 318 Updated Apr 2, 2026

Truly open source API gateway with native OpenAPI support. Written in Java, it is easily extensible, supports legacy XML and SOAP, and is optimized for container deployments.

Java 578 159 Updated Apr 2, 2026

A hackable data integration & analysis tool to enable non technical users to edit data processing jobs and visualise data on demand.

Java 577 43 Updated Feb 3, 2023

Generic Data Ingestion & Dispersal Library for Hadoop

Java 482 111 Updated Mar 19, 2023

Uniffle is a high performance, general purpose Remote Shuffle Service.

Java 447 170 Updated Apr 2, 2026

Remote shuffle service for Apache Spark to store shuffle data on remote servers.

Java 335 100 Updated Sep 29, 2023

Hops Hadoop is a distribution of Apache Hadoop with distributed metadata.

Java 322 80 Updated Jan 22, 2026

Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs

Java 302 59 Updated Mar 23, 2025

🆕 Find the k-nearest neighbors (k-NN) for your vector data

Java 212 187 Updated Apr 2, 2026

Rapid is a scalable distributed membership service

Java 137 20 Updated Jul 5, 2023

A tool for scale and performance testing of HDFS with a specific focus on the NameNode.

Java 135 33 Updated Jan 11, 2024

HopsWorks - Hadoop for Humans

Java 117 26 Updated Apr 25, 2019

Kompics - A message-passing component model for building distributed systems

Java 66 14 Updated Oct 4, 2022

Reproducing Distributed Systems and Experiments on Cloud

Java 40 21 Updated Sep 11, 2023
Next