Skip to content
View g8gg's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report g8gg

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

103 stars written in Java
Clear filter

Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.

Java 20,703 6,913 Updated Mar 27, 2026

🚌 The IK Analysis plugin integrates Lucene IK analyzer into Elasticsearch and OpenSearch, support customized dictionary.

Java 17,434 3,284 Updated Mar 20, 2026

DataX是阿里云DataWorks数据集成的开源版本。

Java 17,147 5,662 Updated Jul 1, 2025

QuestDB is a high performance, open-source, time-series database

Java 16,789 1,556 Updated Mar 27, 2026

Zuul is a gateway service that provides dynamic routing, monitoring, resiliency, security, and more.

Java 14,000 2,431 Updated Mar 26, 2026

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java 12,665 3,547 Updated Mar 27, 2026

🔎 Open source distributed and RESTful search engine.

Java 12,648 2,481 Updated Mar 27, 2026

Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.

Java 12,551 2,895 Updated Mar 26, 2026

OpenRefine is a free, open source power tool for working with messy data and improving it

Java 11,788 2,132 Updated Mar 24, 2026

A scalable web crawler framework for Java.

Java 11,696 4,151 Updated Dec 20, 2025

A simple expressive web framework for java. Spark has a kotlin DSL https://github.com/perwendel/spark-kotlin

Java 9,665 1,568 Updated Oct 8, 2023

Flyway by Redgate • Database Migrations Made Easy.

Java 9,622 1,605 Updated Mar 26, 2026

Pentaho Data Integration ( ETL ) a.k.a Kettle

Java 8,325 3,584 Updated Mar 27, 2026

Distributed scheduled job

Java 8,221 3,258 Updated Mar 10, 2026

Use SQL to query Elasticsearch

Java 7,023 1,533 Updated Feb 21, 2026

ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典

Java 6,545 2,289 Updated Nov 19, 2023

Gephi - The Open Graph Viz Platform

Java 6,411 1,599 Updated Mar 15, 2026

Flink CDC is a streaming data integration tool

Java 6,385 2,133 Updated Mar 27, 2026

Drools is a rule engine, DMN engine and complex event processing (CEP) engine for Java

Java 6,231 2,579 Updated Mar 26, 2026

JSqlParser parses an SQL statement and translate it into a hierarchy of Java classes. The generated hierarchy can be navigated using the Visitor Pattern

Java 5,936 1,418 Updated Mar 25, 2026

JanusGraph: an open-source, distributed graph database

Java 5,748 1,207 Updated Nov 21, 2025

An xposed module that disables SSL certificate checking for the purposes of auditing an app with cert pinning

Java 5,289 827 Updated Sep 2, 2024

Distributed Graph Database

Java 5,237 999 Updated Oct 19, 2022

A scalable, distributed Time Series Database.

Java 5,064 1,239 Updated Dec 12, 2024

A data integration framework

Java 4,105 1,693 Updated Dec 2, 2025

Distributed Peer-to-Peer Web Search Engine and Intranet Search Appliance

Java 3,854 476 Updated Mar 27, 2026

The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).

Java 3,658 917 Updated Mar 26, 2026

Open, Multi-modal Catalog for Data & AI

Java 3,341 597 Updated Mar 27, 2026

DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualiz…

Java 3,251 1,036 Updated Nov 4, 2025

Database Subsetting and Relational Data Browsing Tool.

Java 3,153 142 Updated Mar 18, 2026
Next