Skip to content
View g8gg's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report g8gg

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

103 stars written in Java
Clear filter

Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.

Java 20,703 6,913 Updated Mar 30, 2026

🚌 The IK Analysis plugin integrates Lucene IK analyzer into Elasticsearch and OpenSearch, support customized dictionary.

Java 17,433 3,285 Updated Mar 20, 2026

DataX是阿里云DataWorks数据集成的开源版本。

Java 17,149 5,661 Updated Jul 1, 2025

QuestDB is a high performance, open-source, time-series database

Java 16,800 1,555 Updated Mar 30, 2026

Zuul is a gateway service that provides dynamic routing, monitoring, resiliency, security, and more.

Java 14,001 2,432 Updated Mar 26, 2026

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java 12,671 3,548 Updated Mar 30, 2026

🔎 Open source distributed and RESTful search engine.

Java 12,665 2,488 Updated Mar 30, 2026

Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.

Java 12,562 2,899 Updated Mar 30, 2026

OpenRefine is a free, open source power tool for working with messy data and improving it

Java 11,791 2,132 Updated Mar 28, 2026

A scalable web crawler framework for Java.

Java 11,698 4,150 Updated Dec 20, 2025

A simple expressive web framework for java. Spark has a kotlin DSL https://github.com/perwendel/spark-kotlin

Java 9,664 1,568 Updated Oct 8, 2023

Flyway by Redgate • Database Migrations Made Easy.

Java 9,630 1,605 Updated Mar 26, 2026

Pentaho Data Integration ( ETL ) a.k.a Kettle

Java 8,323 3,583 Updated Mar 30, 2026

Distributed scheduled job

Java 8,221 3,258 Updated Mar 10, 2026

Use SQL to query Elasticsearch

Java 7,021 1,533 Updated Feb 21, 2026

ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典

Java 6,540 2,289 Updated Nov 19, 2023

Gephi - The Open Graph Viz Platform

Java 6,416 1,600 Updated Mar 30, 2026

Flink CDC is a streaming data integration tool

Java 6,383 2,134 Updated Mar 30, 2026

Drools is a rule engine, DMN engine and complex event processing (CEP) engine for Java

Java 6,233 2,579 Updated Mar 30, 2026

JSqlParser parses an SQL statement and translate it into a hierarchy of Java classes. The generated hierarchy can be navigated using the Visitor Pattern

Java 5,935 1,418 Updated Mar 28, 2026

JanusGraph: an open-source, distributed graph database

Java 5,749 1,207 Updated Nov 21, 2025

An xposed module that disables SSL certificate checking for the purposes of auditing an app with cert pinning

Java 5,292 826 Updated Sep 2, 2024

Distributed Graph Database

Java 5,236 999 Updated Oct 19, 2022

A scalable, distributed Time Series Database.

Java 5,064 1,239 Updated Dec 12, 2024

A data integration framework

Java 4,104 1,693 Updated Dec 2, 2025

Distributed Peer-to-Peer Web Search Engine and Intranet Search Appliance

Java 3,855 475 Updated Mar 29, 2026

The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).

Java 3,661 917 Updated Mar 30, 2026

Open, Multi-modal Catalog for Data & AI

Java 3,341 597 Updated Mar 28, 2026

DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualiz…

Java 3,252 1,037 Updated Nov 4, 2025

Database Subsetting and Relational Data Browsing Tool.

Java 3,154 142 Updated Mar 18, 2026
Next