Starred repositories

All Algorithms implemented in Python

Python 215,089 49,676 Updated Dec 13, 2025

Apache Fluss is a streaming storage built for real-time analytics.

Java 1,687 454 Updated Dec 24, 2025

Python SQL Parser and Transpiler

Python 8,737 1,033 Updated Dec 24, 2025

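A visual web interface integrated with DataX: select a data source to generate a data synchronization task in one click. Supports data sources such as RDBMS, Hive, HBase, ClickHouse, and MongoDB; batch creation of RDBMS synchronization tasks; an integrated open-source scheduling system; and distributed execution, incremental data synchronization, real-time viewing of run logs, executor resource monitoring, killing running processes, and encryption of data source information.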

Java 5,955 2,244 Updated Jun 2, 2024

Apache HBase Operator Tools

Java 183 149 Updated Dec 6, 2025

Java library for inferring JSON schema from sample JSONs

Java 189 40 Updated Nov 26, 2025

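A big data knowledge repository covering data warehouse modeling, real-time computing, big data, data middle platforms, system design, Java, algorithms, and more.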

Shell 1,704 386 Updated Aug 20, 2025

NO LONGER ACTIVE; please use the new official bzip2 repository at https://gitlab.com/federicomenaquintero/bzip2. This was an unofficial mirror of bzip2, including the historical releases I could find.

C 5 5 Updated Apr 4, 2015

The Metadata Platform for your Data and AI Stack

Java 11,348 3,312 Updated Dec 24, 2025

Apache Hadoop docker image

Shell 2,309 1,404 Updated Feb 1, 2024

The Lineage Analysis system for FlinkSQL supports advanced syntax such as Watermark, UDTF, CEP, Windowing TVFs, and CTAS.

Java 410 187 Updated Nov 20, 2025

ClickHouse Native Protocol JDBC implementation

Java 541 151 Updated Jun 22, 2025

A collection of the latest articles, resources, and demos related to Apache Iceberg.

32 11 Updated Jul 15, 2021

A collection of resources related to Apache Hudi.

561 160 Updated Dec 11, 2025

Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover, seamlessly and without downtime.

Scala 516 226 Updated Jan 13, 2020

Upserts, Deletes And Incremental Processing on Big Data.

Java 6,048 2,454 Updated Dec 24, 2025

Apache Iceberg

Java 8,356 2,936 Updated Dec 24, 2025

Flink sink for Clickhouse

Java 384 128 Updated Dec 5, 2023

Components for building stream loaders from Kafka to arbitrary storages

Scala 37 8 Updated Nov 4, 2025

Easily load data from Kafka to ClickHouse

Go 533 118 Updated Dec 24, 2025

ClickHouse® is a real-time analytics database management system

C++ 44,822 7,923 Updated Dec 24, 2025

A data generator source connector for Flink SQL based on data-faker.

Java 231 60 Updated Jul 24, 2023

The Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are completely self-contained and can be run in Ververica Platfor…

Dockerfile 911 208 Updated Nov 7, 2022

IntelliJ IDEA tutorial series in Simplified Chinese

22,161 7,286 Updated Sep 12, 2025

🔥 An open-source BI tool for everyone and a powerful data visualization tool; an open-source alternative to Tableau.

Java 22,931 3,943 Updated Dec 24, 2025

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 20,322 4,974 Updated Dec 24, 2025

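🔥 A comprehensive collection of classic programming books, covering computer systems and networking, system architecture, algorithms and data structures, front-end development, back-end development, mobile development, databases, testing, projects and teams, programmer career development, job hunting and interviews, and more.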

18,177 2,555 Updated Dec 8, 2025

Apache Pulsar - distributed pub-sub messaging system

Java 15,018 3,696 Updated Dec 24, 2025

A curated list of awesome big data frameworks, resources and other awesomeness.

14,112 2,590 Updated Nov 27, 2025