wuchong

Organizations

@apache @alibaba @hexojs @flink-china


The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 20,277 4,969 Updated Dec 18, 2025

An orchestration platform for the development, production, and observation of data assets.

Python 14,613 1,910 Updated Dec 18, 2025

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,468 1,971 Updated Dec 18, 2025

World's most advanced database DevSecOps solution for Developer, Security, DBA and Platform Engineering teams. The GitHub/GitLab for database DevSecOps.

Go 13,447 882 Updated Dec 18, 2025

A platform for community discussion. Free, open, simple.

Ruby 45,796 8,747 Updated Dec 18, 2025

The Swift Programming Language

C++ 69,453 10,620 Updated Dec 18, 2025

FoundationDB - the open source, distributed, transactional key-value store

C++ 16,015 1,456 Updated Dec 17, 2025

A composable and fully extensible C++ execution engine library for data management systems.

C++ 3,987 1,417 Updated Dec 17, 2025

Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.

Rust 8,238 671 Updated Dec 17, 2025

Confluent Schema Registry for Kafka

Java 2,388 1,153 Updated Dec 17, 2025

A cloud native embedded storage engine built on object storage.

Rust 2,546 166 Updated Dec 17, 2025

Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…

Rust 5,834 500 Updated Dec 17, 2025

Main Portal page for the Jackson project

9,612 1,212 Updated Dec 17, 2025

🐘 Elasticsearch real-time search and analytics natively integrated with Hadoop

Java 2,017 999 Updated Dec 17, 2025

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …

Java 11,101 2,235 Updated Dec 17, 2025

Apache Flink

Java 25,614 13,804 Updated Dec 17, 2025

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 45,862 6,644 Updated Dec 17, 2025

A Java serialization/deserialization library to convert Java Objects into JSON and back

Java 24,275 4,365 Updated Dec 17, 2025

A modern, lambda-friendly, 120 character Java formatter.

Java 712 73 Updated Dec 17, 2025

Some notes on things I find interesting and important.

JavaScript 2,094 180 Updated Dec 17, 2025

Apache Polaris, the interoperable, open source catalog for Apache Iceberg

Java 1,771 343 Updated Dec 17, 2025

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java 12,285 3,411 Updated Dec 17, 2025

Restate is the platform for building resilient applications that tolerate all infrastructure faults w/o the need for a PhD.

Rust 3,276 114 Updated Dec 17, 2025

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

Java 3,116 1,236 Updated Dec 17, 2025

A full-featured license tool to check and fix license headers and resolve dependencies' licenses.

Go 292 63 Updated Dec 17, 2025

Official Elasticsearch Java Client

Java 510 279 Updated Dec 17, 2025

AutoMQ is a diskless Kafka® on S3. 10x Cost-Effective. No Cross-AZ Traffic Cost. Autoscale in seconds. Single-digit ms latency. Multi-AZ Availability.

Java 8,715 587 Updated Dec 17, 2025

DuckDB is an analytical in-process SQL database management system

C++ 34,818 2,792 Updated Dec 17, 2025

.asf.yaml documentation and schema

Python 9 14 Updated Dec 17, 2025

Redis Python client

Python 13,395 2,650 Updated Dec 17, 2025