Skip to content
View cxzl25's full-sized avatar

Block or report cxzl25

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A proxy tool to access OSS-HDFS as HDFS

Java 6 1 Updated Jan 21, 2026

Drop-in Apache Spark replacement written in Rust, unifying batch processing, stream processing, and compute-intensive AI workloads.

Rust 3,008 179 Updated Jun 22, 2026

Apache OpenDAL: One Layer, All Storage.

Rust 5,179 773 Updated Jun 21, 2026

Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…

Rust 6,693 728 Updated Jun 22, 2026

AI-Native & Cloud-Native FS: A high-performance file semantic layer for cloud object storage, integrated with high-speed cache

Rust 847 94 Updated Jun 22, 2026

The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query processing

Rust 1,772 226 Updated Jun 22, 2026

Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.

Java 1,053 447 Updated Jun 22, 2026

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala 1,571 619 Updated Jun 22, 2026

Apache Iceberg

Java 8,985 3,334 Updated Jun 22, 2026

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

Java 3,303 1,335 Updated Jun 22, 2026

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

Java 3,017 874 Updated Jun 20, 2026

Apache ORC - the smallest, fastest columnar storage for Hadoop workloads

16 7 Updated May 15, 2026

Apache ORC - the smallest, fastest columnar storage for Hadoop workloads

Java 766 513 Updated Jun 19, 2026

Uniffle is a high performance, general purpose Remote Shuffle Service.

Java 449 171 Updated May 27, 2026

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala 2,347 1,005 Updated Jun 21, 2026

Apache Spark - A unified analytics engine for large-scale data processing

Scala 43,487 29,248 Updated Jun 22, 2026

Pig Visualization framework

JavaScript 467 132 Updated Mar 24, 2023

A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apache Kyuubi

Scala 184 80 Updated Apr 6, 2022

Alibaba Java Coding Guidelines pmd implements and IDE plugin

Kotlin 30,825 7,991 Updated Aug 6, 2024

The Context Platform for your Data and AI Stack

Python 12,132 3,520 Updated Jun 22, 2026

hsweb (haʊs wɛb) 是一个基于spring-boot 2.x开发 ,首个使用全响应式编程的企业级后台管理系统基础项目。

Java 8,404 3,025 Updated Jun 18, 2026

An easy to use, self-service open BI reporting and BI dashboard platform.

JavaScript 3,094 1,156 Updated Dec 6, 2025

High Performance Inter-Thread Messaging Library

Java 18,381 3,978 Updated Apr 2, 2025

Web-based SQL editor

JavaScript 5,186 816 Updated Aug 23, 2025

Benchmark comparing serialization libraries on the JVM

Java 3,288 556 Updated Oct 7, 2023

A Spring Framework based, pragmatic style JavaEE application reference architecture.

Java 5,652 2,841 Updated Oct 25, 2022

Ctrip Hadoop Job Scheduling System derived from https://github.com/alibaba/zeus

Java 161 108 Updated Apr 10, 2016