hzh0425

Organizations

@apache @sofastack


Contexts Optical Compression

Python 21,499 1,922 Updated Oct 25, 2025

Persist and reuse KV Cache to speed up your LLM.

Python 214 54 Updated Dec 19, 2025

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 3,552 242 Updated Dec 18, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,654 748 Updated Dec 19, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,765 3,809 Updated Dec 19, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,792 12,071 Updated Dec 19, 2025

Serverless LLM Serving for Everyone.

Python 625 61 Updated Dec 19, 2025

Apache Fluss is a streaming storage built for real-time analytics.

Java 1,678 448 Updated Dec 19, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 6,385 802 Updated Dec 19, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,451 475 Updated Dec 19, 2025

Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…

Rust 5,840 501 Updated Dec 19, 2025

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Go 12,555 1,126 Updated Dec 19, 2025

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

Java 3,119 1,236 Updated Dec 19, 2025

📚 Geek Time e-books

12,873 4,297 Updated Jan 26, 2023

DoctorK is a service for Kafka cluster auto healing and workload balancing

Java 629 91 Updated Dec 15, 2021

Chinese translation of Designing Data-Intensive Applications (DDIA), first and second editions

Python 22,367 4,470 Updated Nov 24, 2025

Kubernetes CSI driver for LVM on shared disks

Go 25 5 Updated Dec 8, 2025

Open source Java implementation for Raft consensus protocol.

Java 1,426 439 Updated Dec 18, 2025

Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!

C++ 11,464 704 Updated Dec 19, 2025

TypeScript 10,496 1,114 Updated Nov 26, 2025

Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides great value to Kafka users by simplifying the operation of …

Java 2,969 644 Updated Nov 6, 2025

AutoMQ is a diskless Kafka® on S3. 10x Cost-Effective. No Cross-AZ Traffic Cost. Autoscale in seconds. Single-digit ms latency. Multi-AZ Availability.

Java 8,777 592 Updated Dec 17, 2025

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 329,727 53,674 Updated Nov 3, 2025

Apache Pulsar Source code analysis

145 48 Updated Nov 16, 2019

Immersive bilingual web page translation extension. Supports input box translation, mouse hover translation, and translation of PDF, Epub, subtitle, and TXT files.

16,720 957 Updated Dec 19, 2025

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser. Deploy your private AutoGPT web application for free with one click.

TypeScript 3,024 1,385 Updated Feb 10, 2025

Java large off heap cache

Java 1,096 186 Updated Sep 12, 2024

A Java library to perform direct I/O in Linux, bypassing file page cache.

Java 320 68 Updated Oct 4, 2022

A Java Direct IO framework which is very simple to use.

Java 122 30 Updated Mar 17, 2024

Source code for the X Recommendation Algorithm

Scala 67,981 12,648 Updated Sep 8, 2025