Skip to content
View hzh0425's full-sized avatar

Organizations

@apache @sofastack

Block or report hzh0425

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
275 results for source starred repositories
Clear filter

This project aims to replicate mainstream open-source model architectures with limited computational resources, implementing mini models with 100-200M parameters.

Python 110 7 Updated Feb 2, 2026

记录我在cs336学习时的笔记和作业

Python 579 10 Updated Jan 30, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,304 410 Updated Jan 19, 2026

Contexts Optical Compression

Python 22,384 2,055 Updated Jan 27, 2026

Persist and reuse KV Cache to speedup your LLM.

Python 248 60 Updated Feb 4, 2026

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 3,762 266 Updated Dec 18, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,041 840 Updated Feb 4, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 23,241 4,320 Updated Feb 4, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 69,479 13,186 Updated Feb 4, 2026

Serverless LLM Serving for Everyone.

Python 645 63 Updated Jan 23, 2026

Apache Fluss is a streaming storage built for real-time analytics.

Java 1,769 499 Updated Feb 4, 2026

Supercharge Your LLM with the Fastest KV Cache Layer

Python 6,839 886 Updated Feb 4, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,689 545 Updated Feb 4, 2026

Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…

Rust 6,008 540 Updated Feb 4, 2026

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Go 13,170 1,161 Updated Feb 4, 2026

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

Java 3,181 1,269 Updated Feb 4, 2026

📚 极客时间电子书

12,964 4,335 Updated Jan 26, 2023

《Designing Data-Intensive Application》DDIA 第一版 / 第二版 中文翻译

Python 22,568 4,485 Updated Jan 26, 2026

Kubernetes CSI driver for LVM on shared disks

Go 25 5 Updated Jan 9, 2026

Open source Java implementation for Raft consensus protocol.

Java 1,437 440 Updated Feb 3, 2026

Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!

C++ 11,677 713 Updated Feb 4, 2026
TypeScript 10,537 1,127 Updated Nov 26, 2025

Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides great value to Kafka users by simplifying the operation of …

Java 2,991 646 Updated Nov 6, 2025

AutoMQ is a diskless Kafka® on S3. 10x Cost-Effective. No Cross-AZ Traffic Cost. Autoscale in seconds. Single-digit ms latency. Multi-AZ Availability.

Java 9,453 660 Updated Feb 4, 2026

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 334,406 54,312 Updated Nov 3, 2025

Apache Pulsar Source code analysis

147 48 Updated Nov 16, 2019

沉浸式双语网页翻译扩展 , 支持输入框翻译, 鼠标悬停翻译, PDF, Epub, 字幕文件, TXT 文件翻译 - Immersive Dual Web Page Translation Extension

16,890 971 Updated Jan 26, 2026

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.一键免费部署你的私人AutoGPT 网页应用

TypeScript 3,018 1,377 Updated Feb 10, 2025

Java large off heap cache

Java 1,096 185 Updated Sep 12, 2024

A Java library to perform direct I/O in Linux, bypassing file page cache.

Java 322 68 Updated Oct 4, 2022
Next