Stars
A Claude Code skill that removes signs of AI-generated writing from text.
Community-contributed instructions, agents, skills, and configurations to help you make the most of GitHub Copilot.
ResearchClaw is a personal AI assistant built for research: fast to set up, easy to run locally or in the cloud, and ready to integrate with the chat apps you already use. With extensible skills, i…
Make Any Website & Tool Your CLI. A universal CLI Hub and AI-native runtime. Transform any website, Electron app, or local binary into a standardized command-line interface. Built for AI Agents to …
Production-grade engineering skills for AI coding agents.
An agentic skills framework & software development methodology that works.
Lets coding agents automatically analyze CUDA programs using NCU (Nsight Compute) skills!
Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your Claude Code/Codex/Gemini agent will be an AI research agent with full horsepowe…
A collection of specialized agent skills for AI infrastructure development, enabling Claude Code to write, optimize, and debug high-performance systems.
Alibaba Cloud's high-performance KVCache system for LLM inference, with components for global cache management, inference simulation (HiSim), and more.
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Sharing AI Infra knowledge & code exercises: intros to the PyTorch/vLLM/SGLang frameworks ⚡️, performance acceleration 🚀, LLM fundamentals 🧠, AI hardware and software 🔧, and more.
This project aims to replicate mainstream open-source model architectures with limited computational resources, implementing mini models with 100-200M parameters.
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
Persist and reuse KV Cache to speedup your LLM.
A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.
A Datacenter Scale Distributed Inference Serving Framework
SGLang is a high-performance serving framework for large language models and multimodal models.
A high-throughput and memory-efficient inference and serving engine for LLMs
Serverless LLM Serving for Everyone.
Apache Fluss is a streaming storage system built for real-time analytics.
Supercharge Your LLM with the Fastest KV Cache Layer
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.