Skip to content
View hzh0425's full-sized avatar

Organizations

@apache @sofastack

Block or report hzh0425

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Claude Code skill that removes signs of AI-generated writing from text

16,026 1,481 Updated Apr 1, 2026

Claude Code+Obsidian,邪修读论文就是快

Python 972 117 Updated Mar 29, 2026

Community-contributed instructions, agents, skills, and configurations to help you make the most of GitHub Copilot.

Python 31,412 3,804 Updated Apr 28, 2026

ResearchClaw is a personal AI assistant built for research: fast to set up, easy to run locally or in the cloud, and ready to integrate with the chat apps you already use. With extensible skills, i…

Python 281 31 Updated Apr 4, 2026

Make Any Website & Tool Your CLI. A universal CLI Hub and AI-native runtime. Transform any website, Electron app, or local binary into a standardized command-line interface. Built for AI Agents to …

JavaScript 17,708 1,742 Updated Apr 28, 2026

Production-grade engineering skills for AI coding agents.

Shell 24,831 3,085 Updated Apr 28, 2026

An agentic skills framework & software development methodology that works.

Shell 170,319 15,040 Updated Apr 28, 2026

let coding agents use ncu skills analysis cuda program automatically!

Shell 91 4 Updated Feb 5, 2026

Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…

TeX 7,488 576 Updated Apr 28, 2026

A collection of specialized agent skills for AI infrastructure development, enabling Claude Code to write, optimize, and debug high-performance systems.

Python 121 6 Updated Apr 15, 2026

Alibaba Cloud's high-performance KVCache system for LLM inference, with components for global cache management, inference simulation(HiSim), and more.

C++ 153 20 Updated Apr 27, 2026

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

JavaScript 168,644 26,138 Updated Apr 26, 2026

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 42,344 7,508 Updated Apr 28, 2026

分享AI Infra知识&代码练习:PyTorch/vLLM/SGLang框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等

Jupyter Notebook 1,950 156 Updated Apr 26, 2026

This project aims to replicate mainstream open-source model architectures with limited computational resources, implementing mini models with 100-200M parameters.

Python 174 16 Updated Apr 27, 2026

记录我在cs336学习时的笔记和作业

Python 818 21 Updated Mar 30, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,073 599 Updated Mar 13, 2026

Contexts Optical Compression

Python 22,917 2,120 Updated Jan 27, 2026

Persist and reuse KV Cache to speedup your LLM.

Python 274 73 Updated Apr 28, 2026

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 4,128 310 Updated Apr 24, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,682 1,066 Updated Apr 28, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 26,597 5,596 Updated Apr 28, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 78,384 16,184 Updated Apr 28, 2026

Serverless LLM Serving for Everyone.

Python 676 71 Updated Apr 24, 2026

Apache Fluss is a streaming storage built for real-time analytics.

Java 1,876 529 Updated Apr 27, 2026

Supercharge Your LLM with the Fastest KV Cache Layer

Python 8,142 1,135 Updated Apr 28, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,212 709 Updated Apr 28, 2026

Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…

Rust 6,366 642 Updated Apr 28, 2026

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Go 13,497 1,195 Updated Apr 28, 2026

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

Java 3,253 1,306 Updated Apr 28, 2026
Next