yiwenshao

Follow

yiwenshao

Follow

44 followers · 119 following

Achievements

Achievements

Lists (20)

Sort

ai

61 repositories

apm

c&&cpp

develop

ebpf

11 repositories

golang

21 repositories

kube-edge

kubernetes

41 repositories

linux

monitoring

network

17 repositories

nvidia

24 repositories

python

quant

resources

11 repositories

rust

storage

sys-for-ai

tools

39 repositories

wasm

Stars

shanraisshan / claude-code-best-practice

from vibe coding to agentic engineering - practice makes claude perfect

HTML 53,356 5,343 Updated May 16, 2026

openai / codex

Lightweight coding agent that runs in your terminal

Rust 83,172 12,057 Updated May 17, 2026

agent-infra / sandbox

All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.

Python 4,705 404 Updated May 13, 2026

zeroclaw-labs / zeroclaw

Fast, small, and fully autonomous AI personal assistant infrastructure, ANY OS, ANY PLATFORM — deploy anywhere, swap anything 🦀

Rust 31,393 4,619 Updated May 17, 2026

VoltAgent / awesome-openclaw-skills

The awesome collection of OpenClaw skills. 5,400+ skills filtered and categorized from the official OpenClaw Skills Registry.🦞

48,820 4,781 Updated May 15, 2026

geekjourneyx / md2wechat-skill

Markdown to WeChat CLI | 一键排版发布到微信公众号：支持 40+ 排版样式和专业主题、AI 配图、批量发布

Go 2,243 281 Updated May 14, 2026

nanobrowser / nanobrowser

Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.

TypeScript 12,994 1,364 Updated Nov 24, 2025

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,185 650 Updated May 10, 2026

go-test / deep

Golang deep variable equality test that returns human-readable differences

Go 788 55 Updated Dec 22, 2025

forest0xia / dota2bot-OpenHyperAI

A beta Dota2 Bot Script aims to provide better bot game experience

Lua 250 51 Updated Apr 17, 2026

google-gemini / cookbook

Examples and guides for using the Gemini API

Jupyter Notebook 17,228 2,624 Updated May 14, 2026

openai / circuit_sparsity

Open-source release accompanying Gao et al. 2025

Python 521 53 Updated Dec 11, 2025

sgl-project / rbg

A workload for deploying LLM inference services on Kubernetes

Go 219 56 Updated May 9, 2026

matpool / mpu

A shim driver allows in-docker nvidia-smi showing correct process list without modify anything

C 104 27 Updated Jul 3, 2025

go-resty / resty

Simple HTTP, REST, and SSE client library for Go

Go 11,674 782 Updated May 4, 2026

karlseguin / ccache

A golang LRU Cache for high concurrency

Go 1,395 123 Updated Jan 13, 2026

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 5,529 563 Updated May 17, 2026

bytedance / sonic

A blazingly fast JSON serializing & deserializing library

Go 9,390 443 Updated May 13, 2026

rk8s-dev / rk8s

A lightweight Kubernetes-compatible container orchestration system written in Rust, implementing the Container Runtime Interface (CRI) with support for single containers, Kubernetes-style pods, and…

Rust 506 81 Updated May 9, 2026

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 53,553 7,203 Updated May 5, 2026

jfernandez / bpftop

bpftop provides a dynamic real-time view of running eBPF programs. It displays the average runtime, events per second, and estimated total CPU % for each program.

C 2,676 129 Updated May 3, 2026

ome-projects / ome

Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton

Go 449 79 Updated May 16, 2026

AmberLJC / LLMSys-PaperList

Large Language Model (LLM) Systems Paper List

1,981 101 Updated May 16, 2026

ovg-project / kvcached

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 1,040 111 Updated May 16, 2026

blitz-serving / blitz-scale

The official implementation of OSDI'25 paper BlitzScale

Rust 47 4 Updated Apr 15, 2026

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes for ML SYS.

Python 6,316 417 Updated Apr 23, 2026

NVIDIA / multi-gpu-programming-models

Examples demonstrating available options to program multiple GPUs in a single node or a cluster

Cuda 885 150 Updated Sep 26, 2025

changjonathanc / flex-nano-vllm

FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.

Python 345 19 Updated Nov 2, 2025

wolfecameron / nanoMoE

Forked from karpathy/nanoGPT

An extension of the nanoGPT repository for training small MOE models.

Python 267 32 Updated Mar 9, 2025

datalab-to / marker

Convert PDF to markdown + JSON quickly with high accuracy

Python 35,147 2,442 Updated May 5, 2026