from vibe coding to agentic engineering - practice makes claude perfect

HTML 53,356 5,343 Updated May 16, 2026

Lightweight coding agent that runs in your terminal

Rust 83,172 12,057 Updated May 17, 2026

All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.

Python 4,705 404 Updated May 13, 2026

Fast, small, and fully autonomous AI personal assistant infrastructure, ANY OS, ANY PLATFORM — deploy anywhere, swap anything 🦀

Rust 31,393 4,619 Updated May 17, 2026

The awesome collection of OpenClaw skills. 5,400+ skills filtered and categorized from the official OpenClaw Skills Registry.🦞

48,820 4,781 Updated May 15, 2026

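Markdown to WeChat CLI | One-click formatting and publishing to WeChat Official Accounts: supports 40+ layout styles and professional themes, AI-generated images, and batch publishing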

Go 2,243 281 Updated May 14, 2026

Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.

TypeScript 12,994 1,364 Updated Nov 24, 2025

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,185 650 Updated May 10, 2026

Golang deep variable equality test that returns human-readable differences

Go 788 55 Updated Dec 22, 2025

A beta Dota2 bot script that aims to provide a better bot game experience

Lua 250 51 Updated Apr 17, 2026

Examples and guides for using the Gemini API

Jupyter Notebook 17,228 2,624 Updated May 14, 2026

Open-source release accompanying Gao et al. 2025

Python 521 53 Updated Dec 11, 2025

A workload for deploying LLM inference services on Kubernetes

Go 219 56 Updated May 9, 2026

A shim driver that lets in-Docker nvidia-smi show the correct process list without modifying anything

C 104 27 Updated Jul 3, 2025

Simple HTTP, REST, and SSE client library for Go

Go 11,674 782 Updated May 4, 2026

A golang LRU Cache for high concurrency

Go 1,395 123 Updated Jan 13, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,529 563 Updated May 17, 2026

A blazingly fast JSON serializing & deserializing library

Go 9,390 443 Updated May 13, 2026

A lightweight Kubernetes-compatible container orchestration system written in Rust, implementing the Container Runtime Interface (CRI) with support for single containers, Kubernetes-style pods, and…

Rust 506 81 Updated May 9, 2026

The best ChatGPT that $100 can buy.

Python 53,553 7,203 Updated May 5, 2026

bpftop provides a dynamic real-time view of running eBPF programs. It displays the average runtime, events per second, and estimated total CPU % for each program.

C 2,676 129 Updated May 3, 2026

Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton

Go 449 79 Updated May 16, 2026

Large Language Model (LLM) Systems Paper List

1,981 101 Updated May 16, 2026

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 1,040 111 Updated May 16, 2026

The official implementation of OSDI'25 paper BlitzScale

Rust 47 4 Updated Apr 15, 2026

My learning notes for ML SYS.

Python 6,316 417 Updated Apr 23, 2026

Examples demonstrating available options to program multiple GPUs in a single node or a cluster

Cuda 885 150 Updated Sep 26, 2025

FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.

Python 345 19 Updated Nov 2, 2025

An extension of the nanoGPT repository for training small MOE models.

Python 267 32 Updated Mar 9, 2025

Convert PDF to markdown + JSON quickly with high accuracy

Python 35,147 2,442 Updated May 5, 2026