Skip to content
View pacoxu's full-sized avatar
be kind
be kind

Block or report pacoxu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Fast and easy to use database for logs, which can efficiently handle terabytes of logs

Go 1,507 100 Updated Feb 16, 2026

开源面对面,连接热爱开源的你!Episodes for the open-source face-to-face talk!

Go Template 298 21 Updated Feb 8, 2026

🤖 AI Gateway | AI Native API Gateway

Go 7,540 989 Updated Feb 16, 2026

Bridge is a multi-level proxy that supports clients and servers with multiple protocols. SSHProxy, HTTPProxy, Socks4, Socks5, Shadowsocks.

Go 204 22 Updated Feb 2, 2026

Visual Causal Flow

Python 2,265 173 Updated Feb 3, 2026

Manages Envoy Proxy as a Standalone or Kubernetes-based Application Gateway

Go 2,515 672 Updated Feb 16, 2026

Discover ingress-nginx usage and auto-generate Gateway API migration plans before ingress-nginx reaches end-of-life (March 2026).

Go 15 Updated Nov 26, 2025

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 200,921 35,805 Updated Feb 16, 2026

A framework for few-shot evaluation of language models.

Python 11,430 3,039 Updated Feb 15, 2026

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,285 1,686 Updated Feb 14, 2026

DOCA Platform manages provisioning and service orchestration for Bluefield DPUs

Go 77 20 Updated Feb 16, 2026

NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments

Go 180 46 Updated Feb 16, 2026

pod single process oom

Go 1 1 Updated Nov 21, 2025

A cross-platform GUI application for easily downloading Hugging Face models without requiring technical knowledge or setup.

Python 22 3 Updated Nov 26, 2025

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 3,491 435 Updated Feb 11, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,179 818 Updated Feb 3, 2026

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 781 86 Updated Feb 15, 2026

A high-performance and light-weight router for vLLM large scale deployment

Rust 121 39 Updated Feb 16, 2026

[Survey] Towards Efficient Large Language Model Serving: A Survey on System-Aware KV Cache Optimization

Python 325 10 Updated Jan 18, 2026

Intelligent automation and multi-agent orchestration for Claude Code

Python 28,716 3,145 Updated Feb 7, 2026

A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

Python 241 39 Updated Feb 16, 2026

The batch gateway is an llm-d implementation of the OpenAI batch inference API

Go 4 6 Updated Feb 16, 2026
Go 3 Updated Feb 10, 2026

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 3,693 255 Updated Jan 14, 2026

Specification and documentation for the Universal Commerce Protocol (UCP)

TypeScript 2,330 273 Updated Feb 13, 2026

A lightweight macOS menubar hub for your GitHub works.

Swift 3 Updated Feb 9, 2026

Open-source, secure environment with real-world tools for enterprise-grade agents.

MDX 10,909 771 Updated Feb 16, 2026

An Open-source, self-hosted AI model hub with Hugging Face compatibility, accelerating vLLM/SGLang performance.

Go 30 10 Updated Feb 12, 2026
Next