- @DaoCloud Co., Ltd., Shanghai
- Shanghai, China
- UTC +08:00
- https://pacoxu.wordpress.com/
- @pacoxu.bsky.social
- @xu_paco
- in/pacoxu2020
- https://sessionize.com/pacoxu/
Starred repositories
Fast and easy to use database for logs, which can efficiently handle terabytes of logs
开源面对面 (Open Source Face-to-Face): connecting people who love open source. Episodes of the open-source face-to-face talk.
Bridge is a multi-level proxy that supports clients and servers with multiple protocols. SSHProxy, HTTPProxy, Socks4, Socks5, Shadowsocks.
Manages Envoy Proxy as a Standalone or Kubernetes-based Application Gateway
Discover ingress-nginx usage and auto-generate Gateway API migration plans before ingress-nginx reaches end-of-life (March 2026).
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
A framework for few-shot evaluation of language models.
CUDA Templates and Python DSLs for High-Performance Linear Algebra
DOCA Platform manages provisioning and service orchestration for Bluefield DPUs
NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments
A cross-platform GUI application for easily downloading Hugging Face models without requiring technical knowledge or setup.
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond
A high-performance and light-weight router for vLLM large scale deployment
[Survey] Towards Efficient Large Language Model Serving: A Survey on System-Aware KV Cache Optimization
Intelligent automation and multi-agent orchestration for Claude Code
A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM
The batch gateway is an llm-d implementation of the OpenAI batch inference API
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
Specification and documentation for the Universal Commerce Protocol (UCP)
A lightweight macOS menubar hub for your GitHub works.
Open-source, secure environment with real-world tools for enterprise-grade agents.
An Open-source, self-hosted AI model hub with Hugging Face compatibility, accelerating vLLM/SGLang performance.