Stars
Beginner, advanced, expert level Rust training material
Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞
Run OpenClaw more securely inside NVIDIA OpenShell with managed inference
A simple, fast and robust program-aware agentic inference system.
AI agents running research on single-GPU nanochat training automatically
High-Performance KV Cache Storage Engine on CXL Shared Memory for LLM Inference
Spacer: Towards Engineered Scientific Inspiration
A Claude Code skill that turns PDFs, docs, and codebases into Obsidian study vaults
Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini
Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond
Accelerating Long Context LLM Inference with Accuracy-Preserving Context Optimization in SGLang, vLLM, llama.cpp, RAG, and Agentic AI.
[Survey] Towards Efficient Large Language Model Serving: A Survey on System-Aware KV Cache Optimization
Disaggregated serving system for Large Language Models (LLMs).
Mini website for testing general CS knowledge and enforcing coding practice and memorization of common algorithms and data structures.
Transparent Proxy Implementation using eBPF and Go
Demo repository for all the different ways to do eBPF Tracing
This repo contains various examples to learn, explore, and experiment with eBPF.
Website for Artifact Evaluation at EuroSys, SOSP, OSDI, ATC
A Datacenter Scale Distributed Inference Serving Framework