Stars
CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-based computation patterns and optimizations targeting NVIDIA te…
Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes
Fast, Flexible and Portable Structured Generation
A retargetable MLIR-based machine learning compiler and runtime toolkit.
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
verl: Volcano Engine Reinforcement Learning for LLMs
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
TPU inference for vLLM, with unified JAX and PyTorch support.
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang
Supercharge Your LLM with the Fastest KV Cache Layer
A high-throughput and memory-efficient inference and serving engine for LLMs
A machine learning compiler for GPUs, CPUs, and ML accelerators
PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily write your own.
Declarative Continuous Deployment for Kubernetes
Lizard is the visual verification debugger for Viper IDE
Exercises for the Big Data lecture at ETH Zurich (Fall 2025)
A set of exercises to prepare for Certified Kubernetes Application Developer exam by Cloud Native Computing Foundation
Extension for Visual Studio Code - Intellisense in helm-templates from the values.yaml
This is the Rust course used by the Android team at Google. It provides you the material to quickly teach Rust.
A massively parallel, optimal functional runtime in Rust
Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
A DAP-compatible JavaScript debugger. Used in VS Code, VS, + more
View deoptimizations of your JavaScript in V8
Carbon Language's main repository: documents, design, implementation, and related tools. (NOTE: Carbon Language is experimental; see README)