Heterogeneous GPU Sharing on Kubernetes
-
Updated
May 17, 2026 - Go
NVIDIA Corporation is a company that manufactures graphics processors, mobile technologies, and desktop computers. It is known for developing integrated circuits, which are used in everything from electronic game consoles to personal computers (PCs). The company is a leading manufacturer of high-end graphics processing units (GPUs).
Heterogeneous GPU Sharing on Kubernetes
NVIDIA GPU Operator creates, configures, and manages GPUs in Kubernetes
Nvidia GPU exporter for prometheus using nvidia-smi binary
eBPF based always-on CPU/GPU profiler auto-discovering targets in Kubernetes and systemd, zero code changes or restarts needed!
🏗️ Fine-tune, build, and deploy open-source LLMs easily!
GPUd automates monitoring, diagnostics, and issue identification for GPUs
High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model discovery across local and remote inference backends.
A cross-platform go bot that tracks for availability of stock from Nvidia's store and adds a cart to your checkout.
Tensor Fusion is a state-of-the-art GPU virtualization and pooling solution designed to optimize GPU cluster utilization to its fullest potential.
Kubernetes operator for local LLM inference with llama.cpp, vLLM, and TGI - multi-GPU, autoscaling, air-gapped, production-ready
Prometheus Exporter for NVIDIA GPUs using NVML
A k8s device plugin for scheduling and allocating vGPU devices.
nvidiagpubeat is an elastic beat that uses NVIDIA System Management Interface (nvidia-smi) to monitor NVIDIA GPU devices and can ingest metrics into Elastic search cluster, with support for both 6.x and 7.x versions of beats. nvidia-smi is a command line utility, based on top of the NVIDIA Management Library (NVML), intended to aid in the manage…
A light-weight container runtime for Linux with NVIDIA gpu support, allows developers to quicky setup development environments for dev and test. Pavlos can emulate any Linux rootfs image as a container.
A collection of projects, hands-on demos, and best practices built by the DigitalOcean community.
NVIDIA GPU acceleration patches for Stash - enables CUDA decoding and NVENC encoding for preview/sprite/phash generation tasks
golang wrapper for NVIDIA Management Library (NVML)
Simple Fast CLI Tool for Monitoring Nvidia GPU Using Nouveau Driver Written in Go
Created by Jensen Huang, Curtis Priem, Chris Malachowsky
Released April 5, 1993