Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton
Updated Dec 25, 2025 - Go
Create and administer a local single-node or multi-node Kubernetes cluster, preconfigured for development, with simple commands.