vMaroon

Maroon Ayoub vMaroon

@llm-d contributor

31 followers · 14 following

IBM
in/v-maroon

Achievements

x3 x3

Achievements

x3 x3

Organizations

Stars

vMaroon / sideye

A personal PR-review extension.

Python 1 Updated Mar 9, 2026

red-hat-data-services / kserve

Forked from opendatahub-io/kserve

Standardized Serverless ML Inference Platform on Kubernetes

Go 3 19 Updated Mar 31, 2026

vllm-project / vllm-omni

A framework for efficient model inference with omni-modality models

Python 4,088 669 Updated Apr 1, 2026

ovg-project / kvcached

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 832 95 Updated Apr 1, 2026

IBM / spnl

Span Queries: What if we had a way to plan and optimize GenAI like we do for SQL?

Rust 13 8 Updated Mar 31, 2026

kagenti / kagenti

Main Kagenti repo - installer, UI and docs

Python 164 65 Updated Apr 1, 2026

kubernetes-sigs / inference-perf

GenAI inference performance benchmarking tool

Python 162 77 Updated Mar 25, 2026

llm-d / llm-d-routing-sidecar

Incubating P/D sidecar for llm-d

Go 16 29 Updated Nov 13, 2025

llm-d / llm-d-benchmark

llm-d benchmark scripts and tooling

Python 53 64 Updated Apr 1, 2026

llm-d / llm-d-inference-sim

A lightweight, configurable, and real-time simulator designed to mimic the behavior of vLLM without the need for GPUs or running actual heavy models.

Go 105 68 Updated Apr 1, 2026

llm-d / llm-d-deployer

Helm charts for llm-d

Shell 52 56 Updated Jul 22, 2025

llm-d / llm-d-inference-scheduler

Inference scheduler for llm-d

Go 158 150 Updated Apr 1, 2026

llm-d / llm-d

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,875 384 Updated Apr 1, 2026

llm-d / llm-d-kv-cache

Distributed KV cache scheduling & offloading libraries

Go 122 107 Updated Mar 31, 2026

kfirtoledo / multi-mcp

Python 104 27 Updated Jul 21, 2025

kubernetes-sigs / gateway-api-inference-extension

Gateway API Inference Extension

Go 634 274 Updated Apr 1, 2026

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 74,919 15,050 Updated Apr 1, 2026

deepseek-ai / 3FS

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,785 1,025 Updated Mar 30, 2026

vllm-project / production-stack

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 2,249 380 Updated Mar 31, 2026

kube-agent / kuery

Go 1 Updated Jan 28, 2025

tmc / langchaingo

LangChain for Go, the easiest way to write LLM-based programs in Go

Go 8,981 1,069 Updated Jan 11, 2026

joannj35 / debruijn-sequence-parser

GUI tool for visualizing the result data of deBruijn sequence complexity distribution study

C++ 2 Updated Feb 20, 2024

kubestellar / kubestellar

KubeStellar - a flexible solution for multi-cluster configuration management for edge, multi-cloud, and hybrid cloud

Go 651 258 Updated Mar 30, 2026

stolostron / multicluster-global-hub

the main repository for the multicluster global hub

Go 22 35 Updated Apr 1, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Maroon Ayoub vMaroon

Achievements

Achievements

Organizations

Block or report vMaroon

Stars

vMaroon / sideye

red-hat-data-services / kserve

vllm-project / vllm-omni

ovg-project / kvcached

IBM / spnl

kagenti / kagenti

kubernetes-sigs / inference-perf

llm-d / llm-d-routing-sidecar

llm-d / llm-d-benchmark

llm-d / llm-d-inference-sim

llm-d / llm-d-deployer

llm-d / llm-d-inference-scheduler

llm-d / llm-d

llm-d / llm-d-kv-cache

kfirtoledo / multi-mcp

kubernetes-sigs / gateway-api-inference-extension

vllm-project / vllm

deepseek-ai / 3FS

vllm-project / production-stack

kube-agent / kuery

tmc / langchaingo

joannj35 / debruijn-sequence-parser

kubestellar / kubestellar

stolostron / multicluster-global-hub