- Cambridge, MA
-
kubernetes Public
Forked from kubernetes/kubernetes
Production-Grade Container Scheduling and Management
-
dynamo Public
Forked from ai-dynamo/dynamo
A Datacenter Scale Distributed Inference Serving Framework
Rust Other Updated Mar 25, 2026 -
agentgateway Public
Forked from agentgateway/agentgateway
Next Generation Agentic Proxy for AI Agents and MCP servers
Rust Apache License 2.0 Updated Feb 24, 2026 -
LMCache Public
Forked from LMCache/LMCache
Supercharge Your LLM with the Fastest KV Cache Layer
Python Apache License 2.0 Updated Feb 23, 2026 -
grove Public
Forked from ai-dynamo/grove
Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling
Go Apache License 2.0 Updated Feb 17, 2026 -
jaxy Public
TensorStore S3 write throughput optimization benchmarks for JAX arrays
Python Updated Feb 14, 2026 -
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 Updated Feb 10, 2026 -
tensorstore-test Public
Minimal tensorstore S3 roundtrip test for Coreweave vhost-style object storage
Python Updated Feb 5, 2026 -
hf-mem Public
Forked from alvarobartt/hf-mem
A CLI to estimate inference memory requirements for Hugging Face models, written in Python.
Python MIT License Updated Jan 27, 2026 -
fireworks Public
Terminal fireworks show - ASCII art celebration in your terminal
-
football-analytics Public
Elite football data analytics dashboard for fans
TypeScript Updated Dec 23, 2025 -
inc-cli Public
CLI for interacting with incident.io for batch automation
-
word-anal Public
GPU-accelerated analysis of 3.5B+ word permutations using CuPy + Numba CUDA kernels on 8x H100
Python Updated Oct 17, 2025 -
TransformerEngine Public
Forked from NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…
Python Apache License 2.0 Updated Aug 25, 2025 -
triton Public
Forked from triton-lang/triton
Development repository for the Triton language and compiler
MLIR MIT License Updated Aug 25, 2025 -
gpu-checkpoint Public
GPU-aware checkpoint/restore system with intelligent strategy selection
Rust Updated Jul 30, 2025 -
containerd Public
Forked from containerd/containerd
An open and reliable container runtime
Go Apache License 2.0 Updated Jul 26, 2025 -
llm-d-model-service Public
Forked from llm-d/llm-d-model-service
Simplified model deployment on llm-d
Go Apache License 2.0 Updated Jul 21, 2025 -
maxtext-nv Public
Forked from AI-Hypercomputer/maxtext
A simple, performant and scalable Jax LLM!
Python Apache License 2.0 Updated Jun 6, 2025