All

11 repositories

nixl
Public
NVIDIA Inference Xfer Library (NIXL)
C++
•
Other
•323•1k•43•121•Updated May 21, 2026May 21, 2026
flextensor
Public
FlexTensor is a tensor offloading and management library for PyTorch that enables running large models on limited GPU memory by intelligently offloading tensors…
Python
•
Apache License 2.0
•12•102•0•0•Updated May 21, 2026May 21, 2026
dynamo
Public
A Datacenter Scale Distributed Inference Serving Framework
Rust
•
Other
•1.1k•6.8k•201•568•Updated May 21, 2026May 21, 2026
aiconfigurator
Public
Offline optimization of your disaggregated Dynamo graph
Python
•
Apache License 2.0
•121•305•23•49•Updated May 21, 2026May 21, 2026
aiperf
Public
AIPerf is a comprehensive benchmarking tool that measures the performance of generative AI models served by your preferred inference solution.
Python
•
Apache License 2.0
•85•320•22•76•Updated May 21, 2026May 21, 2026
modelexpress
Public
Model Express is a Rust-based component meant to be placed next to existing model inference systems to speed up their startup times and improve overall performa…
Rust
•
Apache License 2.0
•24•64•9•22•Updated May 21, 2026May 21, 2026
grove
Public
Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling
kubernetes gpu inference
kubernetes gpu inference operator auto-scaling role-based grove multinode auto-scaling-group gang-scheduling
Go
•
Apache License 2.0
•63•210•39•30•Updated May 20, 2026May 20, 2026
enhancements
Public
Enhancement Proposals and Architecture Decisions
Apache License 2.0
•16•9•1•53•Updated May 19, 2026May 19, 2026
velo
Public
Rust
•
Apache License 2.0
•1•4•0•3•Updated May 12, 2026May 12, 2026
aitune
Public
NVIDIA AITune is an inference toolkit designed for tuning and deploying Deep Learning models with a focus on NVIDIA GPUs.
deep-learning inference nvidia
deep-learning inference nvidia nvidia-gpu
Python
•
Apache License 2.0
•30•270•2•0•Updated Mar 13, 2026Mar 13, 2026
.github
Public
3•1•0•1•Updated Aug 21, 2025Aug 21, 2025

ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dynamo

All

All

11 repositories

nixl

flextensor

dynamo

aiconfigurator

aiperf

modelexpress

grove

enhancements

velo

aitune

.github

All

All

Repositories list

11 repositories