Skip to content
Change the repository type filter

All

    Repositories list

    • nixl

      Public
      NVIDIA Inference Xfer Library (NIXL)
      C++
      Other
      3231k43121Updated May 21, 2026May 21, 2026
    • FlexTensor is a tensor offloading and management library for PyTorch that enables running large models on limited GPU memory by intelligently offloading tensors…
      Python
      Apache License 2.0
      1210200Updated May 21, 2026May 21, 2026
    • dynamo

      Public
      A Datacenter Scale Distributed Inference Serving Framework
      Rust
      Other
      1.1k6.8k201568Updated May 21, 2026May 21, 2026
    • Offline optimization of your disaggregated Dynamo graph
      Python
      Apache License 2.0
      1213052349Updated May 21, 2026May 21, 2026
    • aiperf

      Public
      AIPerf is a comprehensive benchmarking tool that measures the performance of generative AI models served by your preferred inference solution.
      Python
      Apache License 2.0
      853202276Updated May 21, 2026May 21, 2026
    • Model Express is a Rust-based component meant to be placed next to existing model inference systems to speed up their startup times and improve overall performa…
      Rust
      Apache License 2.0
      2464922Updated May 21, 2026May 21, 2026
    • grove

      Public
      Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling
      Go
      Apache License 2.0
      632103930Updated May 20, 2026May 20, 2026
    • enhancements

      Public
      Enhancement Proposals and Architecture Decisions
      Apache License 2.0
      169153Updated May 19, 2026May 19, 2026
    • velo

      Public
      Rust
      Apache License 2.0
      1403Updated May 12, 2026May 12, 2026
    • aitune

      Public
      NVIDIA AITune is an inference toolkit designed for tuning and deploying Deep Learning models with a focus on NVIDIA GPUs.
      Python
      Apache License 2.0
      3027020Updated Mar 13, 2026Mar 13, 2026
    • .github

      Public
      3101Updated Aug 21, 2025Aug 21, 2025
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.