NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 12,332 2,264 Updated Nov 6, 2025

uxlfoundation / oneTBB

oneAPI Threading Building Blocks (oneTBB)

C++ 6,416 1,142 Updated Nov 6, 2025

arrayfire / arrayfire

ArrayFire: a general purpose GPU library.

C++ 4,813 551 Updated Sep 5, 2025

NVlabs / tiny-cuda-nn

Lightning fast C++/CUDA neural network framework

C++ 4,293 531 Updated Oct 13, 2025

ROCm / hip

HIP: C++ Heterogeneous-Compute Interface for Portability

C++ 4,224 573 Updated Nov 4, 2025

iree-org / iree

A retargetable MLIR-based machine learning compiler and runtime toolkit.

C++ 3,445 787 Updated Nov 6, 2025

CVCUDA / CV-CUDA

CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.

C++ 2,597 241 Updated May 21, 2025

NVIDIA / CUDALibrarySamples

CUDA Library Samples

C++ 2,161 416 Updated Nov 6, 2025

NVIDIA / stdexec

`std::execution`, the proposed C++ framework for asynchronous and parallel programming.

C++ 2,073 209 Updated Nov 2, 2025

NVIDIA / cccl

CUDA Core Compute Libraries

C++ 2,010 286 Updated Nov 6, 2025

NVlabs / nvdiffrast

Nvdiffrast - Modular Primitives for High-Performance Differentiable Rendering

C++ 1,709 201 Updated Oct 22, 2025

hpcaitech / TensorNVMe

A Python library transfers PyTorch tensors between CPU and NVMe

C++ 120 27 Updated Nov 27, 2024

causalflow-ai / petit-kernel

Optimized FP16/BF16 x FP4 GPU kernels for AMD GPUs

C++ 34 6 Updated Oct 9, 2025

ROCm / rocDecode

rocDecode is a high performance video decode SDK for AMD hardware

C++ 31 24 Updated Nov 5, 2025

ROCm / libhipcxx

The C++ Standard Library for your entire system.

C++ 22 4 Updated Apr 24, 2025

AMD-HPC / CoralGemm

C++ 15 9 Updated Oct 30, 2025

ROCm / hipBench

HIP Kernel Benchmarking Library

C++ 8 3 Updated Apr 25, 2025

johnpzh / cudnn_samples_v9

cuDNN samples v9.x

C++ 4 1 Updated Feb 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DoubleRedX DoubleRedX

Achievements

Achievements

Block or report DoubleRedX

Lists (2)

image_generation

video_generation

Starred repositories

facebook / folly

78 / xiaozhi-esp32

pybind / pybind11

ggml-org / ggml

NVIDIA / TensorRT