Starred repositories
An open-source C++ library developed and used at Facebook.
Seamless operability between C++11 and Python
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
oneAPI Threading Building Blocks (oneTBB)
Lightning fast C++/CUDA neural network framework
HIP: C++ Heterogeneous-Compute Interface for Portability
A retargetable MLIR-based machine learning compiler and runtime toolkit.
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
`std::execution`, the proposed C++ framework for asynchronous and parallel programming.
Nvdiffrast - Modular Primitives for High-Performance Differentiable Rendering
A Python library that transfers PyTorch tensors between CPU and NVMe
Optimized FP16/BF16 x FP4 GPU kernels for AMD GPUs
rocDecode is a high performance video decode SDK for AMD hardware