-
cudarc
Safe and minimal CUDA bindings
-
neptune
Poseidon hashing over BLS12-381 for Filecoin
-
bindgen_cuda
Bindgen like interface to build cuda kernels to interact with within Rust
-
hvm
A massively parallel, optimal functional runtime in Rust
-
iron_learn
ML library with GPU-accelerated gradient descent. Supports tensors, complex numbers, linear/logistic regression, and CUDA optimization.
-
candle-kernels
CUDA kernels for Candle
-
mwa_hyperbeam
Primary beam code for the Murchison Widefield Array (MWA) radio telescope
-
getenv
Getenv.rs
-
infernum
CLI - From the depths, intelligence rises
-
async-cuda
Async CUDA for Rust
-
ringkernel-cuda
CUDA backend for RingKernel - NVIDIA GPU support via cudarc
-
llama-cpp-sys-2
Low Level Bindings to llama.cpp
-
llmux
Zero-reload model switching for vLLM - manages multiple models on shared GPU
-
ec-gpu
Traits for field and eliptic curve operations on GPUs
-
torch_poetry_bootstrap
A command-line tool to detect CUDA version and install the appropriate PyTorch wheel via Poetry
-
zfp-sys
Raw Rust bindings to ZFP (https://github.com/LLNL/zfp)
-
cudaforge
Advanced CUDA kernel builder for Rust with incremental builds, auto-detection, and external dependency support
-
async-tensorrt
Async TensorRT for Rust
-
cuda-rust-wasm
CUDA to Rust transpiler with WebGPU/WASM support
-
pylate-rs
WebAssembly library for late interaction models
-
perdix
High-performance GPU-accelerated ring buffer for AI terminal multiplexing
-
xdl-amp
Multi-backend GPU/ML acceleration for XDL
-
mwa_hyperdrive
Calibration software for the Murchison Widefield Array (MWA) radio telescope
-
gpu-scatter-gather
World's fastest wordlist generator using GPU acceleration with multi-GPU support
-
cuvs
RAPIDS vector search library
-
ec-gpu-gen
Code generator for field and eliptic curve operations on the GPUs
-
nam-ec-gpu-gen
Code generator for field and elliptic curve operations on the GPUs
-
ringkernel-cuda-codegen
CUDA code generation from Rust DSL for RingKernel stencil kernels
-
cuda-device-query
CUDA
deviceQuery.cppport written in Rust withcudarc -
autd3-backend-cuda
CUDA Backend for AUTD3
-
with-gpu
Intelligent GPU selection wrapper for CUDA commands
-
kitsune-stt
Speech-to-Text tool using Candle and Voxtral
-
tensor_frame
A PyTorch-like tensor library for Rust with CPU, WGPU, and CUDA backends
-
sbv2_core
Style-Bert-VITSの推論ライブラリ
-
abaddon
LLM inference engine - The Destroyer renders judgment
-
sass-assembler
SASS (NVIDIA GPU) assembler for Gaia project
-
trueno-gpu
Pure Rust PTX generation for NVIDIA CUDA - no LLVM, no nvcc
-
optirs-gpu
OptiRS GPU acceleration and multi-GPU optimization
-
supraseal-c2
CUDA Groth16 proof generator for Filecoin
-
haagenti-cuda
CUDA GPU decompression kernels for Haagenti tensor compression
-
burn-cuda
CUDA backend for the Burn framework
-
hive-gpu
High-performance GPU acceleration for vector operations with Device Info API (Metal, CUDA, ROCm)
-
crown
A cryptographic library
-
pasta-msm
Optimized multiscalar multiplicaton for Pasta moduli for x86_64 and aarch64
-
cuda-driver-sys
Rust binding to CUDA Driver APIs
-
nvidia-video-codec-sdk
Bindings for NVIDIA Video Codec SDK
-
torsh-profiler
Performance profiling and monitoring for ToRSh
-
ringkernel-graph
GPU-accelerated graph algorithm primitives
-
kn-cuda-sys
A wrapper around the CUDA APIs
-
torsh-backend
Backend abstraction layer for ToRSh
-
cuda-runtime-sys
Rust binding to CUDA Runtime APIs
-
icicle-core
GPU ZK acceleration by Ingonyama
-
piper-tts-rs
Piper-TTS implementation in Rust
-
tesser-cortex
High-performance, hardware-agnostic AI inference engine for Tesser
-
luminal_cudarc
Safe wrappers around CUDA apis
-
ringkernel-montecarlo
GPU-accelerated Monte Carlo primitives for variance reduction
-
RayBNN_DataLoader
Read CSV, numpy, and binary files to Rust vectors of f16, f32, f64, u8, u16, u32, u64, i8, i16, i32, i64
-
crown-bin
A cryptographic library
-
RayBNN_Raytrace
Ray tracing library using GPUs, CPUs, and FPGAs via CUDA, OpenCL, and oneAPI
-
rcudnn
safe Rust wrapper for CUDA's cuDNN
-
jawe-cuvs-iii
RAPIDS vector search library
-
hpt-cudakernels
implements cuda kernels for hpt
-
jawe-cuvs-iv
RAPIDS vector search library
-
RayBNN_Cell
Cell Position Generator for RayBNN
-
tropical-gemm-cuda
CUDA backend for tropical matrix multiplication
-
rcublas
safe Rust wrapper for CUDA's cuBLAS
-
cublas
safe Rust wrapper for CUDA's cuDNN
-
scir-gpu
SciR GPU foundations: device arrays and CUDA (feature-gated) elementwise/FIR kernels with CPU parity
-
hodu_cuda_kernels
hodu cuda kernels
-
cuda-config
Helper crate for finding CUDA libraries
-
rcudnn-sys
FFI bindings to cuDNN
-
llama-cpp-sys-4
Low Level Bindings to llama.cpp
-
icicle-cuda-runtime
Ingonyama's Rust wrapper of CUDA runtime
-
cudnn
safe Rust wrapper for CUDA's cuDNN
-
nam-supraseal-c2
CUDA Groth16 proof generator for Filecoin
-
crseo-sys
Cuda Engined Optics Rust Interface
-
jawe-cuvs-sys-ii
Low-level rust bindings to libcuvs
-
emixai
Feature-gated AI helpers (audio, imaging, language, vision) for EssentialMix
-
luminal_cuda
Cuda compiler for luminal
-
cmake-init
Initialize CMake project at speed
-
zenu-cuda
CUDA bindings for Rust
-
fellhorn-llama-cpp-sys-2
Low Level Bindings to llama.cpp
-
accel
GPGPU Framework for Rust
-
cuda
CUDA bindings
-
bevy_cuda
CUDA integration for Bevy game engine
-
RayBNN_Optimizer
Gradient Descent Optimizers and Genetic Algorithms using GPUs, CPUs, and FPGAs via CUDA, OpenCL, and oneAPI
-
shimmy-llama-cpp-sys-2
Low Level Bindings to llama.cpp with MoE CPU offloading support
-
cuda_bindgen
Bindgen like interface to build cuda kernels to interact with within Rust
-
whisper-cpp-plus-sys
Low-level FFI bindings for whisper.cpp
-
async-cuda-npp
Async NVIDIA Performance Primitives for Rust
-
candle_embed
Text embeddings with Candle. Fast and configurable. Use any model from Hugging Face. CUDA or CPU powered.
-
cudarse-driver
Bindings to the CUDA Driver API that tries to stay faithful to the original
-
cudnn-sys
FFI bindings to cuDNN
-
silero-vad-rs
Silero Voice Activity Detection
-
jawe-cuvs-sys-iv
Low-level rust bindings to libcuvs
-
torsh-core
Core types and traits for ToRSh deep learning framework
-
jawe-cuvs-sys-iii
Low-level rust bindings to libcuvs
-
cuda-oxide
high-level, rusty wrapper over CUDA. It provides the best safety one can get when working with hardware.
-
darknet-sys
-sys crate for Rust darknet wrapper
-
crown-jsasm
A cryptographic library
-
easy-tensorrt-core
Rust wrapper for NVIDIA TensorRT
-
cufile
Safe Rust bindings for NVIDIA CuFile library
-
oxidized-transformers
Transformers library (not functional yet)
-
memonitor
Query CPU and GPU memory information in a portable way
-
rcublas-sys
FFI bindings to cuBLAS
-
RayBNN_Neural
Neural Networks with Sparse Weights in Rust using GPUs, CPUs, and FPGAs via CUDA, OpenCL, and oneAPI
-
easy-tensorrt-sys
Rust binding to NVIDIA TensorRT, forked from tensorrt-rs-sys
-
libdebayer
debayer images with CUDA
-
tensorgraph-sys
backbone for tensorgraph, providing memory manamagement across devices
-
cuda11-cudart-sys
cuda ffi
-
cuda_d3d11_interop_bindings
Register and map D3D11 buffers with CUDA
-
cuda-colorspace-kernel
Colorspace handling on CUDA (device code)
-
cuda11-cuda-sys
cuda ffi
-
RayBNN_Graph
Graph Manipulation Library For GPUs, CPUs, and FPGAs via CUDA, OpenCL, and oneAPI
-
ptoxide
A virtual machine to execute CUDA PTX without a GPU
-
zenu-cuda-config
CUDA configuration for Zenu
-
simt_cuda_sys
part of simt. cuda driver api bindings
-
ug-llama
Micro compiler for tensor operations
-
tensorgraph-math
backbone for tensorgraph, providing math primitives
-
cuda_dnn
cuDNN API bindings
-
ulib
Universal data storage library for CPU/GPU heterogeneous applications
-
babichjacob-llama-cpp-sys-2
Low Level Bindings to llama.cpp
-
cudi
A small tool for displaying CUDA device properties
-
cufile-sys
Raw FFI bindings for NVIDIA CuFile library
-
nvrtc
Bindings for NVIDIA® CUDA™ NVRTC in Rust
-
torch-build
link libtorch FFI interface
-
tensorrt-rs-sys
Rust binding to NVIDIA TensorRT
-
del-msh-cudarc
2D/3D Mesh processing using Cuda for scientific prototyping
-
zenu-cudnn-sys
Rust bindings for cuDNN
-
zenu-cuda-driver-sys
Rust bindings for CUDA Driver API
-
zenu-cublas-sys
Rust bindings for cuBLAS
-
zenu-cuda-kernel-sys
CUDA kernel bindings for Rust
-
zenu-cuda-runtime-sys
CUDA runtime bindings for Rust
-
galois-kernels
galois cuda kernels
-
bullet
Supersonic Math
-
grumpkin-msm
Optimized multiscalar multiplicaton for the Grumpkin curve cycle
Try searching with DuckDuckGo.