gpu
Here are 396 public repositories matching this topic...
FlashInfer: Kernel Library for LLM Serving
-
Updated
Nov 10, 2025 - Cuda
cuGraph - RAPIDS Graph Analytics Library
-
Updated
Nov 10, 2025 - Cuda
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.
-
Updated
Nov 8, 2025 - Cuda
Graphics Processing Units Molecular Dynamics
-
Updated
Nov 10, 2025 - Cuda
cuVS - a library for vector search and clustering on the GPU
-
Updated
Nov 10, 2025 - Cuda
GPU Accelerated t-SNE for CUDA with Python bindings
-
Updated
Oct 2, 2024 - Cuda
PopSift is an implementation of the SIFT algorithm in CUDA.
-
Updated
Oct 27, 2025 - Cuda
GPU accelerated decision optimization
-
Updated
Nov 10, 2025 - Cuda
CUDA Kernel Benchmarking Library
-
Updated
Oct 21, 2025 - Cuda
Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.
-
Updated
Sep 8, 2024 - Cuda
SDK for GPU accelerated genome assembly and analysis
-
Updated
May 3, 2024 - Cuda
CUDA Matrix Factorization Library with Alternating Least Square (ALS)
-
Updated
Aug 14, 2018 - Cuda
A simple GPU hash table implemented in CUDA using lock free techniques
-
Updated
Feb 7, 2024 - Cuda
Static suckless single batch CUDA-only qwen3-0.6B mini inference engine
-
Updated
Sep 8, 2025 - Cuda
Improve this page
Add a description, image, and links to the gpu topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the gpu topic, visit your repo's landing page and select "manage topics."