JIT-compiled GPU kernels for quantum chemistry
-
Updated
Nov 11, 2025 - Cuda
JIT-compiled GPU kernels for quantum chemistry
cuVS - a library for vector search and clustering on the GPU
🚀 Build a fast inference engine for the QWEN3-0.6B model using CUDA, optimizing performance with minimal dependencies for efficient learning and practice.
A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!
FlashInfer: Kernel Library for LLM Serving
FLAME GPU 2 is a GPU accelerated agent based modelling framework for CUDA C++ and Python
GPU accelerated decision optimization
cuGraph - RAPIDS Graph Analytics Library
Graphics Processing Units Molecular Dynamics
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.
GPU-accelerated triangle mesh processing
A high-performance computing project that implements the moment representation of the Lattice Boltzmann Method (LBM) on Nvidia GPUs using CUDA.
CUDA work-efficient scan, stream compaction, and radix sort
Real-time N-Body Algorithm Accelerated With CUDA and FFT (for up to 1 billion particles with multi GPU)
High-performance computing project for multicomponent computational fluid dynamics (CFD) simulations using the Lattice Boltzmann Method (LBM) on NVIDIA GPUs with CUDA C++.
A dynamic binary instrumentation tool for tracing and analyzing CUDA kernel instructions.
100 days of writing CUDA kernels!
GPU Framework for Radio Astronomical Image Synthesis
Add a description, image, and links to the gpu topic page so that developers can more easily learn about it.
To associate your repository with the gpu topic, visit your repo's landing page and select "manage topics."