#

sparse-matrix

Here are 12 public repositories matching this topic...

dgSPARSE / dgSPARSE-Lib

PyTorch-Based Fast and Efficient Processing for Various Machine Learning Applications with Diverse Sparsity

sparse-matrix gnn spconv

Updated May 11, 2026
Cuda

karShetty / Torch-Sparse-Multiply

PyTorch Memory Efficient Sparse Sparse Matrix Multiplication

multiplication pytorch sparse sparse-matrix

Updated Aug 12, 2024
Cuda

shreyansh26 / SparseMatrix-Computation-CUDA

Sparse Matrix Computations in CUDA

sparsity cuda cuda-kernels sparse-matrix cuda-programming

Updated Sep 20, 2024
Cuda

jhson989 / SpMM

Parallel Sparse Matrix Multiplication via CUDA

cuda sparse-matrix spmm

Updated Mar 30, 2022
Cuda

aktemur / CSRLenGoto

spmv sparse-matrix

Updated Sep 15, 2017
Cuda

akshittyagi / sparseMatVecMul

Code for Sparse Matrix and Vector multiplication. Parallelised using CUDA and MPI

mpi cuda matrix-multiplication sparse-matrix

Updated May 8, 2017
Cuda

Darkviper7 / cuSpFFT

CUDA sparse binary 2-D FFT with compact CSC input, Bluestein transforms, and cuFFT/SpFFT baselines.

cuda high-performance-computing fft gpu-computing sparse-matrix bluestein cufft matrix-market stockham-fft spfft

Updated May 24, 2026
Cuda

VSJ001 / Cache-Aware-and-GPU-Accelerated-Sparse-Matrix-Vector-Multiplication

CUDA SpMV kernels (scalar, warp-per-row, ELL) on NVIDIA A100 benchmarked against cuSPARSE on SuiteSparse matrices, plus AVX2 + cache-tiled CPU baselines on Intel Xeon Gold. Vector kernel reaches 98-110% of HBM2 peak, beating cuSPARSE by 24-56% on regular matrices.

c performance-engineering cpp hpc gpu parallel-computing cuda nvidia simd high-performance-computing avx2 cuda-kernels spmv sparse-matrix memory-bandwidth cusparse cache-optimization

Updated May 8, 2026
Cuda

nicolaserlonghi / Sparse-Matrix-Transposition-for-GPUs

University project on Sparse Matrix transposition with CUDA.

cpp gpu cuda nvidia sparse-matrix sparse-matrix-transposition

Updated Jan 21, 2021
Cuda

acornjelly2205 / Instruction_Roofline_Analysis

Reproducible Instruction Roofline analysis of cuSPARSE and Ginkgo SpMM on RTX 4090 using Nsight Compute metrics.

cuda performance-analysis sparse-matrix ginkgo roofline cusparse gpu-performance spmm

Updated May 13, 2026
Cuda

Pupking / fft_final

Sparse binary 2D FFT on CUDA/cuFFT with memory-footprint optimization, streaming tiles, Hermitian symmetry, and Nsight analysis.

cuda fft sparse-matrix suitesparse cufft gpu-performance nsight-compute

Updated May 2, 2026
Cuda

chuankaizhao / AppliedParallelProgramming

Machine problems

cuda reduction matrix-multiplication convolution gpu-computing shared-memory sparse-matrix atomic-operation

Updated Dec 12, 2019
Cuda

Improve this page

Add a description, image, and links to the sparse-matrix topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the sparse-matrix topic, visit your repo's landing page and select "manage topics."