cublas
Here are 31 public repositories matching this topic...
GPGPU Inverse Distance Weighting using matrix vector multiplication
-
Updated
Dec 5, 2017 - Cuda
Generalized Orthogonal Least-Squares in CUDA
-
Updated
Apr 21, 2018 - Cuda
Level 3 matrix multiplication using both cublas and mkl.
-
Updated
Jul 20, 2018 - Cuda
A MNIST handwritten digit classifier written from scratch in Cuda - C
-
Updated
Nov 12, 2019 - Cuda
Lab exercise of Parallel Processing course in NTUA regarding CUDA programming
-
Updated
Mar 3, 2020 - Cuda
A CUDA approach for computing the multiplication of a transposed matrix with the initial one, using the cuBLAS library.
-
Updated
Sep 28, 2021 - Cuda
Algorithms implemented in CUDA + resources about GPGPU
-
Updated
Jan 18, 2022 - Cuda
code for benchmarking GPU performance based on cublasSgemm and cublasHgemm
-
Updated
May 20, 2022 - Cuda
Use tensor core to calculate back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instruction.
-
Updated
Nov 3, 2023 - Cuda
A CUBLAS‐CUDA Based Implementation of Multi-GPU Large Matrix Multiplication
-
Updated
Feb 18, 2024 - Cuda
Matrix Exponential Approximation using CUDA
-
Updated
Mar 20, 2024 - Cuda
Nonnegative matrix factorizations using CUDA
-
Updated
Mar 21, 2024 - Cuda
Improve this page
Add a description, image, and links to the cublas topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the cublas topic, visit your repo's landing page and select "manage topics."