Skip to content
#

cublas

Here are 94 public repositories matching this topic...

🔍 Analyze CUDA matrix multiplication performance and power consumption on NVIDIA Jetson Orin Nano across multiple implementations and settings.

  • Updated Dec 18, 2025
  • Python

High-performance GPU-accelerated linear algebra library for scientific computing. Custom kernels outperform cuBLAS+cuSPARSE by 2.4x in iterative solvers. Built for circuit simulation workloads.

  • Updated Dec 6, 2025
  • Cuda
jetson-orin-matmul-analysis

Scientific CUDA benchmarking framework: 4 implementations x 3 power modes x 5 matrix sizes on Jetson Orin Nano. 1,282 GFLOPS peak, 90% performance @ 88% power (25W mode), 99.5% accuracy validation, edge AI deployment guide.

  • Updated Oct 14, 2025
  • Python

Improve this page

Add a description, image, and links to the cublas topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the cublas topic, visit your repo's landing page and select "manage topics."

Learn more