nsight
Here are 4 public repositories matching this topic...
🎬 Explore GPU training efficiency with FP32 vs FP16 in this modular lab, utilizing Tensor Core acceleration for deep learning insights.
-
Updated
Sep 6, 2025 - Python
A reproducible GPU benchmarking lab that compares FP16 vs FP32 training on MNIST using PyTorch, CuPy, and Nsight profiling tools. This project blends performance engineering with cinematic storytelling—featuring NVTX-tagged training loops, fused CuPy kernels, and a profiler-driven README that narrates the GPU’s inner workings frame by frame.
-
Updated
Sep 5, 2025 - Python
Improve this page
Add a description, image, and links to the nsight topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the nsight topic, visit your repo's landing page and select "manage topics."