Transient Fault Detection in Tensor Cores for Modern GPUs
Recommendations
Acceleration of Tensor-Product Operations with Tensor Cores
In this article, we explore the acceleration of tensor product operations in finite element methods, leveraging the computational power of the NVIDIA A100 GPU Tensor Cores. We provide an accessible overview of the necessary mathematical background and ...
MixPert: Optimizing Mixed-Precision Floating-Point Emulation on GPU Integer Tensor Cores
LCTES 2024: Proceedings of the 25th ACM SIGPLAN/SIGBED International Conference on Languages, Compilers, and Tools for Embedded Systems
Featuring mixed-precision tensor operations, accelerators significantly enhance performance for many error-tolerant computing tasks, but their applicability is limited in scenarios demanding high precision. While emulating higher-precision data types ...
PTTS: Power-aware tensor cores using two-sided sparsity
Deep neural networks (DNNs) have become the compelling solution for a broad range of applications such as automatic translation, advertisement recommendation, and speech recognition. Matrix multiplication is the fundamental operation ...
Highlights
- GPGPUs based on Tensor Core architecture are power-hungry devices.
- PTTS+ ...
Publisher
Association for Computing Machinery
New York, NY, United States
Qualifiers
- Research-article