Impact of Tensor Cores and Mixed Precision on the Reliability of Matrix Multiplication in GPUs

Pedro Martins Basso,Fernando Fernandes Dos Santos,Paolo Rech

doi:10.1109/tns.2020.2977583

Abstract

Matrix multiplication (MxM) is a cornerstone application for both high-performance computing and safety-critical applications. Most of the operations in convolutional neural networks for object detection, in fact, are MxM related. Chip designers are proposing novel solutions to improve the efficiency of the execution of MxM. In this article, we investigate the impact of two novel architectures for MxM (i.e., tensor cores and mixed precision) on the graphics processing units (GPUs) reliability. In addition, we evaluate how effective the embedded error-correcting code is in reducing the MxM error rate. Our results show that low-precision operations are more reliable, and the tensor core increases the amount of data correctly produced by the GPU. However, reducing precision and the use of tensor core significantly increase the impact of faults in the output correctness.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Impact of Tensor Cores and Mixed Precision on the Reliability of Matrix Multiplication in GPUs

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Nuclear Science

Lead the way for us

Journal: IEEE Transactions on Nuclear Science	Publication Date: Mar 6, 2020
Citations: 36

Similar Papers

NVIDIA Tensor Core Programmability, Performance & Precision
Stefano Markidis ... Ivy Bo Peng
-
Stefano Markidis, et. al.Stefano Markidis ... Ivy Bo Peng
01 May 2018
01 May 2018

Demystifying GPU Reliability: Comparing and Combining Beam Experiments, Fault Simulation, and Profiling
Fernando Fernandes Dos Santos ... Siva Kumar Sastry Hari
-
Fernando Fernandes Dos Santos, et. al.Fernando Fernandes Dos Santos ... Siva Kumar Sastry Hari
01 May 2021
01 May 2021

Evaluating the Impact of Mixed-Precision on Fault Propagation for Deep Neural Networks on GPUs
Fernando Fernandes Dos Santos ... Olivier Sentieys
-
Fernando Fernandes Dos Santos, et. al.Fernando Fernandes Dos Santos ... Olivier Sentieys
01 Jul 2022
01 Jul 2022

DGEMM Using Tensor Cores, and Its Accurate and Reproducible Versions
Daichi Mukunoki ... Takeshi Ogita
-
Daichi Mukunoki, et. al.Daichi Mukunoki ... Takeshi Ogita
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Impact of Tensor Cores and Mixed Precision on the Reliability of Matrix Multiplication in GPUs

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Nuclear Science