Highly-reliable integer matrix multiplication via numerical packing

Ijeoma Anarado,Yiannis Andreopoulos,Davide Anastasia,Mohammad Ashraful Anam,Fabio Verdicchio

doi:10.1109/iolts.2013.6604045

Abstract

The generic matrix multiply (GEMM) routine comprises the compute and memory-intensive part of many information retrieval, relevance ranking and object recognition systems. Because of the prevalence of GEMM in these applications, ensuring its robustness to transient hardware faults is of paramount importance for highly-efficientlhighly-reliable systems. This is currently accomplished via error control coding (ECC) or via dual modular redundancy (DMR) approaches that produce a separate set of “parity” results to allow for fault detection in GEMM. We introduce a third family of methods for fault detection in integer matrix products based on the concept of numerical packing. The key difference of the new approach against ECC and DMR approaches is the production of redundant results within the numerical representation of the inputs rather than as a separate set of parity results. In this way, high reliability is ensured within integer matrix products while allowing for: (i) in-place storage; (ii) usage of any off-the-shelf 64-bit floating-point GEMM routine; (iii) computational overhead that is independent of the GEMM inner dimension. The only detriment against a conventional (i.e. fault-intolerant) integer matrix multiplication based on 32-bit floating-point GEMM is the sacrifice of approximately 30.6% of the bitwidth of the numerical representation. However, unlike ECC methods that can reliably detect only up to a few faults per GEMM computation (typically two), the proposed method attains more than “12 nines” reliability, i.e. it will only fail to detect 1 fault out of more than 1 trillion arbitrary faults in the GEMM operations. As such, it achieves reliability that approaches that of DMR, at a very small fraction of its cost. Specifically, a single-threaded software realization of our proposal on an Intel i7-3632QM 2.2GHz processor (Ivy Bridge architecture with AVX support) incurs, on average, only 19% increase of execution time against an optimized, fault-intolerant, 32-bit GEMM routine over a range of matrix sizes and it remains more than 80% more efficient than a DMR-based GEMM.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Highly-reliable integer matrix multiplication via numerical packing

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Mixed-mode multicore reliability
Philip M Wells ... Gurindar S Sohi
-
Philip M Wells, et. al.Philip M Wells ... Gurindar S Sohi
07 Mar 2009
07 Mar 2009

Mixed-mode multicore reliability
Philip M Wells ... Gurindar S Sohi
ACM SIGPLAN Notices | VOL. 44
Philip M Wells, et. al.Philip M Wells ... Gurindar S Sohi
28 Feb 2009
ACM SIGPLAN Notices | VOL. 44

Mixed-mode multicore reliability
Philip M Wells ... Koushik Chakraborty
ACM SIGARCH Computer Architecture News | VOL. 37
Philip M Wells, et. al.Philip M Wells ... Koushik Chakraborty
01 Mar 2009
ACM SIGARCH Computer Architecture News | VOL. 37

Timing diversity as a protective mechanism
Mischa Möstl ... Anika Christmann
-
Mischa Möstl, et. al.Mischa Möstl ... Anika Christmann
30 Sep 2021
30 Sep 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Highly-reliable integer matrix multiplication via numerical packing

Abstract

Talk to us

Similar Papers