Abstract

Rising temperature is an unavoidable effect in VLSI circuits and has always been a critical issue in any system-on-chip, especially when targeting compute-intensive applications. This effect increases the delay of hardware accelerators, resulting in timing errors when the clock frequency can no longer be sustained; its impact must be carefully evaluated at design time to quantify the performance degradation of the hardware accelerator. Furthermore, hardware operating at higher temperature ages faster, which leads to even more timing errors. This issue is usually addressed by adding timing guardbands that compensate for the deleterious effects of temperature, ensuring the hardware accelerator operates within a reliable zone, i.e., without any timing errors caused by temperature effects at runtime. However, guardbands directly result in considerable performance and efficiency losses because the circuit is clocked at a frequency lower than its full potential. Accelerators on edge devices often dismiss such guardbands to exploit the full potential of the designed circuits, posing an enormous design challenge, as this approach requires a careful evaluation of the impact of timing errors on the quality of the target applications. Many algorithms, such as those in multimedia and machine learning applications, can tolerate hardware errors. Yet, these algorithms exhibit dynamic (i.e., closed-loop) behavior in which a timing error can propagate and affect subsequent steps. Measuring degradation-induced errors in these applications is very challenging, because an accurate gate-level simulation of degradation-induced timing errors must be coupled dynamically with a system-level simulator to reveal how errors induced in the underlying hardware ultimately impact the algorithm executing on the hardware accelerator. This is the first work to achieve this goal. State-of-the-art works have studied accelerators under timing errors when removing (or narrowing) guardbands; however, their approaches are suitable only for open-loop hardware accelerators that are entirely agnostic of the algorithms' complex interactions. Unlike prior work, this paper investigates temperature- and aging-induced timing errors in the joint accelerator-algorithm interaction and its runtime impact. Our framework investigates aging effects across the different layers, from transistor physics all the way up to the algorithm layer. The hardware accelerator employed as a case study in this work is the sum of absolute differences (SAD), the most compute-intensive accelerator in commercial video encoders for mobile applications. Our results demonstrate the runtime impact on three advanced block-matching algorithms of the video encoder operating jointly with a SAD accelerator under timing errors induced by temperature and aging effects in a 14 nm FinFET technology.
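
For readers unfamiliar with the kernel, a minimal software sketch of the SAD computation used in block matching is given below. The function names, 16x16 block size, and full-search window are illustrative assumptions for this sketch only and do not reflect the paper's accelerator implementation or the three block-matching algorithms it evaluates.

```c
#include <stdint.h>
#include <stdio.h>
#include <stdlib.h>

#define BLOCK 16  /* illustrative block size; the actual accelerator may differ */

/* Sum of absolute differences between the current block and a candidate
 * block in the reference frame. `stride` is the frame width in pixels. */
static uint32_t sad_block(const uint8_t *cur, const uint8_t *ref, int stride)
{
    uint32_t sad = 0;
    for (int y = 0; y < BLOCK; y++)
        for (int x = 0; x < BLOCK; x++)
            sad += (uint32_t)abs((int)cur[y * stride + x] - (int)ref[y * stride + x]);
    return sad;
}

/* Exhaustive (full-search) block matching over a +/- `range` pixel window:
 * returns the motion vector that minimizes the SAD. Advanced block-matching
 * algorithms prune this search, which is what makes them sensitive to how a
 * faulty SAD value steers subsequent search steps. */
static void full_search(const uint8_t *cur, const uint8_t *ref, int stride,
                        int range, int *best_dx, int *best_dy)
{
    uint32_t best = UINT32_MAX;
    for (int dy = -range; dy <= range; dy++) {
        for (int dx = -range; dx <= range; dx++) {
            uint32_t s = sad_block(cur, ref + dy * stride + dx, stride);
            if (s < best) { best = s; *best_dx = dx; *best_dy = dy; }
        }
    }
}

int main(void)
{
    const int W = 64, H = 64, range = 8;
    uint8_t *cur = calloc((size_t)W * H, 1);
    uint8_t *ref = calloc((size_t)W * H, 1);
    if (!cur || !ref) return 1;

    /* Match the block at interior position (24, 24) so the +/- range
     * window stays inside the 64x64 dummy frames. */
    int off = 24 * W + 24, dx = 0, dy = 0;
    full_search(cur + off, ref + off, W, range, &dx, &dy);
    printf("best motion vector: (%d, %d)\n", dx, dy);

    free(cur);
    free(ref);
    return 0;
}
```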
