Reliability-Aware Runtime Adaption Through a Statically Generated Task Schedule

Laura Rozo,Aaron Myles Landwehr,Yan Zheng,Chengmo Yang,Guang Gao

doi:10.1109/tvlsi.2017.2753242

Laura Rozo, Aaron Myles Landwehr + Show 3 more

Open Access

https://doi.org/10.1109/tvlsi.2017.2753242

Copy DOI

Abstract

Device scaling, increasing number of components in a single chip, varying environmental issues, and aging effects have brought severe reliability challenges that impose tight constraints on the operation of a system. To cope with these challenges, this paper proposes a reliability-aware scheduling framework that combines static and dynamic analyses to improve the overall system resiliency to different kinds of faults (i.e., intermittent, transient, and permanent). The static analysis technique employs genetic algorithms to optimize the overall system reliability by considering reliability level (RL) as an intermediate scheduling dimension and creating a task-to-RL mapping. This enables the RL-to-core mapping to be efficiently adapted at runtime according to fault rate variations, while the task-to-RL mapping can still be reused. The dynamic analysis tracks faults appearing in each core and measures the time correlation of those faults to update the RL-to-core mapping. The proposed reliability-aware framework is implemented in a state-of-the-art runtime system, Delaware Adaptive Run-Time System, so as to quantitatively show the advantages of using the overall framework in existing multicore platforms. Experimental results show that the proposed technique delivers up to 30% improvement in application execution time and up to 72% improvement in faults occurring at runtime.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Transactions on Very Large Scale Integration (VLSI) Systems	Publication Date: Jan 1, 2018
Citations: 26	License type: publisher-specific-oa

R Discovery Prime

R Discovery Prime

Reliability-Aware Runtime Adaption Through a Statically Generated Task Schedule

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Very Large Scale Integration (VLSI) Systems

Lead the way for us

Similar Papers

Malware Classification Using Probability Scoring and Machine Learning
Di Xue ... Weifei Wu
IEEE Access | VOL. 7
Di Xue, et. al.Di Xue ... Weifei Wu
01 Jan 2019
IEEE Access | VOL. 7

On-demand Connection Management for OpenSHMEM and OpenSHMEM+MPI
Sourav Chakraborty ... Hari Subramoni
-
Sourav Chakraborty, et. al.Sourav Chakraborty ... Hari Subramoni
01 May 2015
01 May 2015

Combined Static and Dynamic Analysis
Cyrille Artho ... Armin Biere
Electronic Notes in Theoretical Computer Science | VOL. 131
Cyrille Artho, et. al.Cyrille Artho ... Armin Biere
01 May 2005
Electronic Notes in Theoretical Computer Science | VOL. 131

Combining Dynamic and Static Analysis for Malware Detection
Anusha Damodaran
-
Anusha DamodaranAnusha Damodaran
18 Apr 2019
18 Apr 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Reliability-Aware Runtime Adaption Through a Statically Generated Task Schedule

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Very Large Scale Integration (VLSI) Systems