Analyzing Reliability of Memory Sub-systems with Double-Chipkill Detect/Correct

Xun Jian,Vilas Sridharan,Nathan Debardeleben,Rakesh Kumar,Sean Blanchard

doi:10.1109/prdc.2013.18

Abstract

Chip kill correct is an advanced type of error correction used in memory sub-systems. Existing analytical approaches for modeling the reliability of memory sub-systems with chipkillcorrect are limited to those with chip kill-correct solutions that guarantee correction of errors in a single DRAM device. However, stronger chip kill correct solutions that are capable of guaranteeing the detection and even correction of errors in up to two DRAM devices have become common in existing HPC systems. Analytical reliability models are needed for such memory subsystems. This paper proposes analytical models for the reliability of double-chipkill detect and/or correct. Validation against Monte Carlo simulations shows that the output of our analytical models are within 3.9% of Monte Carlo simulations, on average. We used the analytical models to study various aspects of the reliability of memory sub-systems protected by double-chip kill detect and/or correct. Our studies provide several insights into the dependence of reliability of these systems on scale, device fault rate, memory organization, and memory-scrubbing policy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Analyzing Reliability of Memory Sub-systems with Double-Chipkill Detect/Correct

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

SU‐E‐J‐145: Validation of An Analytical Model for in Vivo Range Verification Using GATE Monte Carlo Simulation in Proton Therapy
C Lee ... K Chuang
Medical Physics | VOL. 42
C Lee, et. al.C Lee ... K Chuang
01 Jun 2015
Medical Physics | VOL. 42

SU-E-I-03: Scatter and Beam Hardening Correction for μCBCT Scanners Using Monte Carlo.
W Volken ... M.A Zulliger
Medical physics | VOL. 39
W Volken, et. al.W Volken ... M.A Zulliger
01 Jun 2012
Medical physics | VOL. 39

Analytical Reliability Models and Their Application for Planning and Optimisation of Telecommunication Networks
Elmira Yu Kalimulina
SSRN Electronic Journal | VOL. -
Elmira Yu KalimulinaElmira Yu Kalimulina
11 Feb 2016
SSRN Electronic Journal | VOL. -

Review and application of Artificial Neural Networks models in reliability analysis of steel structures
A.A Chojaczyk ... C Guedes Soares
Structural Safety | VOL. 52
A.A Chojaczyk, et. al.A.A Chojaczyk ... C Guedes Soares
11 Oct 2014
Structural Safety | VOL. 52

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Analyzing Reliability of Memory Sub-systems with Double-Chipkill Detect/Correct

Abstract

Talk to us

Similar Papers