Abstract

Many-core systems are characterized by a large number of components based on ever-shrinking circuit geometries. System reliability becomes an issue because of the system complexity, the large number of components and nanoscale issues due to soft errors. While information redundancy techniques can be used for fault tolerance, they occupy too much memory space and increase the memory and network bandwidth. Moreover, in many-cores, resources are plentiful encouraging the design of simple cores without hardware fault tolerance. Thus in the absence of information redundancy, software fault detection techniques become necessary to detect errors. Herein, we present fault detection techniques for 2×2 matrix multiplication which we extend to nxn matrix multiplication. These tests can detect transient and some intermittent and permanent hardware faults. These tests are also suitable to computing grids and distributed heterogeneous systems where the result-forming node may run tests in software to validate the sub-results submitted by the grid nodes.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.