Measurement-Based Analysis of System Dependability Using Fault Injection and Field Failure Data

Ravishankar K Iyer,Zbigniew Kalbarczyk

doi:10.1007/3-540-45798-4_13

Abstract

The discussion in this paper focuses on the issues involved in analyzing the availability of networked systems using fault injection and the failure data collected by the logging mechanisms built into the system. In particular we address: (1) analysis in the prototype phase using physical fault injection to an actual system. We use example of fault injection-based evaluation of a software-implemented fault tolerance (SIFT) environment (built around a set of self-checking processes called ARMORS) that provides error detection and recovery services to spaceborne scientific applications and (2) measurement-based analysis of systems in the field. We use example of LAN of Windows NT based computers to present methods for collecting and analyzing failure data to characterize network system dependability. Both, fault injection and failure data analysis enable us to study naturally occurring errors and to provide feedback to system designers on potential availability bottlenecks. For example, the study of failures in a network of Windows NT machines reveals that most of the problems that lead to reboots are software related and that though the average availability evaluates to over 99%, a typical machine, on average, provides acceptable service only about 92% of the time.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Measurement-Based Analysis of System Dependability Using Fault Injection and Field Failure Data

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

An experimental evaluation of the REE SIFT environment for spaceborne applications
K. Whisnant ... D. Rennels
-
K. Whisnant, et. al.K. Whisnant ... D. Rennels
23 Jun 2002
23 Jun 2002

Improving processor reliability using software protection techniques.

-

30 Apr 2020
30 Apr 2020

Measurement-Based Analysis of Networked System Availability
Ravishankar K Iyer ... Mahesh Kalyanakrishnan
-
Ravishankar K Iyer, et. al.Ravishankar K Iyer ... Mahesh Kalyanakrishnan
01 Jan 1999
01 Jan 1999

The effects of an armor-based sift environment on the performance and dependability of user applications
K. Whisnant ... Z.T. Kalbarczyk
IEEE Transactions on Software Engineering | VOL. 30
K. Whisnant, et. al.K. Whisnant ... Z.T. Kalbarczyk
01 Apr 2004
IEEE Transactions on Software Engineering | VOL. 30

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Measurement-Based Analysis of System Dependability Using Fault Injection and Field Failure Data

Abstract

Talk to us

Similar Papers