Fail-Awareness: An Approach to Construct Fail-Safe Systems

Christof Fetzer, Flaviu Cristian

doi:10.1023/a:1021730519625

Abstract

We present a framework for building fail-safe hard real-time applications in timed asynchronous distributed systems subject to communication partitions and to performance, omission, and crash failures. Most distributed systems built from commercial-off-the-shelf (COTS) processor and communication services are subject to such partitions because their COTS components do not provide hard real-time guarantees. Custom-designed systems can also be subject to partitions because of unmaskable link or router failures. The basic assumption behind our approach is that each processor has a local hardware clock that proceeds within a linear envelope of real time. This allows one to compute an upper bound on the actual delay incurred by a particular processing sequence or message transmission. Services and applications can use these computed bounds to detect when they cannot guarantee all their standard properties because of excessive delays. This allows an application to be fail-aware, that is, to detect when it cannot guarantee all its safety properties and, in particular, to detect when to switch to a fail-safe mode.
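
The idea of bounding delays with a drifting local clock can be illustrated with a small sketch. This is not the authors' implementation; it is a minimal example under stated assumptions. It assumes the "linear envelope" means that a clock interval dH and the corresponding real-time interval dt satisfy (1 - rho) * dt <= dH <= (1 + rho) * dt for a known maximum drift rate rho. The names RHO, max_real_elapsed, min_real_elapsed, max_one_way_delay, and select_mode, as well as the round-trip scheme, are hypothetical and chosen only for illustration.

```python
# Minimal sketch, assuming a hardware clock whose drift rate is bounded by rho:
#   (1 - rho) * dt <= dH <= (1 + rho) * dt
# All names and values below are illustrative assumptions, not from the paper.

RHO = 1e-4  # assumed maximum drift rate of the local hardware clock


def max_real_elapsed(clock_start, clock_end, rho=RHO):
    """Upper bound on the real time that passed between two clock readings."""
    if clock_end < clock_start:
        raise ValueError("clock readings must be non-decreasing")
    # dH >= (1 - rho) * dt  implies  dt <= dH / (1 - rho)
    return (clock_end - clock_start) / (1.0 - rho)


def min_real_elapsed(clock_start, clock_end, rho=RHO):
    """Lower bound on the real time that passed between two clock readings."""
    if clock_end < clock_start:
        raise ValueError("clock readings must be non-decreasing")
    # dH <= (1 + rho) * dt  implies  dt >= dH / (1 + rho)
    return (clock_end - clock_start) / (1.0 + rho)


def max_one_way_delay(sent, reply_received, remote_received, remote_replied,
                      rho=RHO):
    """Upper bound on a request's transmission delay from a round trip.

    `sent` and `reply_received` are read from the sender's clock;
    `remote_received` and `remote_replied` are read from the receiver's clock.
    Only differences on the same clock are used, so the two clocks need not
    be synchronized. The reply's own delay is assumed to be non-negative.
    """
    round_trip_max = max_real_elapsed(sent, reply_received, rho)
    remote_hold_min = min_real_elapsed(remote_received, remote_replied, rho)
    return round_trip_max - remote_hold_min


def select_mode(delay_bound, deadline):
    """Stay in normal mode only if the computed bound meets the deadline."""
    return "normal" if delay_bound <= deadline else "fail-safe"
```

A caller would compute a bound after each processing sequence or message exchange and pass it to select_mode together with the deadline that its safety properties depend on. Because the bound is an overestimate, this check may switch to fail-safe mode unnecessarily, but under the stated clock assumption it does not stay in normal mode after the deadline has actually been exceeded.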

Journal: Real-Time Systems
Publication Date: Jan 1, 2003
Citations: 40

Similar Papers

Guest Editor's Introduction: Special section on dependable distributed systems
Christof Fetzer
Distributed Systems Engineering | Vol. 6
01 Sep 1999

DESIGN OF WRAPPER FOR SELF-MANAGEMENT OF COTS COMPONENTS
Michael E Shin ... Fernando Paniagua
International Journal of Software Engineering and Knowledge Engineering | Vol. 19
01 Jun 2009

An Empirical Study on Off-the-Shelf Component Usage in Industrial Projects
Jingyue Li ... Maurizio Morisio
01 Jan 2004

A fail-safe infrastructure designed for COTS component used in safety critical system
Xi Wang ... Lianchuan Ma
01 Oct 2012
