Abstract

One of the most significant potential advantages of multiprocessor computing systems is the possibility of graceful degradation in the presence of faults. Prerequisites for fault tolerance in multiprocessor systems are the ability to detect and locate node and link failures, the detection and prevention of system deadlocks, and algorithms for distributed checkpointing, rollover and rescheduling of the various program modules. This paper deals with these issues in the context of an alternative multiprocessing system, namely the dynamic dataflow system. The machine described in this paper employs a localised switching protocol, a special arbiter called the batching arbiter, and a new bus structure. Several novel actors have also been implemented to deal with the various issues of fault tolerance in the system. It is shown in the paper that our machine offers a considerable degree of fault tolerance when compared with most other dataflow systems proposed and/or built.
