Applicability of Generic Naming Services and Fault-Tolerant Metacomputing with FT-MPI

Dawid Kurzyniec, Tom Dhaene, Vaidy Sunderam, Jan Broeckhove, Graham Fagg, David Dewolfs

doi:10.1007/11557265_36

Publication Date: Sep 18, 2005

Citations: 11

Affiliation: University of Antwerp, Emory University

#Potential Single Point Of Failure #Name Service

Abstract

There is growing interest in deploying MPI over multiple, heterogeneous, and geographically distributed resources to perform very-large-scale computations. However, spreading a computation over more resources and a wider geographic area creates problems with interoperability and fault tolerance. FT-MPI offers an interesting way to add fault tolerance to MPI, but it suffers from interoperability limitations and potential single points of failure when crossing multiple administrative domains. We propose to overcome these limitations by adding “pluggability” for one potential single point of failure (the name service used by FT-MPI) and by combining FT-MPI with the H2O metacomputing framework.
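
To make the idea of a pluggable name service concrete, the following is a minimal Python sketch. It is not FT-MPI's or H2O's actual interface: the class names (`NameService`, `InMemoryNameService`, `ReplicatedNameService`), the `available` flag, and the endpoint string format are all hypothetical, chosen only to show how callers can depend on a generic lookup interface while the backend is swapped or replicated so that it is no longer a single point of failure.

```python
from abc import ABC, abstractmethod


class NameService(ABC):
    """Hypothetical plug-in interface: maps a name to an endpoint string."""

    @abstractmethod
    def register(self, name: str, endpoint: str) -> None:
        ...

    @abstractmethod
    def lookup(self, name: str) -> str:
        ...


class InMemoryNameService(NameService):
    """Toy backend; `available` is an assumption used to simulate an outage."""

    def __init__(self) -> None:
        self._table = {}
        self.available = True

    def register(self, name: str, endpoint: str) -> None:
        if not self.available:
            raise ConnectionError("name service unavailable")
        self._table[name] = endpoint

    def lookup(self, name: str) -> str:
        if not self.available:
            raise ConnectionError("name service unavailable")
        return self._table[name]  # raises KeyError for unknown names


class ReplicatedNameService(NameService):
    """Wraps several backends so that no single one is a point of failure."""

    def __init__(self, backends) -> None:
        self._backends = list(backends)

    def register(self, name: str, endpoint: str) -> None:
        # Write to every reachable backend; fail only if none accepted it.
        accepted = 0
        for backend in self._backends:
            try:
                backend.register(name, endpoint)
                accepted += 1
            except ConnectionError:
                continue
        if accepted == 0:
            raise ConnectionError("no name service backend reachable")

    def lookup(self, name: str) -> str:
        # Return the first answer from a backend that is up and knows the name.
        for backend in self._backends:
            try:
                return backend.lookup(name)
            except (ConnectionError, KeyError):
                continue
        raise KeyError(name)
```

With two backends registered, a lookup still succeeds after the first backend is marked unavailable, because the wrapper falls through to the second. Code written against `NameService` does not need to know which backend, or how many, sit behind it.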
