Fault Tolerant File Models for MPI-IO Parallel File Systems

A Calderón,Rainer Keller,Alexander Schulz,F García-Carballeira,Florin Isailǎ

doi:10.1007/978-3-540-75416-9_25

Abstract

Parallelism in file systems is obtained by using several independent server nodes supporting one or more secondary storage devices. This approach increases the performance and scalability of the system, but a fault in one single node can make the whole system fail. In order to avoid this problem, data must be stored using some kind of redundant technique, so that it can be recovered in case of failure. Fault tolerance can be provided in I/O systems by using replication or RAID based schemes. However, most of the current systems apply the same technique of fault tolerant at disk or file system level.This paper describes how fault tolerance support can be used by MPI applications based on PVFS version 2 [1], a well-know parallel file system for clusters. This support can be applied to other parallel file systems with many benefits: fault tolerance at file level, flexible definition of new fault tolerance scheme, and dynamic reconfiguration of the fault tolerance policy.KeywordsParallel File Systemclustersfault-tolerancedata declusteringreliability

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Fault Tolerant File Models for MPI-IO Parallel File Systems

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Fault tolerant file models for parallel file systems: introducing distribution patterns for every file
A Calderón ... J Fernandez
The Journal of Supercomputing | VOL. 47
A Calderón, et. al.A Calderón ... J Fernandez
22 Apr 2008
The Journal of Supercomputing | VOL. 47

A Fault Tolerant MPI-IO Implementation using the Expand Parallel File System
A Calderon ... J.M Perez
-
A Calderon, et. al.A Calderon ... J.M Perez
09 Feb 2005
09 Feb 2005

A parallel and fault tolerant file system based on NFS servers
F Garcia ... J Fernandez
-
F Garcia, et. al.F Garcia ... J Fernandez
01 Jan 2003
01 Jan 2003

A Technique for Lock-Less Mirroring in Parallel File Systems
Bradley W Settlemyer ... Walter B Ligon Iii
-
Bradley W Settlemyer, et. al.Bradley W Settlemyer ... Walter B Ligon Iii
01 May 2008
01 May 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fault Tolerant File Models for MPI-IO Parallel File Systems

Abstract

Talk to us

Similar Papers