An improved logging and checkpointing scheme for recoverable distributed shared memory

Taesoon Park,Heon Y Yeom,Sungbok Cho

doi:10.1007/bfb0027781

Abstract

The distributed shared memory(DSM) system transforms an existing network of workstations to a powerful shared-memory parallel computer which could deliver superior price/performance. However, with more workstations engaged in the system and longer execution time, the probability of faults increases which could render the system useless. Several checkpointing and logging schemes have been proposed to enable the DSM system to continue work after transient failures. Using checkpoints, it is not necessary to roll back to the beginning of the process but the processes need to roll back to the latest checkpoint. The logging is introduced to further reduce the amount of rollback propagation on other related processes. Although logging makes the rollback propogation unnecessary, it introduces the overhead for the logging itself. If it is needed to log all the read/write operations, the logging overhead would be prohibitive. Moreover, some of the logging methods proposed earlier could result in incorrect recovery when processes synchronize using barriers. In this paper, we propose a novel logging scheme which greatly reduces the amount of logging by not loging all the pages accessed but logging only the pages which are invalidated. The performance our proposed scheme is analyzed using extensive simulation. Compared with two other schemes proposed earlier, our new logging scheme shows superior performance in various cases.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An improved logging and checkpointing scheme for recoverable distributed shared memory

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Performance evaluation of three logging schemes for a shared-nothing database server
Kam-Fai Wong
Simulation Practice and Theory | VOL. 6
Kam-Fai WongKam-Fai Wong
01 May 1998
Simulation Practice and Theory | VOL. 6

Efficient, Compromise Resilient and Append-Only Cryptographic Schemes for Secure Audit Logging
Attila A Yavuz ... Peng Ning
-
Attila A Yavuz, et. al.Attila A Yavuz ... Peng Ning
01 Jan 2012
01 Jan 2012

BAF: An Efficient Publicly Verifiable Secure Audit Logging Scheme for Distributed Systems
Attila Altay Yavuz ... Peng Ning
-
Attila Altay Yavuz, et. al.Attila Altay Yavuz ... Peng Ning
01 Dec 2009
01 Dec 2009

Lightweight logging and recovery for distributed shared memory over virtual interface architecture
Soyeon Park ... Youngjae Kim
-
Soyeon Park, et. al. Soyeon Park ... Youngjae Kim
13 Oct 2003
13 Oct 2003

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An improved logging and checkpointing scheme for recoverable distributed shared memory

Abstract

Talk to us

Similar Papers