Fault-Tolerance Issues of Local Area Multiprocessor (LAMP) Storage Subsystem

Qiang Li, Alex Tsukerman, Edward Hong

doi:10.1007/978-1-4615-5449-3_8

Abstract

This paper discusses the fault-tolerance issues of the Local Area Multiprocessor (LAMP) storage subsystem and presents its architecture design, error detection and recovery algorithms, and logical volume reconstruction procedure. LAMP is a network of workstations with shared physical memory; its basic communication protocol is load and store. The LAMP storage subsystem is developed for a class of distributed computing systems that 1) has distributed shared memory, 2) uses a low-latency, high-bandwidth interconnect, and 3) provides remote DMA support. The subsystem stripes data across multiple nodes for higher I/O performance and availability. It organizes logical volumes (virtual disks) to store files according to file size, data access pattern, and other criteria such as performance, availability, and security requirements. It implements RAID levels 0, 1, and 5 on a per-logical-volume basis. Write-ahead logging is used to log the data, metadata, and parity updates of a recovery unit, which allows the LAMP storage subsystem to perform fast error recovery. For rapid reconstruction of a failed logical volume, the LAMP logical volume reconstruction algorithm is implemented. Three main fault-tolerance issues of the LAMP storage subsystem are discussed in this paper: system configurability for fault tolerance and performance, fast error detection and recovery, and fast logical volume reconstruction.
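The abstract mentions RAID-5 logical volumes striped across nodes and the reconstruction of a failed volume. As a rough illustration of the general parity idea only (not the LAMP reconstruction algorithm, which the abstract does not describe), the sketch below builds one stripe with an XOR parity block and rebuilds any single lost block from the survivors. The function names `xor_blocks`, `make_stripe`, and `reconstruct` are hypothetical, and the sketch assumes equally sized blocks and a single stripe with the parity block stored last; real RAID-5 rotates the parity position across stripes.

```python
from functools import reduce


def xor_blocks(blocks):
    """Return the byte-wise XOR of a non-empty list of equally sized blocks."""
    if not blocks:
        raise ValueError("need at least one block")
    size = len(blocks[0])
    if any(len(b) != size for b in blocks):
        raise ValueError("all blocks must have the same size")
    # zip(*blocks) yields one tuple of byte values per byte position.
    return bytes(reduce(lambda x, y: x ^ y, column) for column in zip(*blocks))


def make_stripe(data_blocks):
    """Hypothetical helper: append an XOR parity block to the data blocks."""
    return list(data_blocks) + [xor_blocks(data_blocks)]


def reconstruct(stripe, failed_index):
    """Rebuild the block at failed_index by XOR-ing all surviving blocks."""
    survivors = [b for i, b in enumerate(stripe) if i != failed_index]
    return xor_blocks(survivors)


# One stripe spread over four (hypothetical) nodes: three data blocks plus parity.
stripe = make_stripe([b"node", b"data", b"lamp"])
rebuilt = reconstruct(stripe, 1)  # pretend the node holding block 1 failed
```

Because XOR is its own inverse, the same routine recovers either a data block or the parity block, which is why a RAID-5 stripe tolerates the loss of any one member.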
