Abstract

Data replication is widely used to provide high data availability, and increase the performance of the distributed systems. Many replica control protocols have been proposed in distributed and grid environments that achieved both high performance and availability. However, the previously proposed protocols still require a bigger number of replicas for read and write operations which are not suitable for a large scale system such as data grid. In this paper, a new replica control protocol called Clusteringbased Hybrid (CBH) has been proposed for managing the data in grid environments. We analyzed the communication cost and data availability for the operations and compared CBH protocol with recently proposed replica control protocols called Dynamic Hybrid (DH) protocol and Diagonal Replication in 2D Mesh (DR2M) protocol. To evaluate CBH protocol, a simulation model was implemented using Java. Our results show that for the read operations, CBH protocol improves the performance of communication cost and data availability compared to the DH and DR2M protocols.

Highlights

  • Grid computing is a distributed network computing system that enables large scale resource sharing between machines distributed across many organizationsReceived: 19 March 2016 Acepted: 7 December 2016and over a wide area network (Foster et al, 2001; Krauter et al, 2002)

  • We propose a new replica control protocol called Clustering-based Hybrid (CBH) protocol for the grid environment

  • Diagonal Replication on 2D Mesh Protocol In Diagonal Replication on 2D Mesh structure (DR2M) nodes are organized in a two-dimensional 2D mesh structure (Latip et al, 2008; Latip et al, 2009)

Read more

Summary

INTRODUCTION

Over a wide area network (Foster et al, 2001; Krauter et al, 2002). In grid computing, data grid provides a scalable infrastructure to manage huge amounts of data and support data intensive applications (Chervenak et al, 2000; Abdullah et al, 2004, Yusof et al, 2012). Managing the large network and widely distributed data in the data grid is a challenging problem. One of the issues is data availability (Lamehamedi et al, 2003; Latip et al, 2014), because data is geographically distributed over large networks Another issue is communication cost, where cost can become expensive if the number of read and write operations is high (Choi & Youn, 2012; Latip et al, 2009). The basic property for any replica control protocol is to guarantee non-empty intersection between read and write quorums in order to maintain the consistency of the replicated data. Many replica control protocols have been proposed in distributed and grid environments which achieved both high performance and availability. CBH provides low communication cost as well as high availability

RELATED WORKS
46 Replication on 2D Mesh Protocol
Findings
Data Availability Analysis
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.