Abstract

The unprecedented growth of data at geographically distributed locations coupled with tremendous improvement in networking capabilities over the last decade strongly motivate the need for efficient data management in wide-area network (WAN) environments such as peer-to-peer (P2P) networks and grids. In particular, data availability and performance demands on WAN applications are now greater than ever before. While replication has been traditionally used for maximizing both data availability and performance, this paper contends that replication schemes for traditional distributed environments (e.g., clusters) do not adequately address the requirements of WAN environments. Notably, issues such as node heterogeneity (in terms of processing capacity and available disk space for storing replicas), significant variations in bandwidth, lack of centralized control, lack of global knowledge, distributive ownership and scalability make replication in WAN environments significantly more challenging than in the case of traditional domains. Interestingly, these are fundamental issues which arise for replication in different types of WAN environments. This paper specifically focusses on replication in two representative WAN environments, namely P2P systems and grids, and discusses open research issues concerning replication in these two environments as well as our perspectives on these issues

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call