Dataset replica placement strategy under a response time constraint in the cloud

Xiuguo Wu,Wei Su

doi:10.1504/ijitm.2019.10019106

Abstract

In cloud computing environment, especially data-intensive systems, large amounts of datasets are stored in distributed data centres, and are often retrieved by users in different regions. To reduce the users' response time, replicating the popular datasets to multiple suitable data centres is an advisable choice, as tasks can access the datasets from a nearby site. Nevertheless, the dataset replicas' suitable storage placement selection is still an important issue that should be solved urgently from the response time constraint view, for the reason that too many replicas are infeasible in practice. In this paper, we first propose a comprehensive dataset response time estimation model, then present a replica placement model based on Steiner tree. After that, an approximate replica placement algorithm under a response time constraint in the cloud is given using Kruskal minimum spanning tree. At last, a practical and reasonable performance evaluation is designed and implemented. Both the theoretical analysis and simulations conducted on general (random) datasets show the efficiency and effectiveness of the proposed strategy in the cloud.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Dataset replica placement strategy under a response time constraint in the cloud

Abstract

Talk to us

Similar Papers

More From: International Journal of Information Technology and Management

Lead the way for us

Similar Papers

Geographically distributed data management to support large-scale data analysis
Tamer Z Emara ... Joshua Zhexue Huang
Scientific Reports | VOL. 13
Tamer Z Emara, et. al.Tamer Z Emara ... Joshua Zhexue Huang
18 Oct 2023
Scientific Reports | VOL. 13

Workload Migration across Distributed Data Centers under Electrical Load Shedding
Linfeng Shen ... Jiangchuan Liu
-
Linfeng Shen, et. al.Linfeng Shen ... Jiangchuan Liu
25 Jun 2021
25 Jun 2021

Distributed Data Strategies to Support Large-Scale Data Analysis Across Geo-Distributed Data Centers
Tamer Z Emara ... Joshua Zhexue Huang
IEEE Access | VOL. 8
Tamer Z Emara, et. al.Tamer Z Emara ... Joshua Zhexue Huang
01 Jan 2020
IEEE Access | VOL. 8

Towards building a multi‐datacenter infrastructure for massive remote sensing image processing
Wanfeng Zhang ... Dingsheng Liu
Concurrency and Computation: Practice and Experience | VOL. 25
Wanfeng Zhang, et. al.Wanfeng Zhang ... Dingsheng Liu
02 Jan 2013
Concurrency and Computation: Practice and Experience | VOL. 25

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dataset replica placement strategy under a response time constraint in the cloud

Abstract

Talk to us

Similar Papers

More From: International Journal of Information Technology and Management