Latency-Constrained Cost-Minimized Request Allocation for Geo-Distributed Cloud Services

Xinping Xu,Wenxin Li,Junxiao Wang,Keqiu Li,Heng Qi

doi:10.1109/ojcoms.2020.2964303

Abstract

Latency to end-users and regulatory requirements push cloud providers to operate many datacenters all around the globe to host their cloud services. An emerging problem under such geo-distributed architecture is to assign each user request to an appropriate datacenter to benefit both cloud providers (e.g., low bandwidth cost) and end-users (e.g., low latency)—known as request allocation. However, prior request allocation solutions have significant limitations: they either focus only on optimizing the benefits for one entity (e.g., providers or users), or ignore some practical yet indispensable factors (e.g., heterogeneous latency requirements of different users and diverse per unit bandwidth cost among different datacenters) when optimizing benefits for both entities. In this paper, we study the problem of minimizing the total bandwidth cost for cloud service providers while guaranteeing the latency requirement for end-users. Specifically, we formulate an integer programming with consideration of the diversities in both the delay of requests and per unit bandwidth cost of datacenters. To efficiently and practically solve this problem, we first relax the integer programming into a continuous convex optimization and then take the advantages of random sampling to enforce the solution to be a feasible one for the original integer programming. We have conducted rigorous theoretical analysis to prove that our algorithm can provide a considerable good competitive ratio. Extensive simulations demonstrate that our proposed algorithm can reduce the total bandwidth cost by 30% while guaranteeing the latency requirements of all requests, as compared to conventional methods.

Highlights

N OWADAYS, most cloud service providers (e.g., Google, Microsoft, Amazon) have deployed a geographically distributed infrastructure [1], [2], where datacenters are placed at different regions across the world to enhance application robustness and reduce user access delay at the same time
We study the problem of distributing user requests among geo-distributed datacenters to minimize the total bandwidth cost subject to the heterogeneous latency constraints from end-users
These results directly show that our proposed algorithm can significantly reduce the total bandwidth cost, when user requests are allocated across geographically distributed data centers

Summary

INTRODUCTION

N OWADAYS, most cloud service providers (e.g., Google, Microsoft, Amazon) have deployed a geographically distributed infrastructure [1], [2], where datacenters are placed at different regions across the world to enhance application robustness and reduce user access delay at the same time. In this problem, we take into account the diverse latency requirement of user requests, the heterogeneity on per unit bandwidth cost of different datacenters, as well as the bandwidth capacity of each datacenter’s upstream link. We develop the mathematical model and formulate an integer programming which considers the diverse per unit bandwidth cost across different datacenters To solve this problem efficiently, we further develop an algorithm that seamlessly combines the techniques of convex optimization and random sampling, which has a good competitive ratio in minimizing total bandwidth cost. We conduct extensive simulations to evaluate the performance of our proposed algorithm, in terms of reducing total bandwidth cost for cloud providers and guaranteeing latency requirements of end-users.

MODELING AND PROBLEM FORMULATION

ANALYSIS

SIMULATION RESULTS Total bandwidth cost

RELATED WORK

Findings

CONCLUSION

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Open Journal of the Communications Society	Publication Date: Jan 1, 2020
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Latency-Constrained Cost-Minimized Request Allocation for Geo-Distributed Cloud Services

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Open Journal of the Communications Society

Lead the way for us

Similar Papers

Cloud provider selection models for cloud storage services to meet availability requirements
Eiji Oki ... Takashi Kurimoto
-
Eiji Oki, et. al.Eiji Oki ... Takashi Kurimoto
01 Jan 2017
01 Jan 2017

An Empirical Analysis on Quality of Service (QoS) in Cloud Computing
J Amudhavel ... K Prem Kumar
Indian Journal of Science and Technology | VOL. 9
J Amudhavel, et. al.J Amudhavel ... K Prem Kumar
30 Jun 2016
Indian Journal of Science and Technology | VOL. 9

Online Cost-effective Edge Service Renting for Content Providers in Cloud and Edge Environments
Zizhe Jin ... Shijun Liu
-
Zizhe Jin, et. al.Zizhe Jin ... Shijun Liu
01 Sep 2021
01 Sep 2021

Cloud services as the ultimate gate(keeper)
Björn Lundqvist
Journal of Antitrust Enforcement | VOL. 7
Björn LundqvistBjörn Lundqvist
28 Jun 2019
Journal of Antitrust Enforcement | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Latency-Constrained Cost-Minimized Request Allocation for Geo-Distributed Cloud Services

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Open Journal of the Communications Society