Rhythm

Laiping Zhao, Keqiu Li, Kaixuan Zhang, Tie Qiu, Yanan Yang, Xiaobo Zhou, Yungang Bao

doi:10.1145/3342195.3387534

Abstract

Cloud service providers improve resource utilization by co-locating latency-critical (LC) workloads with best-effort batch (BE) jobs in datacenters. However, they usually treat an LC workload as a whole when allocating resources to BE jobs and neglect the differing characteristics of the workload's components. This coarse-grained co-location method leaves significant room for improvement in resource utilization. Observing that different LC components differ in their ability to tolerate interference, we propose a new abstraction called Servpod, which is a collection of LC parts deployed together on the same physical machine, and show its merits for building a fine-grained co-location framework. The key idea is to differentiate the BE throughput launched with each LC Servpod, i.e., a Servpod with high interference tolerance can be deployed alongside more BE jobs. Based on Servpods, we present Rhythm, a co-location controller that maximizes resource utilization while guaranteeing the LC service's tail-latency requirement. It quantifies the interference tolerance of each Servpod by analyzing that Servpod's contribution to tail latency. We evaluate Rhythm using LC services in the form of containerized processes and microservices, and find that it can improve system throughput by 31.7%, CPU utilization by 26.2%, and memory bandwidth utilization by 34% while guaranteeing the SLA (service level agreement).
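The key idea, that a Servpod with higher interference tolerance can host more BE jobs, can be illustrated with a small sketch. This is not Rhythm's actual algorithm, which the abstract does not specify. The `Servpod` class, the `allocate_be_jobs` function, the percentage inputs, and the rule "tolerance = 100 minus tail-latency contribution" are all assumptions made purely for illustration.

```python
from dataclasses import dataclass


@dataclass
class Servpod:
    name: str
    # Hypothetical input: percentage of the LC service's tail latency
    # attributed to this Servpod (integers, summing to 100 across Servpods).
    tail_latency_contribution_pct: int


def allocate_be_jobs(servpods, total_be_jobs):
    """Split a BE job budget across Servpods by an assumed tolerance score."""
    # Assumption: tolerance is the complement of the contribution, so a
    # Servpod that contributes little to tail latency gets more BE jobs.
    tolerance = {p.name: 100 - p.tail_latency_contribution_pct for p in servpods}
    total = sum(tolerance.values())
    if total <= 0:
        return {p.name: 0 for p in servpods}
    # Floor each proportional share using integer arithmetic.
    shares = {n: total_be_jobs * t // total for n, t in tolerance.items()}
    # Hand any leftover jobs to the most tolerant Servpods first.
    leftover = total_be_jobs - sum(shares.values())
    for name in sorted(tolerance, key=tolerance.get, reverse=True)[:leftover]:
        shares[name] += 1
    return shares


# Made-up example values, not measurements from the paper.
pods = [
    Servpod("frontend", 10),
    Servpod("cache", 30),
    Servpod("database", 60),
]
plan = allocate_be_jobs(pods, total_be_jobs=10)
```

In this toy setup the `frontend` Servpod, which contributes least to tail latency, receives the largest share of BE jobs, and `database` receives the smallest. A real controller would also need to monitor tail latency at runtime and back off when the SLA is at risk, which this sketch does not attempt.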

Similar Papers

Component-distinguishable Co-location and Resource Reclamation for High-throughput Computing
Laiping Zhao ... Keqiu Li
ACM Transactions on Computer Systems | VOL. 42
13 Feb 2024

Energy-Aware VM Consolidation in Cloud Data Centers Using Utilization Prediction Model
Fahimeh Farahnakian ... Hannu Tenhunen
IEEE Transactions on Cloud Computing | VOL. 7
01 Apr 2019

Characteristics of Co-Allocated Online Services and Batch Jobs in Internet Data Centers: A Case Study From Alibaba Cloud
Congfeng Jiang ... Jian Wan
IEEE Access | VOL. 7
01 Jan 2019

Online VM Consolidation in Cloud Environments
Deafallah Alsadie ... Eidah J Alzahrani
01 Jul 2019
