High Resource Utilization Auto-Scaling Algorithms for Heterogeneous Container Configurations

Yi-Lin Cheng, Jan-Jan Wu, Ching-Chi Lin, Pangfeng Liu

doi:10.1109/icpads.2017.00030

Abstract

Auto-scaling is a technique that allocates resources according to a dynamic workload. This paper focuses on auto-scaling with heterogeneous container configurations. The goal is to minimize the cost of container adjustments and the resource insufficiency penalty while maintaining high resource utilization. Achieving the minimal cost is extremely difficult without knowing future workloads in advance. We therefore first propose a dynamic programming algorithm that scales optimally when given the future workload. This optimal solution serves as the baseline for evaluating algorithms that do not have future workload information. We then propose two greedy algorithms that need no workload information in advance, and a heuristic algorithm that first predicts the workload of the next time step using Gradient Boosting Regression and then makes scaling decisions using the optimal dynamic programming algorithm. We evaluate these four algorithms on two realistic workload traces. The experiments show that when the cost of starting new servers is much higher than the resource insufficiency penalty, our short-term prediction approach increases the total cost by only 9.6% and decreases utilization by only 10% compared with the optimal dynamic programming algorithm that knows the future workload.
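The idea of an offline dynamic program that plans scaling decisions from a known future workload can be illustrated with a minimal sketch. The cost model below is an assumption made for illustration only and is not the paper's formulation: each configuration is reduced to a single capacity number, switching configurations costs a flat `adjust_cost`, unmet demand is charged `shortage_penalty` per unit, and unused capacity is charged `idle_cost` per unit as a stand-in for the utilization goal. The function name `plan_scaling` and all of its parameters are hypothetical.

```python
from typing import List, Sequence, Tuple


def plan_scaling(workload: Sequence[float],
                 capacities: Sequence[float],
                 adjust_cost: float,
                 shortage_penalty: float,
                 idle_cost: float,
                 start: int = 0) -> Tuple[float, List[int]]:
    """Return (minimal total cost, configuration index chosen per time step).

    Assumed cost model (illustrative, not taken from the paper):
      - flat adjust_cost whenever the configuration changes,
      - shortage_penalty per unit of demand above capacity,
      - idle_cost per unit of capacity above demand.
    """
    n = len(capacities)
    inf = float("inf")

    def step_cost(config: int, demand: float) -> float:
        cap = capacities[config]
        shortage = max(0.0, demand - cap)
        idle = max(0.0, cap - demand)
        return shortage_penalty * shortage + idle_cost * idle

    # best[c]: minimal cost of serving all steps so far, ending in config c.
    best = [0.0 if c == start else inf for c in range(n)]
    back: List[List[int]] = []  # back[t][c]: predecessor of c at step t

    for demand in workload:
        new_best = [inf] * n
        choice = [0] * n
        for c in range(n):
            for p in range(n):
                if best[p] == inf:
                    continue
                switch = adjust_cost if p != c else 0.0
                cost = best[p] + switch + step_cost(c, demand)
                if cost < new_best[c]:
                    new_best[c] = cost
                    choice[c] = p
        back.append(choice)
        best = new_best

    # Walk the predecessor table backwards to recover the plan.
    end = min(range(n), key=best.__getitem__)
    total = best[end]
    plan: List[int] = []
    c = end
    for choice in reversed(back):
        plan.append(c)
        c = choice[c]
    plan.reverse()
    return total, plan


# Hypothetical trace with a short burst; scale up for the burst, then back down.
print(plan_scaling([1, 1, 7, 7, 1], [2, 4, 8],
                   adjust_cost=3, shortage_penalty=5, idle_cost=1))
# -> (11.0, [0, 0, 2, 2, 0])
```

With a much larger `adjust_cost`, the same sketch prefers to stay in the initial configuration and absorb the shortage penalty, which mirrors the trade-off between adjustment cost and insufficiency penalty that the abstract describes.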

