Control strategies for adaptive resource allocation in cloud computing

Tiago Salviano Calmon, Amit Bhaya, Oumar Diene, Jonathan Ferreira Passoni, Vinicius Michel Gottin, Eduardo Vera Sousa

doi:10.1016/j.ifacol.2020.12.1964


Open Access

https://doi.org/10.1016/j.ifacol.2020.12.1964


Abstract

Using a compute infrastructure efficiently to execute jobs while respecting Service Level Agreements (SLAs), and thereby guaranteeing Quality of Service (QoS), poses a number of challenges. One such challenge is that SLAs are set before a job executes, whereas the execution environment is subject to disturbances such as poor knowledge of actual resource requirements, demand peaks and hardware malfunctions, among others. Thus, with a fixed resource allocation, the manager of a shared computing environment risks violating user SLAs. Furthermore, the complexity of managing workload executions grows with the number of workloads, which calls for an automatic method to manage and control their execution. The execution time SLA is especially important in streaming scenarios such as web applications and continuous video processing, and is the focus of this paper. A method based on adaptive model predictive control (aMPC) is proposed here to adapt the amount of resources allocated to iterative workloads. The methodology is tested on Deep Learning workloads, in standalone and multi-workload versions. The results show that adaptive optimal control with a linearized model improves performance with respect to simpler control laws as well as reinforcement learning approaches.

Full Text