Abstract

Cloud computing has become an important research area in large-scale computing systems and is being employed by many organizations in government, businesses, and industry. Schemes and appropriate models for dynamic resources provisioning in the cloud environment have been extensively studied. To date, the research literature is lacking schemes and models that offer dynamic scalability in which Quality of Service (QoS) and high performance are provided to customers with the usage of the least number of cloud resources, especially for containerized services hosted on the cloud. With dynamic scalability, cloud computing can offer on-demand, timely, and dynamically adjustable computing resources to services hosted on the cloud. This paper presents a dynamic scaling model based on queueing theory to scale containers virtual resources and satisfy the customer Service Level Agreements (SLA) while guarding costs of scaling very low. The aim is to improve the virtual computing resources utilization and satisfy SLA constraints in terms of CPU utilization, system response time, system drop rate, system number of tasks, and system throughput. Simulation results are provided using Java Modelling Tools simulation tool, which shows that our proposed model can determine under any offered workload the needed containers instances to satisfy the required QoS parameters.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call