Auto-scaling Solutions Research Articles

Kubernetes is an open-source container orchestration system that provides a built-in module for dynamic resource provisioning, the Horizontal Pod Autoscaler (HPA). The HPA determines the number of resources to provision by calculating the ratio between the current and target utilisation metrics. The target utilisation metric, or threshold, directly affects how many resources are provisioned and how quickly. However, determining a threshold that satisfies performance-based Service Level Objectives (SLOs) is a long, error-prone process because the threshold is static and must be configured manually. This can result in underprovisioning or overprovisioning, leading to SLO violations or an inadequate allocation of computing resources. Numerous autoscaling solutions have been introduced as alternatives to the HPA to simplify the process. However, the HPA is still the most widely used solution owing to its ease of setup and operation and its seamless integration with other Kubernetes functionality. The present study proposes a method that uses exploratory data analysis techniques along with moving average smoothing to identify the target utilisation threshold for the HPA. The objective is to ensure that the system operates without exceeding the maximum number of events that violate the response time defined in the SLO. A prototype was created to adjust the threshold values dynamically using the proposed method. This prototype enables the proposed method to be evaluated against the HPA configured with the highest threshold that meets the performance-based SLOs. The experimental results showed that the proposed method adjusts the thresholds to the desired service level with a 1–2% accuracy rate and only 4–10% resource overprovisioning, depending on the type of workload.
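The ratio-based rule mentioned in this abstract can be illustrated with a minimal Python sketch. It follows the replica formula given in the Kubernetes documentation, desired = ceil(current replicas × current metric / target metric), with the default 10% tolerance band. The function names and the trailing-window smoothing helper are hypothetical, chosen for illustration only; the abstract does not describe how the study's own method applies moving average smoothing, so this is not the authors' implementation.

```python
import math


def desired_replicas(current_replicas, current_utilisation,
                     target_utilisation, tolerance=0.1):
    """Replica count from the HPA ratio rule (illustrative helper name).

    No scaling happens while the ratio stays within the tolerance band
    around 1.0; 0.1 is the Kubernetes default tolerance.
    """
    ratio = current_utilisation / target_utilisation
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas
    return math.ceil(current_replicas * ratio)


def moving_average(values, window):
    """Simple trailing moving average (assumed smoothing, not the paper's)."""
    return [
        sum(values[i - window + 1:i + 1]) / window
        for i in range(window - 1, len(values))
    ]


if __name__ == "__main__":
    # Same load, two different thresholds: a lower target utilisation
    # provisions more replicas.
    print(desired_replicas(4, 90, 60))
    print(desired_replicas(4, 90, 45))
    # Smooth a noisy utilisation series before feeding it to the rule.
    print(moving_average([55, 70, 95, 60, 80], window=3))
```

The sketch shows why the threshold matters: with the observed utilisation held fixed, lowering the target raises the replica count, which is the trade-off between overprovisioning and SLO violations that the abstract describes.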

Elasticity in the cloud is essential to the effective management of computational resources, as it enables readjustment at runtime to meet application demands. Over the years, researchers and practitioners have proposed many auto-scaling solutions using techniques ranging from simple if-then-else rules to sophisticated methods based on optimisation, control theory and machine learning. However, despite the extensive body of existing elasticity research, implementing an efficient scaling technique that satisfies actual demand remains a challenge. Existing methods suffer from issues such as: (1) a lack of adaptability and static scaling behaviour in completely fixed approaches; (2) additional computational overhead, an inability to cope with sudden changes in workload behaviour, and a preference for adaptability over reliability at runtime in fully dynamic approaches; and (3) a failure to consider uncertainty when designing auto-scaling solutions. In this paper, we aim to address these issues using a holistic biologically-inspired feedback switch controller. This method utilises multiple controllers and a switching mechanism, implemented using a fuzzy system, that selects a suitable controller at runtime. The fuzzy system also facilitates the design of qualitative elasticity rules. Furthermore, to improve the chances of avoiding oscillatory behaviour (a problem commonly associated with switching methodologies), this paper integrates a biologically-inspired computational model of action selection. Lastly, we identify seven different kinds of real workload patterns and use them to evaluate the performance of the proposed method against state-of-the-art approaches. The computational results demonstrate that the proposed method achieves better performance than the state-of-the-art approaches without incurring any additional cost.

Articles published on Auto-scaling Solutions

Auto-Scaling Techniques in Cloud Computing: Issues and Research Directions

SLA-Adaptive Threshold Adjustment for a Kubernetes Horizontal Pod Autoscaler

Automatic data featurization for enhanced proactive service auto-scaling: Boosting forecasting accuracy and mitigating oscillation

Optimized resource usage with hybrid auto-scaling system for knative serverless edge computing

Hybrid Autoscaling Strategy on Container-Based Cloud Platform

Web Application Resource Requirements Estimation Based on the Workload Latent Features

Design and evaluation of a biologically-inspired cloud elasticity framework

Adaptive Microservice Scaling for Elastic Applications

Multilayered Autoscaling Performance Evaluation: Can Virtual Machines and Containers Co–Scale?

The Effect of the Resource Consumption Characteristics of Cloud Applications on the Efficiency of Low-Metric Auto Scaling Solutions

DEPAS: a decentralized probabilistic algorithm for auto-scaling
