Auto-scaling Approach Research Articles

To cope with the rapidly changing requirements of container-based services in clouds, auto-scaling is an essential mechanism for adapting the number of provisioned resources to variable service workloads. However, recent auto-scaling approaches do not comprehensively consider variable workloads or hybrid auto-scaling for multi-type services. First, proactive approaches based on historical data are widely used to handle complex and variable workloads in advance. Their decision-making accuracy depends on the prediction algorithm, which is affected by anomalies, missing values, and errors in the historical workload data, and they cannot handle unexpected workloads. Second, trigger-based reactive approaches are seriously affected by workload fluctuation, which causes frequent invalid scaling of service resources. In addition, because scaling takes time, different scaling actions have different completion delays. Third, recent approaches ignore the different scaling times of hybrid scaling for multi-type services, which include both stateful and stateless services. In particular, when stateful services are scaled horizontally, the neglected long scaling time causes untimely supply and withdrawal of resources. Consequently, all three issues can degrade Quality of Service (QoS) and lead to inefficient resource utilization. This paper proposes a new hybrid auto-scaling approach for multi-type services that accounts for the impact of service scaling time on decision making. We combine a proactive scaling strategy with a reactive anomaly detection and correction mechanism. To make proactive decisions, an ensemble learning model with a structure-improved deep network is designed to predict the future workload.
Based on the predicted results and the scaling times of the different service types, auto-scaling decisions are made by a Deep Reinforcement Learning (DRL) model with a heterogeneous action space that integrates horizontal and vertical scaling actions. Meanwhile, the anomaly detection and correction mechanism detects and handles workload fluctuation and unexpected workloads. We evaluate our approach against three different proactive and reactive auto-scaling approaches in a cloud environment, and the experimental results show that the proposed approach achieves better scaling behavior than state-of-the-art approaches.
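
The interplay between forecast, scaling time, and reactive correction described in this abstract can be illustrated with a small sketch. This is not the paper's ensemble predictor or DRL model: a simple ratio-based correction and a capacity division stand in for both. The names `Service`, `correct_forecast`, and `target_replicas`, the per-replica capacity, and the `tolerance` value are all hypothetical, introduced only for illustration.

```python
import math
from dataclasses import dataclass


@dataclass
class Service:
    name: str
    stateful: bool
    capacity_per_replica: float  # assumed: requests/s one replica can serve
    scaling_time_steps: int      # assumed: steps until a scaling action completes


def correct_forecast(forecast, observed_now, predicted_now, tolerance=0.2):
    """Reactive correction (stand-in): if the current observation deviates from
    the prediction by more than `tolerance`, rescale the whole forecast."""
    if predicted_now <= 0:
        return list(forecast)
    ratio = observed_now / predicted_now
    if abs(ratio - 1.0) <= tolerance:
        return list(forecast)
    return [value * ratio for value in forecast]


def target_replicas(service, forecast):
    """Proactive decision (stand-in): size the service for the workload expected
    at the step where a scaling action issued now would complete.
    forecast[k] is the predicted workload k steps ahead."""
    lead = min(service.scaling_time_steps, len(forecast) - 1)
    return max(1, math.ceil(forecast[lead] / service.capacity_per_replica))


# A stateless service is assumed to scale in 1 step, a stateful one in 3.
web = Service("web", stateful=False, capacity_per_replica=100.0, scaling_time_steps=1)
db = Service("db", stateful=True, capacity_per_replica=100.0, scaling_time_steps=3)

forecast = [150, 180, 260, 410, 390]
print(target_replicas(web, forecast), target_replicas(db, forecast))

# The observed load is 50% above the prediction, so the forecast is corrected.
corrected = correct_forecast(forecast, observed_now=225, predicted_now=150)
print(target_replicas(web, corrected), target_replicas(db, corrected))
```

Because the stateful service looks further ahead, it is sized for a later and, in this example, larger point of the forecast than the stateless one, which is the effect of scaling time that the abstract emphasizes.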

The Internet of Things is an emerging technology used in cloud computing and provides many cloud services. Cloud service users often suffer from service delays and disruptions caused by cloud resource management built on vertically and horizontally scalable systems. Adding more resources to a single cloud server is called vertical scaling, and increasing the number of servers is known as horizontal scaling. Service bursts significantly impact a vertically scaled environment, where scale-up degrades service quality and users' trust once the server reaches its maximum capacity. A horizontally scaled environment, though resilient, is cost-inefficient. It is also hard to detect and manage bursts online to sustain application efficiency for complex workloads. Burst detection in real-time workloads is a complicated issue because, even in the presence of auto-scaling methods, bursts can dramatically degrade the application's efficiency. This study presents a new burst-aware auto-scaling approach that detects bursts in dynamic workloads using resource estimation, scaling decision-making, and workload forecasting while reducing response time. It proposes a hybrid auto-scaled service-cloud model that ensures the best approximation of vertically and horizontally scalable systems to maintain Quality of Service (QoS) for smart-campus applications. Workload prediction and auto-scaling are carried out with an ensemble algorithm. The model pre-scales the vertically scalable system by leveraging service-load predictive modeling, using an ensemble classification of defined workload estimation. The prediction of the upcoming workload helped scale up the system, and auto-scaling dynamically scaled the assigned resources to serve many users' service requests.
The proposed model efficiently managed service bursts by addressing load-balancing challenges through horizontal auto-scaling to ensure application consistency and service availability. The study simulated the smart campus environment model to monitor time-stamped, diverse service requests arriving with different workloads.
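
The distinction between vertical and horizontal scaling, and the idea of reacting to bursts, can be sketched as follows. This is a minimal illustration under stated assumptions, not the study's ensemble method: a mean-plus-k-standard-deviations rule stands in for burst detection, and the functions `is_burst` and `plan_scaling`, the `load_per_cpu` figure, and the threshold `k` are hypothetical.

```python
import math
from statistics import mean, stdev


def is_burst(history, current, k=3.0):
    """Flag `current` as a burst if it exceeds mean + k * stdev of `history`."""
    if len(history) < 2:
        return False
    return current > mean(history) + k * stdev(history)


def plan_scaling(load, servers, cpu_per_server, cpu_max, load_per_cpu):
    """Scale up first (vertical); scale out (horizontal) only when a single
    server would need more CPUs than `cpu_max`.
    Returns (servers, cpu_per_server). This sketch never scales down."""
    needed_cpu = math.ceil(load / load_per_cpu)
    per_server = math.ceil(needed_cpu / servers)
    if per_server <= cpu_max:
        # Vertical scaling is enough: keep the server count, grow each server.
        return servers, max(cpu_per_server, per_server)
    # Vertical limit reached: cap each server and add servers instead.
    return max(servers, math.ceil(needed_cpu / cpu_max)), cpu_max


history = [100, 110, 95, 105, 90]
print(is_burst(history, 120), is_burst(history, 400))

# Moderate load fits on one larger server; a heavy burst forces scale-out.
print(plan_scaling(400, servers=1, cpu_per_server=2, cpu_max=8, load_per_cpu=100))
print(plan_scaling(2000, servers=1, cpu_per_server=2, cpu_max=8, load_per_cpu=100))
```

Trying vertical scaling first reflects the cost concern the abstract raises about horizontally scaled environments; a real policy would also weigh scale-down and instance pricing.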

Articles published on Auto-scaling Approach

Impact of Autoscaling on Application Performance in Cloud Environments

Learning-driven hybrid scaling for multi-type services in cloud

A Q-learning based auto-scaling approach for provisioning big data analysis services in cloud environments

An Auto-Scaling Approach for Microservices in Cloud Computing Environments

Graph neural networks for representing multivariate resource usage: A multiplayer mobile gaming case-study

Scalable Edge Computing Environment Based on the Containerized Microservices and Minikube

Cost-efficient auto-scaling of container-based elastic processes

Intelligent Horizontal Autoscaling in Edge Computing using a Double Tower Neural Network

Hybrid Auto-Scaled Service-Cloud-Based Predictive Workload Modeling and Analysis for Smart Campus System

HSOP: A Hybrid Service Orchestration Platform for Internet-Telephony Networks

Optimizing the performance of optimization in the cloud environment–An intelligent auto-scaling approach

IBrownout: An Integrated Approach for Managing Energy and Brownout in Container-Based Clouds

Auto-Scaling Approach for Cloud based Mobile Learning Applications

WITHDRAWN: A fuzzy auto-scaling approach using workload prediction for MMOG application in a cloud environment

Automatic Scaling Hadoop in the Cloud for Efficient Process of Big Geospatial Data

NASLA: Novel Auto Scaling Approach based on Learning Automata for Web Application in Cloud Computing Environment

A cost-aware auto-scaling approach using the workload prediction in service clouds
