Learn-as-you-go with Megh: Efficient Live Migration of Virtual Machines

Debabrota Basu,Yang Hong,Haibo Chen,Stephane Bressan,Xiayang Wang

doi:10.1109/tpds.2019.2893648

Abstract

Cloud providers leverage live migration of virtual machines to reduce energy consumption and allocate resources efficiently in data centers. Each migration decision depends on three questions: when to move a virtual machine, which virtual machine to move and where to move it? Dynamic, uncertain, and heterogeneous workloads running on virtual machines make such decisions difficult. Knowledge-based and heuristics-based algorithms are commonly used to tackle this problem. Knowledge-based algorithms, such as MaxWeight scheduling algorithms, are dependent on the specifics and the dynamics of the targeted Cloud architectures and applications. Heuristics-based algorithms, such as MMT algorithms, suffer from high variance and poor convergence because of their greedy approach. We propose an online reinforcement learning algorithm called Megh. Megh does not require prior knowledge of the workload rather learns the dynamics of workloads as-it-goes. Megh models the problem of energy- and performance-efficient resource management during live migration as a Markov decision process and solves it using a functional approximation scheme. While several reinforcement learning algorithms are proposed to solve this problem, these algorithms remain confined to the academic realm as they face the curse of dimensionality. They are either not scalable in real-time, as it is the case of MadVM, or need an elaborate offline training, as it is the case of Q-learning. These algorithms often incur execution overheads which are comparable with the migration time of a VM. Megh overcomes these deficiencies. Megh uses a novel dimensionality reduction scheme to project the combinatorially explosive state-action space to a polynomial dimensional space with a sparse basis. Megh has the capacity to learn uncertain dynamics and the ability to work in real-time without incurring significant execution overhead. Megh is both scalable and robust. We implement Megh using the CloudSim toolkit and empirically evaluate its performance with the PlanetLab and the Google Cluster workloads. Experiments validate that Megh is more cost-effective, converges faster, incurs smaller execution overhead and is more scalable than MadVM and MMT. An empirical sensitivity analysis explicates the choice of parameters in experiments.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learn-as-you-go with Megh: Efficient Live Migration of Virtual Machines

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Parallel and Distributed Systems

Lead the way for us

Journal: IEEE Transactions on Parallel and Distributed Systems	Publication Date: Aug 1, 2019
Citations: 88

Similar Papers

Comparison of VM Live Migration Process Impacts on QoS of Web Server Hosted in Cloud
Matheus Torquato
International Journal of Computer Applications | VOL. 137
Matheus TorquatoMatheus Torquato
17 Mar 2016
International Journal of Computer Applications | VOL. 137

DLSM: Decoupled Live Storage Migration with Distributed Device Mapper Storage
Zhaoning Zhang ... Ziyang Li
-
Zhaoning Zhang, et. al.Zhaoning Zhang ... Ziyang Li
01 Apr 2014
01 Apr 2014

A Lightweight Model for Estimating Energy Cost of Live Migration of Virtual Machines
Anja Strunk
-
Anja StrunkAnja Strunk
01 Jun 2013
01 Jun 2013

Challenges and New Solutions for Live Migration of Virtual Machines in Cloud Computing Environments
Fei Zhang
-
Fei ZhangFei Zhang
21 Feb 2022
21 Feb 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learn-as-you-go with Megh: Efficient Live Migration of Virtual Machines

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Parallel and Distributed Systems