QuickDedup: Efficient VM deduplication in cloud computing environments

Shweta Saharan,Gaurav Somani,Gaurav Gupta,Robin Verma,Manoj Singh Gaur,Rajkumar Buyya

doi:10.1016/j.jpdc.2020.01.002

Abstract

Deduplication is one of the major storage optimisation techniques for Virtual Machines (VMs) in cloud environment. Usually, hashing of blocks helps in identifying duplicate data blocks. This paper proposes a novel deduplication approach, QuickDedup that reduces the overall deduplication time, metadata overhead and the number of hash computations, and subsequent comparisons for the VM disk images. In addition to minimising the deduplication related metadata, which is a necessary by-product useful in checking deduplication, QuickDedup, follows novel byte comparison scheme to prepare various block classes. This way, QuickDedup eliminates or minimises the need for hash calculation and subsequent comparisons. QuickDedup performs the calculation and comparisons of hashes within the respective categories only. QuickDedup saves the space required for hash storage during deduplication and makes deduplication of VM disk images much faster. We conducted a detailed evaluation of QuickDedup on various metrics with different kinds and sizes of VM images taken from publicly available datasets. The evaluation results show a substantial improvement of up to 96% in the overall deduplication time required to deduplicate VM images apart from significant savings in metadata and storage overhead.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

QuickDedup: Efficient VM deduplication in cloud computing environments

Abstract

Talk to us

Similar Papers

More From: Journal of Parallel and Distributed Computing

Lead the way for us

Journal: Journal of Parallel and Distributed Computing	Publication Date: Jan 28, 2020
Citations: 22

Similar Papers

Clustering-based acceleration for virtual machine image deduplication in the cloud environment
Jiwei Xu ... Tao Huang
The Journal of Systems & Software | VOL. 121
Jiwei Xu, et. al.Jiwei Xu ... Tao Huang
01 Apr 2016
The Journal of Systems & Software | VOL. 121

The Performance Analysis of GlusterFS In Virtual Storage
Cheng Zhang ... Xiaodong Li
-
Cheng Zhang, et. al.Cheng Zhang ... Xiaodong Li
01 Jan 2015
01 Jan 2015

Digital Watermarking of Virtual Machine Images
Kumiko Tadano ... Masahiro Kawato
-
Kumiko Tadano, et. al.Kumiko Tadano ... Masahiro Kawato
01 Jan 2009
01 Jan 2009

Virtual machine images as structured data: the mirage image library
...
-
, et. al. ...
14 Jun 2011
14 Jun 2011

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

QuickDedup: Efficient VM deduplication in cloud computing environments

Abstract

Talk to us

Similar Papers

More From: Journal of Parallel and Distributed Computing