Abstract

Data deduplication is widely used to reduce the storage requirements of virtual machine (VM) images hosted on VM servers in virtualized cloud platforms. However, existing state-of-the-art deduplication approaches for VM images cannot fully exploit the potential of the underlying hardware while limiting the interference of deduplication with foreground VM services, which can degrade service quality. In this paper, we present HPDV, a highly parallel deduplication cluster for VM images that exploits parallelism to achieve high throughput with minimal interference to foreground VM services. The main idea behind HPDV is to use idle CPU resources on the VM servers to parallelize compute-intensive chunking and fingerprinting, and to parallelize I/O-intensive fingerprint indexing on the deduplication servers by dividing the globally shared fingerprint index into multiple independent sub-indexes according to the operating systems of the VM images. To preserve the quality of VM services, a resource-aware scheduler dynamically adjusts the number of parallel chunking and fingerprinting threads according to the CPU utilization of the VM servers. Our evaluation shows that, compared to Light, a state-of-the-art deduplication system for VM images, HPDV improves deduplication throughput by up to 67%.
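The abstract does not include source code, so the following Python sketch only illustrates the two ideas it summarizes, under assumptions of our own: a fingerprint index partitioned into independent per-OS sub-indexes so lookups for different OS families do not contend on a single structure, and a resource-aware policy that caps the number of chunking/fingerprinting worker threads based on measured CPU utilization. All names (OSPartitionedIndex, resource_aware_workers, the fixed 4 KB chunking, the SHA-1 fingerprint) are illustrative choices, not HPDV's actual design.

```python
# Illustrative sketch only -- not the HPDV implementation.
import hashlib
import os
import threading
from concurrent.futures import ThreadPoolExecutor

CHUNK_SIZE = 4096  # fixed-size chunking for simplicity; HPDV's chunking scheme is not specified here


class OSPartitionedIndex:
    """Fingerprint index split into independent per-OS sub-indexes,
    so lookups for different OS families never share one lock."""

    def __init__(self):
        self._global_lock = threading.Lock()
        self._subindexes = {}  # os_type -> (fingerprint set, per-sub-index lock)

    def _subindex(self, os_type):
        with self._global_lock:
            if os_type not in self._subindexes:
                self._subindexes[os_type] = (set(), threading.Lock())
            return self._subindexes[os_type]

    def is_duplicate(self, os_type, fingerprint):
        fps, lock = self._subindex(os_type)
        with lock:
            if fingerprint in fps:
                return True
            fps.add(fingerprint)
            return False


def resource_aware_workers(cpu_util, max_workers=8, threshold=0.7):
    """Hypothetical policy: use more threads when the VM server is idle,
    back off to a single thread once utilization crosses a threshold."""
    if cpu_util >= threshold:
        return 1
    idle_fraction = (threshold - cpu_util) / threshold
    return max(1, int(max_workers * idle_fraction))


def chunk_and_fingerprint(data, os_type, index, workers):
    """Chunk an image buffer, fingerprint the chunks in parallel,
    and return the number of unique chunks found."""
    chunks = [data[i:i + CHUNK_SIZE] for i in range(0, len(data), CHUNK_SIZE)]

    def process(chunk):
        fp = hashlib.sha1(chunk).hexdigest()        # compute-intensive fingerprinting
        return not index.is_duplicate(os_type, fp)  # index-side duplicate lookup

    with ThreadPoolExecutor(max_workers=workers) as pool:
        return sum(pool.map(process, chunks))


if __name__ == "__main__":
    index = OSPartitionedIndex()
    image = os.urandom(1 << 20) * 2                     # toy "VM image" with obvious duplicates
    workers = resource_aware_workers(cpu_util=0.3)      # pretend the VM server is 30% busy
    unique = chunk_and_fingerprint(image, "linux", index, workers)
    print(f"workers={workers}, unique chunks={unique}")
```

In this sketch the per-OS partitioning serves the same purpose the abstract describes for the deduplication servers: requests for different operating systems touch disjoint sub-indexes and can therefore be served in parallel, while the worker-count policy stands in for the resource-aware scheduler that protects foreground VM services.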
