SAFE: A Source Deduplication Framework for Efficient Cloud Backup Services

Yujuan Tan,Hong Jiang,Edwin Hsing-Mean Sha,Zhichao Yan,Dan Feng

doi:10.1007/s11265-013-0775-x

Abstract

Due to the relatively low bandwidth of WAN that supports cloud backup services and the increasing amount of backed-up data stored at service providers, the deduplication scheme used in the cloud backup environment must remove the redundant data for backup operations to reduce backup times and storage costs and for restore operations to reduce restore times. In this paper, we propose SAFE, a source deduplication framework for efficient cloud backup and restore operations. SAFE consists of three salient features, (1) Hybrid Deduplication, combining the global file-level and local chunk-level deduplication to achieve an optimal tradeoff between the deduplication efficiency and overhead to achieve a short backup time; (2) Semantic-aware Elimination, exploiting file semantics to narrow the search space for the redundant data in hybrid deduplication process to reduce the deduplication overhead; and (3) Unmodified Data Removal, removing the files and data chunks that are kept intact from data transmission for some restore operations. Through extensive experiments driven by real-world datasets, the SAFE framework is shown to maintain a much higher deduplication efficiency/overhead ratio than existing solutions, shortening the backup time by an average of 38.7 %, and reduce the restore time by a ratio of up to 9.7 : 1.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SAFE: A Source Deduplication Framework for Efficient Cloud Backup Services

Abstract

Talk to us

Similar Papers

More From: Journal of Signal Processing Systems

Lead the way for us

Journal: Journal of Signal Processing Systems	Publication Date: Jun 21, 2013
Citations: 40

Similar Papers

CABdedupe: A Causality-Based Deduplication Performance Booster for Cloud Backup Services
Yujuan Tan ... Zhichao Yan
-
Yujuan Tan, et. al.Yujuan Tan ... Zhichao Yan
01 May 2011
01 May 2011

A Study on Cloud Backup Technology and Its Development
He Zhonglin ... He Yuhua
-
He Zhonglin, et. al.He Zhonglin ... He Yuhua
01 Jan 2010
01 Jan 2010

AA-Dedupe: An Application-Aware Source Deduplication Approach for Cloud Backup Services in the Personal Computing Environment
Yinjin Fu ... Lei Tian
-
Yinjin Fu, et. al.Yinjin Fu ... Lei Tian
01 Sep 2011
01 Sep 2011

Application-Aware Client-Side Data Reduction and Encryption of Personal Data in Cloud Backup Services
Yin-Jin Fu ... Nong Xiao
Journal of Computer Science and Technology | VOL. 28
Yin-Jin Fu, et. al.Yin-Jin Fu ... Nong Xiao
01 Nov 2013
Journal of Computer Science and Technology | VOL. 28

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SAFE: A Source Deduplication Framework for Efficient Cloud Backup Services

Abstract

Talk to us

Similar Papers

More From: Journal of Signal Processing Systems