Various Data Deduplication Techniques of Primary Storage

D Viji, S Revathy

doi:10.1109/icces45898.2019.9002185

Abstract

In the emerging era of data-driven applications, the volume of data has increased significantly owing to new types of data-generating devices and storage devices. As a result, a large amount of repeated data is generated and stored in cloud services, and this redundant data drastically reduces the storage space available in the cloud environment. Data deduplication addresses the problem by ensuring that each data block or chunk is stored exactly once in the environment. Deduplication can be applied to different storage types: archival or backup storage, primary storage, disk drives, and RAM. A primary storage system contains mutable data, that is, data that is often changed or removed, whereas a backup storage system mainly contains immutable data, which rarely changes and some of which can never be deleted. This paper classifies the data deduplication techniques used in primary storage systems and identifies the performance and challenges of each technique. Finally, ongoing research issues and undefined design points of deduplication techniques in primary storage are identified and discussed.
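To make the idea of storing each chunk exactly once concrete, the following is a minimal sketch and is not taken from the paper. It assumes fixed-size chunking, SHA-256 fingerprints, and an in-memory index. The class name `DedupStore` and its methods are hypothetical. Reference counts are included because primary storage holds mutable data, so a chunk can be reclaimed only when no file refers to it any longer.

```python
import hashlib


class DedupStore:
    """Toy content-addressed chunk store (illustrative assumption, not the paper's design)."""

    def __init__(self, chunk_size=4096):
        self.chunk_size = chunk_size
        self.chunks = {}    # fingerprint -> chunk bytes, each stored once
        self.refcount = {}  # fingerprint -> number of references from file recipes
        self.files = {}     # file name -> ordered list of fingerprints (the "recipe")

    def write(self, name, data):
        # Overwriting a file first releases the chunks of its old version.
        if name in self.files:
            self.delete(name)
        recipe = []
        for i in range(0, len(data), self.chunk_size):
            chunk = data[i:i + self.chunk_size]
            fp = hashlib.sha256(chunk).hexdigest()
            # Store the chunk only if its fingerprint has not been seen before.
            if fp not in self.chunks:
                self.chunks[fp] = chunk
            self.refcount[fp] = self.refcount.get(fp, 0) + 1
            recipe.append(fp)
        self.files[name] = recipe

    def read(self, name):
        # Rebuild the file by concatenating its chunks in recipe order.
        return b"".join(self.chunks[fp] for fp in self.files[name])

    def delete(self, name):
        # Drop one reference per recipe entry; reclaim chunks nobody uses.
        for fp in self.files.pop(name):
            self.refcount[fp] -= 1
            if self.refcount[fp] == 0:
                del self.refcount[fp]
                del self.chunks[fp]
```

With a chunk size of 4 bytes, writing `b"AAAABBBBAAAA"` and then `b"BBBBCCCC"` under two names leaves only three unique chunks in the store, since the repeated `AAAA` and `BBBB` chunks are not stored again. Fixed-size chunking is chosen here only for brevity; it is one of several possible chunking strategies.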

Full Text