Encrypted Data Deduplication Research Articles

Data deduplication is a technique to eliminate duplicate data in order to save storage space and enlarge upload bandwidth, which has been applied by cloud storage systems. However, a cloud storage provider (CSP) may tamper user data or cheat users to pay unused storage for duplicate data that are only stored once. Although previous solutions adopt message-locked encryption along with Proof of Retrievability (PoR) to check the integrity of deduplicated encrypted data, they ignore proving the correctness of duplication check during data upload and require the same file to be derived into same verification tags, which suffers from brute-force attacks and restricts users from flexibly creating their own individual verification tags. In this paper, we propose a verifiable deduplication scheme called VeriDedup to address the above problems. It can guarantee the correctness of duplication check and support flexible tag generation for integrity check over encrypted data deduplication in an integrative way. Concretely, we propose a novel Tag-flexible Deduplication-supported Integrity Check Protocol (TDICP) based on Private Information Retrieval (PIR) by introducing a novel verification tag called <inline-formula><tex-math notation="LaTeX">${note\ set}$</tex-math></inline-formula> , which allows multiple users holding the same file to generate their individual verification tags and still supports tag deduplication at the CSP. Furthermore, we make the first attempt to guarantee the correctness of data duplication check by introducing a novel User Determined Duplication Check Protocol (UDDCP) based on Private Set Intersection (PSI), which can resist a CSP from providing a fake duplication check result to users. Security analysis shows the correctness and soundness of our scheme. Simulation studies based on real data show the efficacy and efficiency of our proposed scheme and its significant advantages over prior arts.

Read full abstract

Deduplication of encrypted data is a significant function for both the privacy of stored data and efficient storage management. Several deduplication techniques have been designed to provide improved security or efficiency. In this study, we focus on the client-side deduplication technique, which has more advantages than the server-side deduplication technique, particularly in communication overhead, owing to conditional data transmissions. From a security perspective, poison, dictionary, and identification attacks are considered as threats against client-side deduplication. Unfortunately, in contrast to other attacks, identification attacks and the corresponding countermeasures have not been studied in depth. In identification attacks, an adversary tries to identify the existence of a specific file. Identification attacks should be countered because adversaries can use the attacks to break the privacy of the data owner. Therefore, in the literature, some counter-based countermeasures have been proposed as temporary remedies for such attacks. In this paper, we present an analysis of the security features of deduplication techniques against identification attacks and show that the lack of security of the techniques can be eliminated by providing uncertainness to the conditional responses in the deduplication protocol, which are based on the existence of files. We also present a concrete countermeasure, called the time-locked deduplication technique, which can provide uncertainness to the conditional responses by withholding the operation of the deduplication functionality until a predefined time. An additional cost for locking is incurred only when the file to be stored does not already exist in the server’s storage. Therefore, our technique can improve the security of client-side deduplication against identification attacks at almost the same cost as existing techniques, except in the case of files uploaded for the first time.

Read full abstract

Encrypted Data Deduplication Research Articles

Related Topics

Articles published on Encrypted Data Deduplication

AF-Dedup: Secure Encrypted Data Deduplication Based on Adaptive Dynamic Merkle Hash Forest PoW for Cloud Storage

SecDedup: Secure data deduplication with dynamic auditing in the cloud

A lightweight encrypted deduplication scheme supporting backup

VeriDedup: A Verifiable Cloud Data Deduplication Scheme With Integrity and Duplication Proof

Blockchain‐based secure deduplication of encrypted data supporting client‐side semantically secure encryption without trusted third party

Secure Cloud Data Deduplication with Efficient Re-Encryption

Updatable Block-Level Message-Locked Encryption

Secure File Storage and Deduplication in Cloud Server Using Cryptography

Secure Password-Protected Encryption Key for Deduplicated Cloud Storage Systems

Hybrid-cloud management in secure multi-domain environments

Investigating the Adoption of Hybrid Encrypted Cloud Data Deduplication With Game Theory

Toward Serverless and Efficient Encrypted Deduplication in Mobile Cloud Computing Environments

Locked Deduplication of Encrypted Data to Counter Identification Attacks in Cloud Storage Platforms

Secure Encrypted Data Deduplication Based on Data Popularity

SecDedup: Secure Encrypted Data Deduplication With Dynamic Ownership Updating

CSED: Client-Side encrypted deduplication scheme based on proofs of ownership for cloud storage

Secure deduplication of encrypted data in online and offline environments

Secure deduplication of encrypted data in online and offline environments

Efficient Client-Side Deduplication of Encrypted Data With Public Auditing in Cloud Storage

Privacy-preserving deduplication of encrypted data with dynamic ownership management in fog computing

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Encrypted Data Deduplication Research Articles

Related Topics

Articles published on Encrypted Data Deduplication

AF-Dedup: Secure Encrypted Data Deduplication Based on Adaptive Dynamic Merkle Hash Forest PoW for Cloud Storage

SecDedup: Secure data deduplication with dynamic auditing in the cloud

A lightweight encrypted deduplication scheme supporting backup

VeriDedup: A Verifiable Cloud Data Deduplication Scheme With Integrity and Duplication Proof

Blockchain‐based secure deduplication of encrypted data supporting client‐side semantically secure encryption without trusted third party

Secure Cloud Data Deduplication with Efficient Re-Encryption

Updatable Block-Level Message-Locked Encryption

Secure File Storage and Deduplication in Cloud Server Using Cryptography

Secure Password-Protected Encryption Key for Deduplicated Cloud Storage Systems

Hybrid-cloud management in secure multi-domain environments

Investigating the Adoption of Hybrid Encrypted Cloud Data Deduplication With Game Theory

Toward Serverless and Efficient Encrypted Deduplication in Mobile Cloud Computing Environments

Locked Deduplication of Encrypted Data to Counter Identification Attacks in Cloud Storage Platforms

Secure Encrypted Data Deduplication Based on Data Popularity

SecDedup: Secure Encrypted Data Deduplication With Dynamic Ownership Updating

CSED: Client-Side encrypted deduplication scheme based on proofs of ownership for cloud storage

Secure deduplication of encrypted data in online and offline environments

Secure deduplication of encrypted data in online and offline environments

Efficient Client-Side Deduplication of Encrypted Data With Public Auditing in Cloud Storage

Privacy-preserving deduplication of encrypted data with dynamic ownership management in fog computing