Abstract
Deduplication eliminates duplicate or redundant data to reduce the volume of stored data and is commonly used in data backup, network optimization, and storage management. Traditional deduplication methods, however, have limitations when handling encrypted data and offer weak security. The primary objective of this project is to develop new distributed deduplication systems with increased reliability. In these systems, data chunks are distributed across the Hadoop Distributed File System (HDFS), and a robust key management scheme ensures secure deduplication across the slave (data) nodes. Instead of keeping multiple copies of the same content, deduplication retains only one physical copy and refers all other instances to that copy. The granularity of deduplication can range from an entire file down to a single data block. The MD5 and 3DES algorithms are used to strengthen the deduplication process. The proposed approach in this project is Proof of Ownership of the File (POF); with this method, deduplication can effectively address the issues of reliability and label consistency in HDFS storage systems. The proposed system reduces the cost and time associated with uploading and downloading data while also optimizing storage space.
Key Words: Cloud computing, data storage, file checksum algorithms, computational infrastructure, deduplication.
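As a minimal sketch of the chunk-level deduplication the abstract describes, the example below computes an MD5 fingerprint for each chunk, stores only unseen chunks after 3DES encryption, and resolves duplicates to the location of the existing copy. The class name, the in-memory index, and the HDFS write step are illustrative assumptions, not the paper's implementation; the actual system keeps this metadata on HDFS and manages keys across the slave nodes.

public class DedupSketch {
    // Maps an MD5 fingerprint (hex) to the stored location of the chunk.
    // In the real system this index would live on HDFS, not in memory (assumption).
    private final java.util.Map<String, String> chunkIndex = new java.util.HashMap<>();

    // Compute the MD5 fingerprint of a data chunk.
    static String md5Hex(byte[] chunk) throws Exception {
        byte[] digest = java.security.MessageDigest.getInstance("MD5").digest(chunk);
        StringBuilder sb = new StringBuilder();
        for (byte b : digest) sb.append(String.format("%02x", b));
        return sb.toString();
    }

    // Encrypt a chunk with 3DES (DESede) before it is written to storage.
    static byte[] encrypt3des(byte[] chunk, byte[] keyBytes) throws Exception {
        javax.crypto.SecretKey key = javax.crypto.SecretKeyFactory.getInstance("DESede")
                .generateSecret(new javax.crypto.spec.DESedeKeySpec(keyBytes));
        javax.crypto.Cipher cipher = javax.crypto.Cipher.getInstance("DESede/ECB/PKCS5Padding");
        cipher.init(javax.crypto.Cipher.ENCRYPT_MODE, key);
        return cipher.doFinal(chunk);
    }

    // Store a chunk only if its fingerprint is unseen; otherwise return the
    // existing location so the new upload simply references that one copy.
    String storeOrReference(byte[] chunk, byte[] key, String location) throws Exception {
        String fingerprint = md5Hex(chunk);
        String existing = chunkIndex.get(fingerprint);
        if (existing != null) {
            return existing;                          // duplicate: keep one physical copy
        }
        byte[] ciphertext = encrypt3des(chunk, key);  // unique chunk: encrypt, then store
        // ... write `ciphertext` to HDFS at `location` (omitted in this sketch) ...
        chunkIndex.put(fingerprint, location);
        return location;
    }

    public static void main(String[] args) throws Exception {
        DedupSketch dedup = new DedupSketch();
        byte[] key = "0123456789abcdef01234567".getBytes(java.nio.charset.StandardCharsets.UTF_8); // 24-byte 3DES key
        byte[] chunk = "the same block uploaded twice".getBytes(java.nio.charset.StandardCharsets.UTF_8);
        System.out.println(dedup.storeOrReference(chunk, key, "/dedup/block-0001"));
        System.out.println(dedup.storeOrReference(chunk, key, "/dedup/block-0002")); // resolves to block-0001
    }
}

Running the sketch prints the same location twice, illustrating how a second upload of identical content is answered with a reference to the single stored copy rather than a new write.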