Abstract

Deduplication inevitably results in data fragmentation, because logically continuous data ends up scattered across many disk locations. In this work we focus on fragmentation caused by duplicates from previous backups of the same backup set, since such duplicates are very common due to repeated full backups containing a lot of unchanged data. For systems with in-line deduplication, which detect duplicates during writing and avoid storing them again, such fragmentation causes data from the latest backup to be scattered across older backups. As a result, restore time from the latest backup can increase significantly, sometimes by more than a factor of two.
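
The following is a minimal sketch (not from the paper) illustrating the mechanism the abstract describes: an in-line deduplicating store keeps only the first copy of each chunk, so restoring the newest backup must read from locations written by several older backup generations. All names and the toy storage model are illustrative assumptions.

```python
# Minimal sketch of in-line deduplication fragmentation (illustrative only).
import hashlib

chunk_store = {}   # fingerprint -> (backup_id, offset) of the stored copy
next_offset = 0    # next free position in a simulated append-only log

def write_backup(backup_id, chunks):
    """Store only previously unseen chunks; return the restore 'recipe'."""
    global next_offset
    recipe = []
    for chunk in chunks:
        fp = hashlib.sha256(chunk).hexdigest()
        if fp not in chunk_store:                  # new data: append to the log
            chunk_store[fp] = (backup_id, next_offset)
            next_offset += len(chunk)
        recipe.append(chunk_store[fp])             # duplicate: point at old copy
    return recipe

# Two full backups of the same backup set; most chunks are unchanged.
backup1 = [b"A", b"B", b"C", b"D"]
backup2 = [b"A", b"X", b"C", b"Y"]                 # only X and Y are new

write_backup(1, backup1)
recipe2 = write_backup(2, backup2)

# Restoring backup 2 alternates between locations written by backup 1 and
# backup 2, i.e. the logically continuous stream is physically scattered:
print(recipe2)   # [(1, 0), (2, 4), (1, 2), (2, 5)]
```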
