Demand-Aware Erasure Coding for Distributed Storage Systems

Jun Li, Baochun Li

doi:10.1109/tcc.2018.2885306

Abstract

Distributed storage systems provide cloud storage services by storing data on commodity storage servers. Conventionally, data are protected against failures of such commodity servers by replication. Erasure coding incurs less storage overhead than replication while tolerating the same number of failures, and thus has been replacing replication in many distributed storage systems. However, with erasure coding, the overhead of reconstructing data after failures also increases significantly. Under ever-changing workloads in which data accesses can be highly skewed, it is challenging to deploy erasure coding with parameter values that achieve a good trade-off between storage overhead and reconstruction overhead. In this paper, we propose Zebra, a framework that encodes data according to their demand into multiple tiers, each of which deploys an erasure code with different parameter values. Zebra automatically determines the number of such tiers and dynamically assigns erasure codes with optimal parameter values to the corresponding tiers. With Zebra, a flexible trade-off between storage overhead and reconstruction overhead is achieved across multiple tiers. When demand changes, Zebra adjusts itself with a marginal amount of network transfer. We demonstrate that Zebra can work with two representative families of erasure codes in distributed storage systems: Reed-Solomon codes and local reconstruction codes.
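The trade-off described in the abstract can be illustrated with a small sketch. It is not Zebra's algorithm, which the abstract does not specify; it only uses the standard properties of a Reed-Solomon code with k data blocks and r parity blocks: storage overhead is (k + r) / k, and rebuilding one lost block reads k surviving blocks. The function names, the two-tier split, and the data and demand fractions are hypothetical values chosen for illustration.

```python
from fractions import Fraction


def reed_solomon(k, r):
    """Return (storage overhead, blocks read to rebuild one lost block)
    for a Reed-Solomon code with k data blocks and r parity blocks."""
    return Fraction(k + r, k), k


def tiered_cost(tiers):
    """Combine several tiers into overall costs.

    tiers: list of (data_fraction, demand_fraction, k, r), where the
    fractions across all tiers each sum to 1. These inputs are
    illustrative assumptions, not values from the paper.
    Returns (overall storage overhead, demand-weighted blocks read
    per reconstruction).
    """
    storage = sum(data * reed_solomon(k, r)[0] for data, _, k, r in tiers)
    reads = sum(demand * reed_solomon(k, r)[1] for _, demand, k, r in tiers)
    return storage, reads


# One code for all data: either cheap to store or cheap to rebuild.
all_cold = tiered_cost([(Fraction(1), Fraction(1), 12, 2)])
all_hot = tiered_cost([(Fraction(1), Fraction(1), 4, 2)])

# Hypothetical skew: 10% of the data receives 80% of the demand.
two_tiers = tiered_cost([
    (Fraction(1, 10), Fraction(4, 5), 4, 2),    # hot tier, small k
    (Fraction(9, 10), Fraction(1, 5), 12, 2),   # cold tier, large k
])

for name, (storage, reads) in [("RS(12,2) only", all_cold),
                               ("RS(4,2) only", all_hot),
                               ("two tiers", two_tiers)]:
    print(f"{name}: storage x{float(storage):.3f}, "
          f"avg blocks read {float(reads):.1f}")
```

Under these assumed numbers, the two-tier layout stores data at close to the overhead of the large-k code while keeping demand-weighted reconstruction reads much nearer to those of the small-k code, which is the kind of flexibility the abstract attributes to using multiple tiers.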

Journal: IEEE Transactions on Cloud Computing
Publication Date: Apr 1, 2021
Citations: 10

Similar Papers

Zebra: Demand-aware erasure coding for distributed storage systems
Jun Li ... Baochun Li
01 Jun 2016

A New Adaptive Coding Selection Method for Distributed Storage Systems
Bing Wei ... Yao Song
IEEE Access | VOL. 6
01 Jan 2018

Benchmarking the performance of Hadoop triple replication and erasure coding on a nation-wide distributed cloud
Lakshmi J Mohan ... Aaron Harwood
01 Jun 2015

Beehive: Erasure Codes for Fixing Multiple Failures in Distributed Storage Systems
Jun Li ... Baochun Li
IEEE Transactions on Parallel and Distributed Systems | VOL. 28
01 May 2017
