Locality and Availability in Distributed Storage

Ankit Singh Rawat,Dimitris S Papailiopoulos,Alexandros G Dimakis,Sriram Vishwanath

doi:10.1109/tit.2016.2524510

Ankit Singh Rawat, Dimitris S Papailiopoulos + Show 2 more

Open Access

https://doi.org/10.1109/tit.2016.2524510

Copy DOI

Abstract

This paper studies the problem of information symbol availability in codes: we refer to a systematic code as code with $(r, t)$ -availability if every information (systematic) symbol can be reconstructed from $t$ disjoint groups of other code symbols, each of the sizes at most $r$ . This paper shows that it is possible to construct codes that can support a scaling number of parallel reads while keeping the rate to be an arbitrarily high constant. It further shows that this is possible with the minimum Hamming distance arbitrarily close to the Singleton bound. This paper also presents a bound demonstrating a tradeoff between rate, minimum Hamming distance, and availability parameters. Our codes match the aforementioned bound, and their constructions rely on certain combinatorial structures. Resolvable designs provide one way to realize these required combinatorial structures. The two constructions presented in this paper require field sizes, which are linear and exponential in the code length, respectively. From a practical standpoint, our codes are relevant for distributed storage applications involving hot data, i.e., the information, which is frequently accessed by multiple processes in parallel.

Full Text