Abstract:Petabyte-scale distributed storage systems are currently transitioning to erasure codes to achieve higher storage efficiency. Classical codes like Reed-Solomon are highly sub-optimal for distributed environments due to their high overhead in single-failure events. Locally Repairable Codes (LRCs) form a new family of codes that are repair efficient. In particular, LRCs minimize the number of nodes participating in single node repairs during which they generate small network traffic. Two large-scale distributed … Show more
“…This was re-cently further generalized for nonlinear codes [25] and several explicit constructions of locally repairable codes have been investigated (e.g. [18,24,30,37,38]). Note that locally repairable codes are related but fundamentally different from locally decodable codes [42].…”
Consider a large database of n data items that need to be stored using m servers. We study how to encode information so that a large number k of read requests can be performed in parallel while the rate remains constant (and ideally approaches one). This problem is equivalent to the design of multiset Batch Codes introduced by Ishai, Kushilevitz, Ostrovsky and Sahai [17].We give families of multiset batch codes with asymptotically optimal rates of the form 1 − 1/poly(k) and a number of servers m scaling polynomially in the number of read requests k. An advantage of our batch code constructions over most previously known multiset batch codes is explicit and deterministic decoding algorithms and asymptotically optimal fault tolerance.Our main technical innovation is a graph-theoretic method of designing multiset batch codes using dense bipartite graphs with no small cycles. We modify prior graph constructions of dense, high-girth graphs to obtain our batch code results. We achieve close to optimal tradeoffs between the parameters for bipartite graph based batch codes.
“…This was re-cently further generalized for nonlinear codes [25] and several explicit constructions of locally repairable codes have been investigated (e.g. [18,24,30,37,38]). Note that locally repairable codes are related but fundamentally different from locally decodable codes [42].…”
Consider a large database of n data items that need to be stored using m servers. We study how to encode information so that a large number k of read requests can be performed in parallel while the rate remains constant (and ideally approaches one). This problem is equivalent to the design of multiset Batch Codes introduced by Ishai, Kushilevitz, Ostrovsky and Sahai [17].We give families of multiset batch codes with asymptotically optimal rates of the form 1 − 1/poly(k) and a number of servers m scaling polynomially in the number of read requests k. An advantage of our batch code constructions over most previously known multiset batch codes is explicit and deterministic decoding algorithms and asymptotically optimal fault tolerance.Our main technical innovation is a graph-theoretic method of designing multiset batch codes using dense bipartite graphs with no small cycles. We modify prior graph constructions of dense, high-girth graphs to obtain our batch code results. We achieve close to optimal tradeoffs between the parameters for bipartite graph based batch codes.
“…Locally repairable codes for several parameter sets (N, K, D, R, ∆) are constructed in [6], [9], [21], [22], [11], [23]. In [6] it was proved that bound (2) is not achievable for all parameters (N, K, R).…”
Abstract-Typically, locally repairable codes (LRCs) and regenerating codes have been studied independently of each other, and it has not been clear how the parameters of one relate to those of the other. In this paper, a novel connection between locally repairable codes and exact regenerating codes is established. Via this connection, locally repairable codes are interpreted as exact regenerating codes. Further, some of these codes are shown to perform better than time-sharing codes between minimum bandwidth regenerating and minimum storage regenerating codes.
“…Various codes have been proposed to reduce repair locality, such as scalar linear codes [6] [8] and vector codes [7] [9] [10]. In [6] [8], extra parity constraints are introduced into encoded symbols of an MDS code to enhance repair locality, and a trade-off is demonstrated between the minimum distance and the repair locality of the resulting code.…”
Section: Introductionmentioning
confidence: 99%
“…In [6] [8], extra parity constraints are introduced into encoded symbols of an MDS code to enhance repair locality, and a trade-off is demonstrated between the minimum distance and the repair locality of the resulting code. The trade-off is extended in [7] to accommodate vector codes for local repair This work was supported in part by NSF under Grant ECCS-1055877.…”
Section: Introductionmentioning
confidence: 99%
“…An outer MDS code and an inner fractional repetition code are employed in [13] to construct regenerating codes with local, exact and uncoded repair. Scalar linear codes such as Pyramid codes investigated in [6] [14] and that proposed in [8] can also be viewed as examples of two-layer encoding.…”
In this paper, we propose locally repairable codes (LRCs) with optimal minimum distance for distributed storage systems (DSS). A two-layer encoding structure is employed to ensure data reconstruction and the designated repair locality. The data is first encoded in the first layer by any existing maximum distance separable (MDS) codes, and then the encoded symbols are divided into non-overlapping groups and encoded by an MDS array code in the second layer. The encoding in the second layer provides enough redundancy for local repair, while the overall code performs recovery of the data based on redundancy from both layers. Our codes can be constructed over a finite field with size growing linearly with the total number of nodes in the DSS, and facilitate efficient degraded reads.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.