2020 IEEE International Symposium on Information Theory (ISIT) 2020
DOI: 10.1109/isit44484.2020.9174482

Coded Data Rebalancing: Fundamental Limits and Constructions

Abstract: Distributed databases often suffer from an unequal distribution of data among storage nodes, known as 'data skew'. Data skew arises from a number of causes, such as the removal of existing storage nodes and the addition of new, empty nodes to the database. Data skew degrades performance and necessitates 'rebalancing' at regular intervals to reduce the amount of skew. We define an r-balanced distributed database as a distributed database in which the storage across the nodes has uniform size, and each bit o…

Cited by 7 publications (26 citation statements)
References 11 publications (16 reference statements)
“…The above data exchange problem models a number of cache-enabled multi-receiver communication problems studied recently in the coding theory community, including Coded Caching [1], Coded Distributed Computing [2,3], Coded Data Shuffling [4][5][6], and Coded Data Rebalancing [7]. In [8], a special case of our general problem here was considered in the name of cooperative data exchange, where the goal was to reach a state in which all nodes have all the data in the system.…”
Section: Introduction and Main Results
mentioning
confidence: 99%
“…available exclusively in Q ∪ 0 Symmetrization over appropriately chosen demand vectors: Choosing the same special set of demand vectors D as in (7) and averaging the above lower bound over the demand vectors in D similar to the proof of Theorem 2, we obtain a bound similar to (8):…”
mentioning
confidence: 99%
“…• For the problem of Coded Data Rebalancing, introduced in [4], we show a new converse for data rebalancing under multiple node failures. (Section V) • We present a new converse for a generic coded caching setting with multiple transmitters, receivers, and multilevel cache sizes.…”
Section: Definition 1 a Data Exchange Problem On A Set Of K Nodes Is ...
mentioning
confidence: 99%
“…recently in the coding theory community, including Coded Caching [1], Coded Distributed Computing [2], Coded Data Shuffling [3], and Coded Data Rebalancing [4].…”
Section: Definition 1 a Data Exchange Problem On A Set Of K Nodes Is ...
mentioning
confidence: 99%