2020
DOI: 10.1109/tit.2020.2964547
|View full text |Cite
|
Sign up to set email alerts
|

On the Fundamental Limits of Coded Data Shuffling for Distributed Machine Learning

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
13
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 14 publications
(15 citation statements)
references
References 23 publications
2
13
0
Order By: Relevance
“…Elmahdy and Mohajer considered the data shuffling problem that a master node communicates a set of files to a set of worker nodes through a shared link. They proposed a deterministic and systematic coded shuffling scheme to find out the exact rate of cache files . These researched results support our work to reduce shuffling overloads, include computing and communication overhead, through an execution instructions redesigning methods.…”
Section: Related Worksupporting
confidence: 71%
See 1 more Smart Citation
“…Elmahdy and Mohajer considered the data shuffling problem that a master node communicates a set of files to a set of worker nodes through a shared link. They proposed a deterministic and systematic coded shuffling scheme to find out the exact rate of cache files . These researched results support our work to reduce shuffling overloads, include computing and communication overhead, through an execution instructions redesigning methods.…”
Section: Related Worksupporting
confidence: 71%
“…They proposed a deterministic and systematic coded shuffling scheme to find out the exact rate of cache files. 31,32 These researched results support our work to reduce shuffling overloads, include computing and communication overhead, through an execution instructions redesigning methods. We analyze the programming language and algorithm structure to practically design a set of replacement instructions and cache the exact set of data for iterative applications from implementation perspective.…”
Section: Related Work Comparisonssupporting
confidence: 68%
“…The above data exchange problem models a number of cache-enabled multi-receiver communication problems studied recently in the coding theory community, including Coded Caching [1], Coded Distributed Computing [2,3], Coded Data Shuffling [4][5][6], and Coded Data Rebalancing [7]. In [8], a special case of our general problem here was considered in the name of cooperative data exchange, where the goal was to reach a state in which all nodes have all the data in the system.…”
Section: Introduction and Main Resultsmentioning
confidence: 99%
“…We have considered the decentralized version of the coded data shuffling problem in this subsection. The centralized version of the data shuffling problem was introduced in [4] and its information theoretic limits were studied elaborately in [6]. Our data exchange bound, when applied to the setting in [6], results in a looser converse result than that in [6].…”
Section: Proof Of the Decentralized Data Shuffling Conversementioning
confidence: 99%
See 1 more Smart Citation