2019 IEEE International Symposium on Information Theory (ISIT) 2019
DOI: 10.1109/isit.2019.8849845
|View full text |Cite
|
Sign up to set email alerts
|

Cascaded Coded Distributed Computing on Heterogeneous Networks

Abstract: Coded distributed computing (CDC) introduced by Li et al. in 2015 offers an efficient approach to trade computing power to reduce the communication load in general distributed computing frameworks such as MapReduce. For the more general cascaded CDC, Map computations are repeated at r nodes to significantly reduce the communication load among nodes tasked with computing Q Reduce functions s times. While an achievable cascaded CDC scheme was proposed, it only operates on homogeneous networks, where the storage,… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
29
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
3
1

Relationship

2
5

Authors

Journals

citations
Cited by 31 publications
(29 citation statements)
references
References 13 publications
0
29
0
Order By: Relevance
“…A unique contribution of this work is the successful validation of the FLCD through empirical evaluations on AMAZON EC2. This provides strong evidence on the effectiveness of the combinatorial designs utilized in not only this work, but also those in [24], [25].…”
Section: Introductionmentioning
confidence: 64%
See 1 more Smart Citation
“…A unique contribution of this work is the successful validation of the FLCD through empirical evaluations on AMAZON EC2. This provides strong evidence on the effectiveness of the combinatorial designs utilized in not only this work, but also those in [24], [25].…”
Section: Introductionmentioning
confidence: 64%
“…This is also the first time that theoretical predictions of the shuffle time of a CDC design are validated by empirical evaluations. While the proposed FLCD schemes in this work originate from previously developed combinatorial designs for CDC networks [24], [25], a key difference is that FLCD leverages the design freedom in defining map and reduce functions to support varying IV sizes in a more general MapReduce framework. Compared to [24], [25] which focus on heterogeneous systems, this new approach puts a different emphasis on asymptotic homogeneous systems and aims to design more flexible CDC schemes that can operate under a wider range of system parameters.…”
Section: Introductionmentioning
confidence: 99%
“…In our computation-aware function assignment, the least number of output functions assigned to nodes is given by . Table II lists the least numbers required for input files and output functions in [3], [10], [13], [14] and our scheme in the MapReduce system considered in Section IV-C. It can be seen that the numbers required for output functions in our function assignment strategies are relatively close to existing works, and our computation-aware function assignment requires less number of output functions than those in [13], [14], but the number required for input files in our scheme is much larger than existing works.…”
Section: Discussion On the Required Numbers Of Input Files And Outmentioning
confidence: 99%
“…When m > 0.75 and K = 12, our shuffle-aware function assignment achieves smaller communication load than L * Hom , because: 1) coded multicasting opportunities are sufficiently exploited by this function assignment; 2) nodes with higher computation load are assigned more output functions and less communication is needed to satisfy the requests of these nodes. Table I shows the achievable communication loads of [13], [14] and our results with four function assignments for certain m in the MapReduce systems with K = 12. The communication load in [13] is the largest because each output function is computed by multiple nodes.…”
Section: Corollary 1 For a Heterogeneous Mapreduce Computing System mentioning
confidence: 99%
“…A converse bound was proposed in [14] to show that the proposed coded distributed computing scheme is optimal in terms of communication load. This coded distributed computing framework was extended to the cases such as computing only necessary intermediate values [15], [16], reducing file partitions and number of output functions [16], [17], and considering random network topologies [18], stragglers [19], storage cost [20], and heterogeneous computing power, function assignment and storage space [21], [22].…”
Section: Relation To Other Problemsmentioning
confidence: 99%