2021
DOI: 10.48550/arxiv.2110.14895
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Pipeline Parallelism for Inference on Heterogeneous Edge Computing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(4 citation statements)
references
References 19 publications
0
4
0
Order By: Relevance
“…A model partitioning problem for model-distributed inference is designed using dynamic programming [5] with the cost of high complexity. As compared to [5], our AR-MDI algorithm (i) reduces the computation cost as our algorithm makes on the fly calculations while [5] solves a dynamic programming problem, (ii) makes model distribution fully decentralized while [5] relies on a genie to solve the dynamic programming problem and make the corresponding model distribution, (iii) is adaptive to timevarying resources, and (iv) recovers if there is a failure in the network. Model distributed inference is considered for a multi-source setup in [16].…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…A model partitioning problem for model-distributed inference is designed using dynamic programming [5] with the cost of high complexity. As compared to [5], our AR-MDI algorithm (i) reduces the computation cost as our algorithm makes on the fly calculations while [5] solves a dynamic programming problem, (ii) makes model distribution fully decentralized while [5] relies on a genie to solve the dynamic programming problem and make the corresponding model distribution, (iii) is adaptive to timevarying resources, and (iv) recovers if there is a failure in the network. Model distributed inference is considered for a multi-source setup in [16].…”
Section: Related Workmentioning
confidence: 99%
“…The partial rate δ n−1 (k) corresponds to the rate in the denominator of (2), i.e., δ n−1 (k) = n−1 z=1 1/γ z (k). 5 On the other hand, (k−1) corresponds to the whole term, i.e., (k−1) = N z=1 1/γ z (k−1). Each worker calculates their partial rate by summing up the previous worker's partial rate and its own rate, i.e., ρ n (k)/d n (k) as shown in line 34.…”
Section: Ar-mdi: Adaptive and Resilient MDImentioning
confidence: 99%
See 2 more Smart Citations