ICC 2021 - IEEE International Conference on Communications 2021
DOI: 10.1109/icc42927.2021.9500892
|View full text |Cite
|
Sign up to set email alerts
|

Adaptive and Collaborative Edge Inference in Task Stream with Latency Constraint

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(5 citation statements)
references
References 13 publications
0
5
0
Order By: Relevance
“…For tasks with deadlines, edgent can maximize accuracy without exceeding the deadline. Song et al [49] also propose a collaborative inference system based on edgent, which applies early-exit mechanism and model division technology to solve the problem of EI in task flow scenarios. At the same time, the authors design an offline dynamic programming algorithm and an online deep reinforcement learning algorithm to dynamically select the exit point and partition point of the model in the task flow, so as to balance the efficiency and accuracy of inference tasks.…”
Section: Inference Accuracy Maximizationmentioning
confidence: 99%
“…For tasks with deadlines, edgent can maximize accuracy without exceeding the deadline. Song et al [49] also propose a collaborative inference system based on edgent, which applies early-exit mechanism and model division technology to solve the problem of EI in task flow scenarios. At the same time, the authors design an offline dynamic programming algorithm and an online deep reinforcement learning algorithm to dynamically select the exit point and partition point of the model in the task flow, so as to balance the efficiency and accuracy of inference tasks.…”
Section: Inference Accuracy Maximizationmentioning
confidence: 99%
“…Therefore, given S i,i , step 4 can provide S i,j , since the grouping policy that gives S i,i is the same as that of S i,j except the last group. To derive S i+1,i+1 , we first need to derive the feasible region of the second last group, where the latency constraint of the second last group cannot be too close to that of the last group according to (20). As a result, D is used to denote the set of all feasible indices of the first task in the second last group (step 6).…”
Section: B Different Latency Constraintsmentioning
confidence: 99%
“…The authors minimize the coinference latency under light workload scenarios or maximize the co-inference throughput under heavy workload scenarios. Further, the authors of [20] extend Edgent to the scenarios with random task arrivals. They propose a deep reinforcement learning agent to balance the trade-off between the number of tasks completed within the latency constraints and the inference accuracy.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations