2023
DOI: 10.1109/access.2023.3244497
|View full text |Cite
|
Sign up to set email alerts
|

DNN Partitioning for Inference Throughput Acceleration at the Edge

Abstract: Deep neural network (DNN) inference on streaming data requires computing resources to satisfy inference throughput requirements. However, latency and privacy sensitive deep learning applications cannot afford to offload computation to remote clouds because of the implied transmission cost and lack of trust in third-party cloud providers. Among solutions to increase performance while keeping computation on a constrained environment, hardware acceleration can be onerous, and model optimization requires extensive… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(2 citation statements)
references
References 20 publications
0
2
0
Order By: Relevance
“…is paper focuses on the theoretical and experimental study of stability in distributed, decentralized systems for resource discovery and allocation in non-deterministic, heterogenous, time-variant networking and communication infrastructures. is analysis can be relevant for a number of use cases: multicast content distribution in data centers [1] or in other constrained infrastructures, distributed applications and systems including information-centric architectures [2], mechanisms for dynamic allocation of workloads in the cloud [3], computing resources in data centers [4] or in other deployments of the Internet edge [5]. In all these systems, the study of performance is challenging as these systems increasingly rely on distributed multi-party architectures, involving uncoordinated or loosely coordinated agents that interact to each other, may have partial, possibly inconsistent views of the environment, and follow autonomous, possibly con icting policies.…”
Section: Problem Statementmentioning
confidence: 99%
See 1 more Smart Citation
“…is paper focuses on the theoretical and experimental study of stability in distributed, decentralized systems for resource discovery and allocation in non-deterministic, heterogenous, time-variant networking and communication infrastructures. is analysis can be relevant for a number of use cases: multicast content distribution in data centers [1] or in other constrained infrastructures, distributed applications and systems including information-centric architectures [2], mechanisms for dynamic allocation of workloads in the cloud [3], computing resources in data centers [4] or in other deployments of the Internet edge [5]. In all these systems, the study of performance is challenging as these systems increasingly rely on distributed multi-party architectures, involving uncoordinated or loosely coordinated agents that interact to each other, may have partial, possibly inconsistent views of the environment, and follow autonomous, possibly con icting policies.…”
Section: Problem Statementmentioning
confidence: 99%
“…Fig. 6 displays the Beta ing 5 distribution parameters for N = 500 samples of single-orbit (b-orbit) processes at iteration n = 3000, a er excluding extreme values (in the case of b-orbit, zeros 6 ).…”
Section: P (See Appendix D)mentioning
confidence: 99%