2022
DOI: 10.48550/arxiv.2207.07812
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A Survey on Collaborative DNN Inference for Edge Intelligence

Abstract: With the vigorous development of artificial intelligence (AI), the intelligent applications based on deep neural network (DNN) change people's lifestyles and the production efficiency. However, the huge amount of computation and data generated from the network edge becomes the major bottleneck, and traditional cloud-based computing mode has been unable to meet the requirements of real-time processing tasks. To solve the above problems, by embedding AI model training and inference capabilities into the network … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 66 publications
(72 reference statements)
0
4
0
Order By: Relevance
“…Collaboration among devices in proximity is also beneficially exploited for inference splitting in [84], where close devices exchange data via the Web real-time communication protocol. Similar approaches are surveyed in [99] and references therein.…”
Section: ) State-of-the-artmentioning
confidence: 99%
“…Collaboration among devices in proximity is also beneficially exploited for inference splitting in [84], where close devices exchange data via the Web real-time communication protocol. Similar approaches are surveyed in [99] and references therein.…”
Section: ) State-of-the-artmentioning
confidence: 99%
“…The balancedSplit function implements a binary search that calls at each step to the greedy method splitCheck to verify if the array P can be segmented into s parts with at most bound parameters in each. This method traverses the array assigning values to a fragment as long as their sum is less than bound, and starting with a new fragment when exceeding it (lines [19][20][21][22][23][24]. If the final amount of fragments needed does not exceed the required s, then bound is a sufficient upper bound and smaller values are searched (lines 8-10).…”
Section: Balancing Model Segmentationmentioning
confidence: 99%
“…Edge Computing aims at bringing computations near to the sensors in Internet of Things (IoT) deployments, in order to improve latencies, increase security and reduce access cost to datacenters. The convergence of Edge Computing and Artificial Intelligence (AI) tasks in the so-called Edge-AI paradigm [14,20] pursues bringing intelligence to edge devices in order to cover a number of applications necessary in many IoT scenarios (object detection for smart cameras [9], smart city applications [24], healthcare [1] or autonomous driving [16]), among others), bringing all the benefits of Edge Computing to the AI arena.…”
Section: Introductionmentioning
confidence: 99%
“…The common solution of EI is to deploy powerful edge servers to operate DL execution at the edge. In scenarios where deploying these servers is challenging, costly, or even impossible in high-mobility and harsh scenarios, device-to-device collaborative DL execution could be a viable approach to perform these applications [19].…”
Section: Introductionmentioning
confidence: 99%