2023
DOI: 10.1109/tmc.2022.3141080
|View full text |Cite
|
Sign up to set email alerts
|

Large-Scale Computation Offloading Using a Multi-Agent Reinforcement Learning in Heterogeneous Multi-Access Edge Computing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 47 publications
(7 citation statements)
references
References 31 publications
0
7
0
Order By: Relevance
“…These studies are usually model-based traditional optimization algorithms, which require a large amount of a priori knowledge to construct an accurate mathematical model, a situation that leads to poor generalization ability of the model and makes it difficult to adapt to the new states that appear in dynamic environments [18] . To cope with the UAV group task migration problem in this dynamic task environment, some researches have proposed task migration methods based on multi-agent reinforcement learning, which uses deep neural networks to end-toend the interaction relationship of multi-agent within the UAV group, and adaptively learn to update the strategy by interacting with the environment to improve the generalization ability of the model.…”
Section: Related Workmentioning
confidence: 99%
“…These studies are usually model-based traditional optimization algorithms, which require a large amount of a priori knowledge to construct an accurate mathematical model, a situation that leads to poor generalization ability of the model and makes it difficult to adapt to the new states that appear in dynamic environments [18] . To cope with the UAV group task migration problem in this dynamic task environment, some researches have proposed task migration methods based on multi-agent reinforcement learning, which uses deep neural networks to end-toend the interaction relationship of multi-agent within the UAV group, and adaptively learn to update the strategy by interacting with the environment to improve the generalization ability of the model.…”
Section: Related Workmentioning
confidence: 99%
“…A number of computations are redirected to edge devices that are more suited to handle them, such as those with GPUs, or to devices with bigger energy reserves, or even directly to the cloud. These edge systems are equipped to monitor energy usage and can intelligently distribute tasks to suitable edge devices using offloading algorithms, often integrating machine learning methods for optimized decision-making [102,104].…”
Section: Energymentioning
confidence: 99%
“…Based on above-mentioned discussions, we propose the AB-MAPPO algorithm, which is summarized in Algorithm 1. The complexity of attention module is O(I 2 V ), where V is the length of feature-value vectors, according to [35]. For an MLP, the computational complexity…”
Section: Complexity Analysismentioning
confidence: 99%