2022
DOI: 10.1109/tmc.2022.3208457
|View full text |Cite
|
Sign up to set email alerts
|

Evolutionary Multi-Objective Reinforcement Learning Based Trajectory Control and Task Offloading in UAV-Assisted Mobile Edge Computing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
10

Relationship

0
10

Authors

Journals

citations
Cited by 53 publications
(15 citation statements)
references
References 45 publications
0
15
0
Order By: Relevance
“…From an Environmental aspect , utilizing vehicles with high carbon footprint causes an international crisis of global warming. Though it’s unavoidable to use airplanes for transportation of products from long-distance locations, truck delivery and especially for products that are not large in terms of their size can be alternated with a green vehicle like robots and drones (Farajzadeh et al, 2020; Moadab et al, 2022 , Song et al, 2022 ). The environmental cost is not negligible, and the harmful effects of carbon emission of the operation can cause more damage to the economy and involve more people.…”
Section: Sensitivity Analysis and Discussionmentioning
confidence: 99%
“…From an Environmental aspect , utilizing vehicles with high carbon footprint causes an international crisis of global warming. Though it’s unavoidable to use airplanes for transportation of products from long-distance locations, truck delivery and especially for products that are not large in terms of their size can be alternated with a green vehicle like robots and drones (Farajzadeh et al, 2020; Moadab et al, 2022 , Song et al, 2022 ). The environmental cost is not negligible, and the harmful effects of carbon emission of the operation can cause more damage to the economy and involve more people.…”
Section: Sensitivity Analysis and Discussionmentioning
confidence: 99%
“…In both MADDPG and MAPPO, the critic network has a global view of the system, which is only applied during the training phase and actor networks are employed for each agent. An improved version of DDPG is proposed in [44] to exploit temporal correlations and improved computation cost. A major bottleneck of these algorithms is the scalability due to the shared critic network, even if the PS is considered.…”
Section: Suitability Of Marl For Ra Scheme Designmentioning
confidence: 99%
“…The results of the experiment demonstrate the effectiveness and superiority of the proposed algorithm with respect to existing approaches. Fuhong Song et al have proposed an approach which uses multi-objective reinforcement learning for optimizing the UAV's trajectory and offloading decisions in real-time [18]. The algorithm considers multiple objectives, including minimizing energy consumption, maximizing data processing efficiency while maintaining a stable network connection.…”
Section: Related Workmentioning
confidence: 99%