2023
DOI: 10.1109/ojcoms.2023.3251297
|View full text |Cite
|
Sign up to set email alerts
|

A Continuous Actor–Critic Deep Q-Learning-Enabled Deployment of UAV Base Stations: Toward 6G Small Cells in the Skies of Smart Cities

Abstract: Uncrewed aerial vehicle-mounted base stations (UAV-BSs), also know as drone base stations, are considered to have promising potential to tackle the limitations of ground base stations. They can provide cost-effective Internet connection to es that are out of infrastructure. They can also take over quickly as service providers when ground base stations fail in an unanticipated manner. UAV-BSs benefit from their mobile nature that enables them to change their 3D locations if the demand profile changes rapidly. I… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 17 publications
(5 citation statements)
references
References 50 publications
0
5
0
Order By: Relevance
“…The first one is to consider an LTE UAV instead of the Multi-RAT UAV. LTE UAVs have been considered in many works as in [29], [30], [54].…”
Section: A Simulation Setupmentioning
confidence: 99%
“…The first one is to consider an LTE UAV instead of the Multi-RAT UAV. LTE UAVs have been considered in many works as in [29], [30], [54].…”
Section: A Simulation Setupmentioning
confidence: 99%
“…The advent of consumer drones, flying ad hoc networks, low-latency 5G, and advancements beyond 5G have significantly accelerated progress in this field. The authors of [118] describe a continuous actor-critic deep Q-learning (ACDQL) strategy to solve the location optimization problem of UAV-BSs in the presence of mobile endpoints, extending the action space of the reinforcement learning (RL) algorithm from discrete to continuous. In detail, a scheme for the dynamic positioning of a UAV-BS in the case of user mobility is proposed, overcoming previous works presented in the literature, which considered fixed locations for the ground endpoints.…”
Section: Uavs and Drone-based Solutionsmentioning
confidence: 99%
“…Q-learning is applied to select patches for leader UAVs, and follower UAVs are dispatched to cover the patches based on a star communication topology. Modifies the TD3 model for online path planning of multiple UAVs Energy efficiency maximization TD3 [173] Uses a DRL approach with flow-level models to determine optimal UAVs' trajectories Network throughput maximization, flow blocking probability minimization PPO [34] Presents a constrained DQN for optimizing the 3D trajectory design of multiple UAVs in a wireless network Downlink capacity maximization DQN [174] Using Q-learning based method to optimize the deployment and dynamic movement of UAVs Mean opinion score of ground users maximization Q-learning [175] Introduces a UAVs cell-free network utilizing DRL to optimize UAV trajectories in limited energy resources and insufficient environment knowledge Vehicle coverage maximization, energy consumption minimization DDPG [176] Presents a MADDPG-absed method for optimizing the trajectory of UAVs in a dynamic communication system Fairness of ground users, energy efficiency, throughput maximization MADDPG [177] Utilizes single-/multi-agent AC algorithms for initial UAV deployment and trajectory design Data downlink rate of users maximization A2C [178] Using heterogeneous graphs to represent the relationship between UAVs and users, and enhance the cooperation of multi-agent through communication Fair throughput maximization Multi-agent DQN [179] Leverage continuous AC deep Q-learning for optimal real-time UAV deployment Sum data rate maximization AC…”
Section: A Navigationmentioning
confidence: 99%
“…In [179], the authors explore the deployment of UAV as aerial BSs to offer downlink Internet connectivity following the incapacitation of ground BS due to natural disasters. Given the mobility of ground endpoints, the authors emphasize the need for real-time deployment of UAVs to enhance network performance.…”
Section: B Performance Enhancement Via Trajectory Optimizationmentioning
confidence: 99%