2022
DOI: 10.3390/s22041630
|View full text |Cite
|
Sign up to set email alerts
|

Deep Q-Learning-Based Transmission Power Control of a High Altitude Platform Station with Spectrum Sharing

Abstract: A High Altitude Platform Station (HAPS) can facilitate high-speed data communication over wide areas using high-power line-of-sight communication; however, it can significantly interfere with existing systems. Given spectrum sharing with existing systems, the HAPS transmission power must be adjusted to satisfy the interference requirement for incumbent protection. However, excessive transmission power reduction can lead to severe degradation of the HAPS coverage. To solve this problem, we propose a multi-agent… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
9
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(9 citation statements)
references
References 17 publications
0
9
0
Order By: Relevance
“…In problem (6), constraints (6b) and (6c) limit the maximum transmit power of the MBS and HAPS. Constraint (6d) ensures that orthogonal subcarrier allocation has been preserved within each MBS and HAPS.…”
Section: System Model and Problem Formulationmentioning
confidence: 99%
See 2 more Smart Citations
“…In problem (6), constraints (6b) and (6c) limit the maximum transmit power of the MBS and HAPS. Constraint (6d) ensures that orthogonal subcarrier allocation has been preserved within each MBS and HAPS.…”
Section: System Model and Problem Formulationmentioning
confidence: 99%
“…Due to the presence of binary variable F, and non-convex objective function and constraint (6d), problem ( 6) is a mixedinteger non-linear program (MINLP), which is challenging to solve for the global solution. Thus, in the following section, we develop a SCA-based rapid converging iterative algorithm to solve (6) for the sub-optimal solution. To this end, we apply the reformulation-linearization techniques (RLT) to transform the MINLP into an equivalent convex problem, which is then solved in an iterative manner until it converges to the sub-optimal solution.…”
Section: System Model and Problem Formulationmentioning
confidence: 99%
See 1 more Smart Citation
“…Jiang et al [ 21 ] propose a value-iteration-based RL algorithm, which can efficiently converge to stable strategies and significantly improve network performance. Jo et al [ 22 ] propose a multi-agent Deep Q-learning (DQL)-based transmission power control algorithm which minimizes the outage probability of the High-Altitude Platform Station downlink. However, existing multi-agent algorithms are capable of static target search tasks and have poor performance for moving and invisible objects.…”
Section: Introductionmentioning
confidence: 99%
“…Ref. [8] proposed a multi-agent DQL (Deep Q-learning)-based transmission power control algorithm to minimize the outage probability while satisfying the interference requirement of an interfered system. To deal with the potential risk of action-value overestimation from the DQL, they developed even a DDQL (Double DQL).…”
mentioning
confidence: 99%