2019 IEEE Cognitive Communications for Aerospace Applications Workshop (CCAAW) 2019
DOI: 10.1109/ccaaw.2019.8904901
|View full text |Cite
|
Sign up to set email alerts
|

Deep Reinforcement Learning for Continuous Power Allocation in Flexible High Throughput Satellites

Abstract: Many of the next generation of satellites will be equipped with numerous degrees of freedom in power and bandwidth allocation capabilities, making manual resource allocation impractical. Therefore, it is desirable to automate the operation of these highly flexible satellites. This paper presents a novel approach based on Deep Reinforcement Learning to allocate power in multibeam satellite systems. The proposed architecture represents the problem as continuous state and action spaces. We make use of the Proxima… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
45
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 40 publications
(45 citation statements)
references
References 7 publications
0
45
0
Order By: Relevance
“…In a multibeam system, the power allocation is usually a continuous variable [25]; however, in this article, the power allocation is considered to be performed by selecting one value among several in a set of possible power levels per beam, due to existing technologies that allow power to be modified in 0.5-dB steps [7].…”
Section: Numerical Results and Analysismentioning
confidence: 99%
See 3 more Smart Citations
“…In a multibeam system, the power allocation is usually a continuous variable [25]; however, in this article, the power allocation is considered to be performed by selecting one value among several in a set of possible power levels per beam, due to existing technologies that allow power to be modified in 0.5-dB steps [7].…”
Section: Numerical Results and Analysismentioning
confidence: 99%
“…7). The DRM will attempt to minimize KPI 1 in (24) while maximizing KPI 2 in (25). In this sense, the performance of the algorithm used for the DRM is evaluated by exploiting a joint KPI, defined as…”
Section: B Cnn Performance For Drm and Comparison With Benchmark Algmentioning
confidence: 99%
See 2 more Smart Citations
“…From the four resources introduced above, recent research has focused on power allocation, [4][5][6][7][8] while the other satellite resources such as the frequency assignment and beam placement have been less studied. The latter two problems have been shown to be NP hard, 9,10 which is why brute force algorithms are unfeasible for medium-to large-scale scenarios (ie, >850 beams, as we show later).…”
Section: Literature Reviewmentioning
confidence: 99%