2023
DOI: 10.1109/lwc.2022.3217316
|View full text |Cite
|
Sign up to set email alerts
|

Dynamic Resource Allocation With Deep Reinforcement Learning in Multibeam Satellite Communication

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 16 publications
(4 citation statements)
references
References 15 publications
0
4
0
Order By: Relevance
“…The load-balancing traffic scheduling scheme based on deep reinforcement learning (DRL) [59] applies the MDP paradigm in the decision-making process, transforming the traffic scheduling problem in SAGIN into an improved max-flow problem. The twin delayed DDPG (TD3) algorithm [60] is used for the joint allocation of subchannels and power in SAGIN, aiming to optimize the system's digital bandwidth resources while ensuring packet latency and reliability.…”
Section: B Markov Decision Processmentioning
confidence: 99%
See 2 more Smart Citations
“…The load-balancing traffic scheduling scheme based on deep reinforcement learning (DRL) [59] applies the MDP paradigm in the decision-making process, transforming the traffic scheduling problem in SAGIN into an improved max-flow problem. The twin delayed DDPG (TD3) algorithm [60] is used for the joint allocation of subchannels and power in SAGIN, aiming to optimize the system's digital bandwidth resources while ensuring packet latency and reliability.…”
Section: B Markov Decision Processmentioning
confidence: 99%
“…Liu et al [114] resorted to the reparameterized deep deterministic policy gradient (RPDDPG) algorithm to resolve the adaptive transmission strategy problem (ATSP) in SAGIN, aiming to maximize system throughput while satisfying latency and reliability requirements for data packets. Deng et al [60] proposed a twin delayed DDPG (TD3)-based DRL algorithm for the joint allocation of subchannels and power. This algorithm is applied to tackle the ATSP in SAGIN with the goal of optimizing system throughput while ensuring packet latency and reliability.…”
Section: Deep Reinforcement Learningmentioning
confidence: 99%
See 1 more Smart Citation
“…10 Based on GSO multibeam satellites, researchers proposed deep reinforcement learning algorithms based on double delay deep deterministic policy gradients to reduce training complexity (compared to previous genetic algorithms and Q-learning), integrate four algorithms (independent training, priority experience replay, scaling factor, and noise backreaction) to overcome constraints effects thereby enhancing selection of user fairness, and to further improve baseline scenarios. 11 It is worth noting that the experimental results are obtained under the premise of fixing many influencing factors, such as orbit height, number of beams, etc., and further validation is needed to see whether they apply to the changing conditions in reality. For LEO satellites with higher real-time requirements, heuristic algorithms such as genetic algorithms cannot achieve real-time scheduling due to slow convergence.…”
Section: Work Related To Ai-based Communication Resource Allocationmentioning
confidence: 99%