2020
DOI: 10.1109/lnet.2020.3000334
|View full text |Cite
|
Sign up to set email alerts
|

Link-Level Throughput Maximization Using Deep Reinforcement Learning

Abstract: A multi-agent deep reinforcement learning framework is proposed to address link level throughput maximization by power allocation and modulation and coding scheme (MCS) selection. Given the complex problem space, reward shaping is utilized instead of classical training procedures. The time-frame utilities are decomposed into subframe rewards, and a stepwise training procedure is proposed, starting from a simplified power allocation setup without MCS selection, incorporating MCS selection gradually, as the agen… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 9 publications
0
4
0
Order By: Relevance
“…Since the probability density function of the Rayleigh distribution is known, probabilities corresponding to the defined SNR, hence CQI, intervals can be calculated. In conclusion, using the packet arrival probabilities and state transitions expressed in (18) and (20), and CQI probabilities, we can obtain P n l ss ′ for all states and all actions. We remark that the formulated MDP has a countable-state space considering both ∆ q (l) ∈ {0, 1, .…”
Section: Adaptive Blocklength Selection For Minimizing Age Violation ...mentioning
confidence: 95%
See 2 more Smart Citations
“…Since the probability density function of the Rayleigh distribution is known, probabilities corresponding to the defined SNR, hence CQI, intervals can be calculated. In conclusion, using the packet arrival probabilities and state transitions expressed in (18) and (20), and CQI probabilities, we can obtain P n l ss ′ for all states and all actions. We remark that the formulated MDP has a countable-state space considering both ∆ q (l) ∈ {0, 1, .…”
Section: Adaptive Blocklength Selection For Minimizing Age Violation ...mentioning
confidence: 95%
“…Some works in the literature also use RL techniques for AMC to optimize traditional performance metrics such as throughput [17], [18] and spectral efficiency [19]. However, none of them consider dynamic MCS selection in AoIaware systems.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Additionally, the application of model-based reinforcement learning technology in 5G NR has been explored in [7] , which exhibits reduced communication delay compared to conventional methods. The literature [8] proposes a multi-agent deep reinforcement learning (DRL) framework to enhance throughput through power allocation and modulation coding scheme selection. Furthermore, [9] proposes a transmitter modulation coding scheme selection model, QL-AMC, which is derived from the Q-learning algorithm, resulting in enhanced spectral efficiency of the system.…”
Section: Introductionmentioning
confidence: 99%