2023
DOI: 10.1109/tmc.2023.3243910
|View full text |Cite
|
Sign up to set email alerts
|

Online Reinforcement Learning for Beam Tracking and Rate Adaptation in Millimeter-wave Systems

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(2 citation statements)
references
References 40 publications
0
2
0
Order By: Relevance
“…In [107], a limited feedback channel is assumed in order to reproduce real-world scenarios, so a limited CSI is used by a DNN regression for beam allocation, resulting in near-optimal performance in the −10 up to 20 dB SNR regimen. The authors in [108] propose the use of standard ACK/NACK messages transmitted by the UE to the BS during the Hybrid Automatic Repeat Request (HARQ) procedure as the input to an online RL scheme to lower the signaling overhead required for beam tracking and rate adaptation. In [109], the RSRP reported by the UE is used to feed an ML-assisted beam change prediction scheme based on Long Short-Term Memory (LSTM) and helps saving more than half the power used by the UE for Beam Management (BM) compared to other methods.…”
Section: Beam Selection In Mimo Systemsmentioning
confidence: 99%
“…In [107], a limited feedback channel is assumed in order to reproduce real-world scenarios, so a limited CSI is used by a DNN regression for beam allocation, resulting in near-optimal performance in the −10 up to 20 dB SNR regimen. The authors in [108] propose the use of standard ACK/NACK messages transmitted by the UE to the BS during the Hybrid Automatic Repeat Request (HARQ) procedure as the input to an online RL scheme to lower the signaling overhead required for beam tracking and rate adaptation. In [109], the RSRP reported by the UE is used to feed an ML-assisted beam change prediction scheme based on Long Short-Term Memory (LSTM) and helps saving more than half the power used by the UE for Beam Management (BM) compared to other methods.…”
Section: Beam Selection In Mimo Systemsmentioning
confidence: 99%
“…In [104], a limited feedback channel is assumed in order to reproduce real-world scenarios, so a limited CSI is used by a DNN regression for beam allocation, resulting in near-optimal performance in the −10 up to 20 dB SNR regimen. The authors in [105] propose the use of standard ACK/NACK messages transmitted by the UE to the BS during the Hybrid Automatic Repeat Request (HARQ) procedure as input to an online RL scheme to lower the signaling overhead required for beam tracking and rate adaptation. In [106], the RSRP reported by the UE is used to feed a ML assisted beam change prediction scheme based on Long Short-Term Memory (LSTM), and helps saving more than half the power used by the UE for Beam Management (BM) compared to other methods.…”
Section: Beam Selection In Mimo Systemsmentioning
confidence: 99%