2022
DOI: 10.1049/cmu2.12501
|View full text |Cite
|
Sign up to set email alerts
|

Deep learning integrated reinforcement learning for adaptive beamforming in B5G networks

Abstract: In this paper, a deep learning integrated reinforcement learning (DLIRL) algorithm is proposed for comprehending intelligent beamsteering in Beyond Fifth Generation (B5G) networks. The smart base station in B5G networks aims to steer the beam towards appropriate user equipment based on the acquaintance of isotropic transmissions. The foremost methodology is to optimize beam direction through reinforcement learning that delivers significant improvement in signal to noise ratio (SNR). This includes alternate pat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2023
2023
2025
2025

Publication Types

Select...
5
1
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 11 publications
(5 citation statements)
references
References 35 publications
0
5
0
Order By: Relevance
“…On the other hand, Elbir et al [169] generate the precoder from artificial channels using a convolutional neural network, achieving better results than the heuristic, deep learning, and Multilayer Perceptron (MLP) solutions that were compared in the article. However, samples of real-world network indicators are abundant in most cases, such as AoA and Angle of Departure (AoD) [170], the pilots present in different frame configurations [171,172], and samples from the channel [173] and, therefore, can also be used to train neural networks and result in more accurate precoders, tailored to specific conditions.…”
Section: Precoding and Combining In Mimo With Hybrid Or Digital Archi...mentioning
confidence: 99%
See 1 more Smart Citation
“…On the other hand, Elbir et al [169] generate the precoder from artificial channels using a convolutional neural network, achieving better results than the heuristic, deep learning, and Multilayer Perceptron (MLP) solutions that were compared in the article. However, samples of real-world network indicators are abundant in most cases, such as AoA and Angle of Departure (AoD) [170], the pilots present in different frame configurations [171,172], and samples from the channel [173] and, therefore, can also be used to train neural networks and result in more accurate precoders, tailored to specific conditions.…”
Section: Precoding and Combining In Mimo With Hybrid Or Digital Archi...mentioning
confidence: 99%
“…The authors propose a new way of combining DL and RL for beamforming, leveraging high spectral efficiency and overall beamforming effectiveness. [172] ine Dynamic subarrays AHC Proposed hybrid precoding, which can efficiently avoid mutually correlated metrics.…”
Section: Drlmentioning
confidence: 99%
“…On the other hand, Elbir et al [164] generated precoder from artificial channels using a convolutional neural network, achieving better results than the heuristic, deep learning, and Multilayer Perceptrons (MLP) solutions that were compared in the article. However, samples of real-world network indicators are abundant in most cases, such as AoA and Angle of Departure (AoD) [165], the pilots present in different frame configurations [166,167], and samples from the channel [168], and, therefore, can also be used to train neural networks and result in more accurate precoders, tailored to specific conditions.…”
Section: Mlpmentioning
confidence: 99%
“…RL is then to optimize the beam direction, whose state space, action space, reward, and learning algorithm are defined in [5]. Initial simulation is performed on MATLAB 2021a siteviewer for radio transmission (RT) using site viewer and simultaneous channel generation [6], but it could equally well have used Winprop, which models more accurate geometries and accommodates more surface properties [5]. The output of RT is fed for channel generation, which in turn produces input for the DNN for training.…”
Section: Distance Measurement and Localization Using Aimentioning
confidence: 99%
“…If a beamformer steers the main beam in a particular direction (θ , ϕ), then the directional accuracy is defined as ±δθ, ±δϕ from the maximum signal-to-noise ratio (SNR) of the main lobe. The proposed deep learning integrated reinforcement learning (DLIRL) beamforming has an angle-of-departure (AoD) accuracy towards the UE location with a deviation of ±2 • , whereas RL has a deviation of ±3 • and DNN's deviation is ±5 • [6] is required to measure the ToA at accuracy levels of ∼33 picoseconds (calculated from 1 cm/speed of light).…”
Section: Distance Measurement and Localization Using Aimentioning
confidence: 99%