2022
DOI: 10.1109/lcomm.2021.3140155
|View full text |Cite
|
Sign up to set email alerts
|

Long-Term CSI-Based Design for RIS-Aided Multiuser MISO Systems Exploiting Deep Reinforcement Learning

Abstract: In this paper, we study the transmission design for reconfigurable intelligent surface (RIS)-aided multiuser communication networks. Different from most of the existing contributions, we consider long-term CSI-based transmission design, where both the beamforming vectors at the base station (BS) and the phase shifts at the RIS are designed based on long-term CSI, which can significantly reduce the channel estimation overhead. Due to the lack of explicit ergodic data rate expression, we propose a novel deep det… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2

Relationship

2
6

Authors

Journals

citations
Cited by 15 publications
(8 citation statements)
references
References 18 publications
0
8
0
Order By: Relevance
“…shows the convergence behavior of the proposed method. In order to better demonstrate the convergence performance of the proposed algorithm, instant reward and average reward are considered, among which the average reward is defined as follows [51]:…”
Section: B Convergencementioning
confidence: 99%
“…shows the convergence behavior of the proposed method. In order to better demonstrate the convergence performance of the proposed algorithm, instant reward and average reward are considered, among which the average reward is defined as follows [51]:…”
Section: B Convergencementioning
confidence: 99%
“…On the contrary, deep reinforcement learning (DRL) is a novel approach that combines deep learning (DL) and reinforcement learning (RL). It has been proven to be a significant breakthrough in non-convex optimization problems, including hybrid beamforming design [ 22 ], spectrum intelligence sensing [ 23 ], channel state estimation [ 24 ], and power allocation strategy optimization [ 25 ]. Compared with deep learning (DL), the DRL algorithm does not require a large amount of training labeled data as inputs and is therefore very friendly for the optimization of wireless communication systems, where obtaining data is more tedious.…”
Section: Introductionmentioning
confidence: 99%
“…A dynamic power allocation problem with the time-varying channel is illustrated in [24] with a single transmit antenna, further studied in [25] by involving transmit beamforming in consideration and extended into multi-user scenario in [32]. Due to the appealing features of flexible deployment and sustainability in low power consumption, beamforming design of reconfigurable intelligent surface (RIS)-aided communications is proposed in [27]- [29], [34] to reduce computations compared with the alternating framework but requires unaffordable signaling overhead and complexity to obtain CSI. In terms of active beamforming using DRL, several efforts have been made on designing low complexity algorithms based on deep Q-network (DQN) [25], [31]- [33] and partially observed MDP [35] frameworks.…”
Section: Introductionmentioning
confidence: 99%
“…In terms of active beamforming using DRL, several efforts have been made on designing low complexity algorithms based on deep Q-network (DQN) [25], [31]- [33] and partially observed MDP [35] frameworks. However, existing works [24]- [34] assume that perfect CSIT or instantaneous channel gain via receiver feedback is known at the transmitter. Unfortunately, such an assumption is impractical in real-world systems with CSI feedback/acquisition delay and user mobility [6], [7].…”
Section: Introductionmentioning
confidence: 99%