2022

DOI: 10.3390/s22124379

|View full text |Cite

|

Sign up to set email alerts

|

A Low-Complexity Algorithm for a Reinforcement Learning-Based Channel Estimator for MIMO Systems

Tae‐Kyoung Kim

¹

,

²

Abstract: This paper proposes a low-complexity algorithm for a reinforcement learning-based channel estimator for multiple-input multiple-output systems. The proposed channel estimator utilizes detected symbols to reduce the channel estimation error. However, the detected data symbols may include errors at the receiver owing to the characteristics of the wireless channels. Thus, the detected data symbols are selectively used as additional pilot symbols. To this end, a Markov decision process (MDP) problem is defined to … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Introduction5

Citation Types

Supporting

0

Mentioning

18

Contrasting

0

Year Published

2023

2023

2024

2024

Publication Types

Select...

Article3

Other1

Relationship

Self Cite2

Independent2

Authors

Journals

Cited by 4 publications

(18 citation statements)

References 29 publications

Supporting

0

Mentioning

18

Contrasting

0

Order By: Relevance

“…As a non-iterative approach, the reinforcement learning (RL)-aided channel estimator was introduced in [ 27 , 28 , 29 , 30 , 31 , 32 , 33 ]. The basic concept of this approach is the sequential selection of detected data symbols to minimize the channel estimation errors.…”

Section: Introductionmentioning

confidence: 99%

“…Hence, a Markov decision process (MDP) was defined to solve the sequential selection, and the corresponding optimal policy was derived in a closed-form expression in [ 31 ]. In [ 32 ], a low-complexity algorithm was investigated by introducing sub-blocks and finite backup samples, and the computational complexity and latency were significantly reduced without performance loss. Recently, a general framework for RL-aided channel estimation was studied in [ 33 ] based on Monte Carlo tree search.…”

Section: Introductionmentioning

confidence: 99%

“…Recently, a general framework for RL-aided channel estimation was studied in [ 33 ] based on Monte Carlo tree search. However, the RL-aided channel estimators in [ 31 , 32 , 33 ] were originally considered in time-invariant channels; they perform insufficiently in time-varying channels.…”

Section: Introductionmentioning

confidence: 99%

“…First, we define the optimization problem in time-varying channels to select the detected data symbols and minimize the estimation error between the estimated and current channels. This optimization problem is different from those in [ 31 , 32 , 33 ], where the selection of the detected data symbols is unchanged because the current channel remains unchanged with the time slot index. We propose an RL algorithm for the optimization problem that captures the time-varying nature of a channel.…”

Section: Introductionmentioning

confidence: 99%

“…Using this adjustment, we derive the optimal policy as a closed-form solution. Note that the proposed optimal policy differs from those in [ 31 , 32 , 33 ] because the influence of soft-decision symbols in the virtual state for future rewards gradually diminishes as the time slot index increases. We propose a further performance improvement scheme to refine the state elements.…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Reinforcement Learning-Aided Channel Estimator in Time-Varying MIMO Systems

¹

,

²

2023

Self Cite

View full text Add to dashboard Cite

This paper proposes a reinforcement learning-aided channel estimator for time-varying multi-input multi-output systems. The basic concept of the proposed channel estimator is the selection of the detected data symbol in the data-aided channel estimation. To achieve the selection successfully, we first formulate an optimization problem to minimize the data-aided channel estimation error. However, in time-varying channels, the optimal solution is difficult to derive because of its computational complexity and the time-varying nature of the channel. To address these difficulties, we consider a sequential selection for the detected symbols and a refinement for the selected symbols. A Markov decision process is formulated for sequential selection, and a reinforcement learning algorithm that efficiently computes the optimal policy is proposed with state element refinement. Simulation results demonstrate that the proposed channel estimator outperforms conventional channel estimators by efficiently capturing the variation of the channels.

“…As a non-iterative approach, the reinforcement learning (RL)-aided channel estimator was introduced in [ 27 , 28 , 29 , 30 , 31 , 32 , 33 ]. The basic concept of this approach is the sequential selection of detected data symbols to minimize the channel estimation errors.…”

Section: Introductionmentioning

confidence: 99%

“…Hence, a Markov decision process (MDP) was defined to solve the sequential selection, and the corresponding optimal policy was derived in a closed-form expression in [ 31 ]. In [ 32 ], a low-complexity algorithm was investigated by introducing sub-blocks and finite backup samples, and the computational complexity and latency were significantly reduced without performance loss. Recently, a general framework for RL-aided channel estimation was studied in [ 33 ] based on Monte Carlo tree search.…”

Section: Introductionmentioning

confidence: 99%

“…Recently, a general framework for RL-aided channel estimation was studied in [ 33 ] based on Monte Carlo tree search. However, the RL-aided channel estimators in [ 31 , 32 , 33 ] were originally considered in time-invariant channels; they perform insufficiently in time-varying channels.…”

Section: Introductionmentioning

confidence: 99%

“…First, we define the optimization problem in time-varying channels to select the detected data symbols and minimize the estimation error between the estimated and current channels. This optimization problem is different from those in [ 31 , 32 , 33 ], where the selection of the detected data symbols is unchanged because the current channel remains unchanged with the time slot index. We propose an RL algorithm for the optimization problem that captures the time-varying nature of a channel.…”

Section: Introductionmentioning

confidence: 99%

“…Using this adjustment, we derive the optimal policy as a closed-form solution. Note that the proposed optimal policy differs from those in [ 31 , 32 , 33 ] because the influence of soft-decision symbols in the virtual state for future rewards gradually diminishes as the time slot index increases. We propose a further performance improvement scheme to refine the state elements.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Reinforcement Learning-Aided Channel Estimator in Time-Varying MIMO Systems

¹

,

²

2023

Self Cite

View full text Add to dashboard Cite

This paper proposes a reinforcement learning-aided channel estimator for time-varying multi-input multi-output systems. The basic concept of the proposed channel estimator is the selection of the detected data symbol in the data-aided channel estimation. To achieve the selection successfully, we first formulate an optimization problem to minimize the data-aided channel estimation error. However, in time-varying channels, the optimal solution is difficult to derive because of its computational complexity and the time-varying nature of the channel. To address these difficulties, we consider a sequential selection for the detected symbols and a refinement for the selected symbols. A Markov decision process is formulated for sequential selection, and a reinforcement learning algorithm that efficiently computes the optimal policy is proposed with state element refinement. Simulation results demonstrate that the proposed channel estimator outperforms conventional channel estimators by efficiently capturing the variation of the channels.

DRL at the Physical Layer

2023

Deep Reinforcement Learning for Wireless Communications and Networking

View full text Add to dashboard Cite

No abstract

FAQ: A Fuzzy-Logic-Assisted Q-Learning Model for Resource Allocation in 6G V2X

¹

,

²

,

³

et al. 2024

IEEE Internet Things J.

View full text Add to dashboard Cite

No abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Product

Browser Extension Assistant by scite Citation Statement Search Reference Check Visualizations Dashboards Explore Journals Explore Organizations Explore Funders Embedding Badge Embedding Citation Search Pricing

Resources

Blog Help & FAQ Accessibility Statement API Terms For Universities & Governments For Researchers For Publishers For Corporate, Pharma & Enterprise Author Marketing Become an Affiliate Get an organization trial or quote scite Data & Services

About

News & Press Careers Read our Paper Coverage

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Copyright © 2024 scite LLC. All rights reserved.

Made with 💙 for researchers

Part of the Research Solutions Family.