Reinforcement Learning-Aided Channel Estimator in Time-Varying MIMO Systems.

Advanced Search

Tae-Kyoung Kim, Moonsik Min

Author Information

Tae-Kyoung Kim: Department of Electronic Engineering, Gachon University, Seongnam 13120, Republic of Korea. ORCID
Moonsik Min: School of Electronic and Electrical Engineering, Kyungpook National University, Daegu 41566, Republic of Korea. ORCID

PMID: 37420854 DOI: 10.3390/s23125689

This paper proposes a reinforcement learning-aided channel estimator for time-varying multi-input multi-output systems. The basic concept of the proposed channel estimator is the selection of the detected data symbol in the data-aided channel estimation. To achieve the selection successfully, we first formulate an optimization problem to minimize the data-aided channel estimation error. However, in time-varying channels, the optimal solution is difficult to derive because of its computational complexity and the time-varying nature of the channel. To address these difficulties, we consider a sequential selection for the detected symbols and a refinement for the selected symbols. A Markov decision process is formulated for sequential selection, and a reinforcement learning algorithm that efficiently computes the optimal policy is proposed with state element refinement. Simulation results demonstrate that the proposed channel estimator outperforms conventional channel estimators by efficiently capturing the variation of the channels.

data-aided channel estimation first-order Gaussian—Markov channel model non-iterative approach reinforcement learning

Sensors (Basel). 2021 Jul 16;21(14): [PMID: 34300599]
Sensors (Basel). 2021 Dec 31;22(1): [PMID: 35009848]
Sensors (Basel). 2022 Jun 09;22(12): [PMID: 35746162]

2021R1F1A1063273/National Research Foundation of Korea
2023R1A2C1004034/National Research Foundation of Korea
4199990113966/Ministry of Education, Korea

Algorithms

Computer Simulation

Markov Chains

Policy

Journal Article

OpenLB
Open Library of Bioscience