Reinforcement Learning-Based Joint User Pairing and Power Allocation in MIMO-NOMA Systems.

Jaehee Lee, Jaewoo So
Author Information
  1. Jaehee Lee: Department of Electronic Engineering, Sogang University, Seoul 04107, Korea. ORCID
  2. Jaewoo So: Department of Electronic Engineering, Sogang University, Seoul 04107, Korea. ORCID

Abstract

In this paper, we consider a multiple-input multiple-output (MIMO)-non-orthogonal multiple access (NOMA) system with reinforcement learning (RL). NOMA, which is a technique for increasing the spectrum efficiency, has been extensively studied in fifth-generation (5G) wireless communication systems. The application of MIMO to NOMA can result in an even higher spectral efficiency. Moreover, user pairing and power allocation problem are important techniques in NOMA. However, NOMA has a fundamental limitation of the high computational complexity due to rapidly changing radio channels. This limitation makes it difficult to utilize the characteristics of the channel and allocate radio resources efficiently. To reduce the computational complexity, we propose an RL-based joint user pairing and power allocation scheme. By applying Q-learning, we are able to perform user pairing and power allocation simultaneously, which reduces the computational complexity. The simulation results show that the proposed scheme achieves a sum rate similar to that achieved with the exhaustive search (ES).

Keywords

Grants

  1. 2019R1F1A1058716/National Research Foundation of Korea
  2. 2020R1F1A1065109/National Research Foundation of Korea

Word Cloud

Created with Highcharts 10.0.0NOMAuserpairingpowerallocationcomputationalcomplexitymultiple-inputmultiple-outputMIMOmultipleaccessreinforcementlearningefficiencylimitationradioschemepaperconsider-non-orthogonalsystemRLtechniqueincreasingspectrumextensivelystudiedfifth-generation5GwirelesscommunicationsystemsapplicationcanresultevenhigherspectralMoreoverproblemimportanttechniquesHoweverfundamentalhighduerapidlychangingchannelsmakesdifficultutilizecharacteristicschannelallocateresourcesefficientlyreduceproposeRL-basedjointapplyingQ-learningableperformsimultaneouslyreducessimulationresultsshowproposedachievessumratesimilarachievedexhaustivesearchESReinforcementLearning-BasedJointUserPairingPowerAllocationMIMO-NOMASystemsnon-orthogonal

Similar Articles

Cited By (2)