Spectrum-efficient user grouping and resource allocation based on deep reinforcement learning for mmWave massive MIMO-NOMA systems.

Minghao Wang, Xin Liu, Fang Wang, Yang Liu, Tianshuang Qiu, Minglu Jin
Author Information
  1. Minghao Wang: College of Electronic Information Engineering, Inner Mongolia University, Hohhot, 010021, China.
  2. Xin Liu: College of Electronic Information Engineering, Inner Mongolia University, Hohhot, 010021, China.
  3. Fang Wang: College of Electronic Information Engineering, Inner Mongolia University, Hohhot, 010021, China.
  4. Yang Liu: College of Electronic Information Engineering, Inner Mongolia University, Hohhot, 010021, China. yangliu@imu.edu.cn.
  5. Tianshuang Qiu: Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian, 116024, China.
  6. Minglu Jin: Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian, 116024, China.

Abstract

Millimeter-wave (mmWave) massive multiple-input multiple-output non-orthogonal multiple access (MIMO-NOMA) is proven to be a primary technique for sixth-generation (6G) wireless communication networks. However, the great increase in users and antennas brings challenges for interference suppression and resource allocation for mmWave massive MIMO-NOMA systems. This study proposes a spectrum-efficient and fast convergence deep reinforcement learning (DRL)-based resource allocation framework to optimize user grouping and allocation of subchannel and power. First, an enhanced K-means grouping algorithm is proposed to reduce the multi-user interference and accelerate the convergence. Then, a dueling deep Q-network (DQN) structure is proposed to perform subchannel allocation, which further improves the convergence speed. Moreover, a deep deterministic policy gradient (DDPG)-based power resource allocation algorithm is designed to avoid the performance loss caused by power quantization and improve the system's achievable sum-rate. The simulation results demonstrate that our proposed scheme outperforms other neural network-based algorithms in terms of convergence performance, and can achieve higher system capacity compared with the greedy algorithm, the random algorithm, the RNN algorithm, and the DoubleDQN algorithm.

References

  1. Nature. 2015 Feb 26;518(7540):529-33 [PMID: 25719670]
  2. IEEE Internet Things J. 2022 Jun 15;9(12): [PMID: 38486943]

Grants

  1. 62161037/National Natural Science Foundation of China
  2. 62071257/National Natural Science Foundation of China
  3. 2023JQ17/Natural Science Foundation of Inner Mongolia Autonomous Region

Word Cloud

Created with Highcharts 10.0.0allocationalgorithmresourceconvergencedeepmmWavemassiveMIMO-NOMAgroupingpowerproposedinterferencesystemsreinforcementlearning-basedusersubchannelperformanceMillimeter-wavemultiple-inputmultiple-outputnon-orthogonalmultipleaccessprovenprimarytechniquesixth-generation6GwirelesscommunicationnetworksHowevergreatincreaseusersantennasbringschallengessuppressionstudyproposesspectrum-efficientfastDRLframeworkoptimizeFirstenhancedK-meansreducemulti-useraccelerateduelingQ-networkDQNstructureperformimprovesspeedMoreoverdeterministicpolicygradientDDPGdesignedavoidlosscausedquantizationimprovesystem'sachievablesum-ratesimulationresultsdemonstrateschemeoutperformsneuralnetwork-basedalgorithmstermscanachievehighersystemcapacitycomparedgreedyrandomRNNDoubleDQNSpectrum-efficientbased

Similar Articles

Cited By

No available data.