Resource allocation based on deep reinforcement learning in D2D communication
DOI:
Author:
Affiliation:

1.Nanjing University of Information Science & Technology,Nanjing 210044, China; 2.Wuxi University,Wuxi 214105, China; 3.Key Laboratory of Advanced Control of Light Industry Process, Ministry of Education, Jiangnan University, Wuxi 214122, China; 4.State Key Laboratory of Network and Switching Technology, Beijing University of Posts and Telecommunications,Beijing 100876, China

Clc Number:

TN929.5

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    Device to device (D2D) communication can be based on cellular facilities to improve resource utilization, user throughput and save battery energy. In D2D network, mode selection and resource allocation are the key issues. In order to improve the sum rate and spectrum efficiency of D2D communication, a scheme of joint mode selection, power and resource block allocation is proposed. Firstly, the mode selection criteria are selected according to the user′s geographical location to help the user select the corresponding communication mode; Then, for the multiplexing communication mode, the asynchronous dominant action evaluation (A3C) algorithm based on deep reinforcement learning is used to allocate resource blocks and power to different D2D users. The simulation results show that the joint optimization scheme based on A3C algorithm proposed in this paper has fast convergence speed and better performance than other algorithms.

    Reference
    Related
    Cited by
Get Citation
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:
  • Revised:
  • Adopted:
  • Online: March 08,2024
  • Published: