Journal on Communications ›› 2020, Vol. 41 ›› Issue (6): 51-60.doi: 10.11959/j.issn.1000-436x.2020117

• Papers • Previous Articles     Next Articles

Improved satellite resource allocation algorithm based on DRL and MOP

Pei ZHANG1,2,Shuaijun LIU3,Zhiguo MA2(),Xiaohui WANG1,Junde SONG1   

  1. 1 School of Computer Science,Beijing University of Posts and Telecommunications,Beijing 100876,China
    2 China Academy of Information and Communications Technology,Beijing 100191,China
    3 Institute of Software,Chinese Academy of Sciences,Beijing 100190,China
  • Revised:2020-05-20 Online:2020-06-25 Published:2020-07-04
  • Supported by:
    The National Key Research and Development Program of China(2018YFB0105105);The National Science and Technology Major Project of China(2018ZX03001016)

Abstract:

In view of the multi-objective optimization (MOP) problem of sequential decision-making for resource allocations in multi-beam satellite systems,a deep reinforcement learning(DRL) based DRL-MOP algorithm was proposed to improve the system performance and user satisfaction degree.With considering the normalized weighted sum of spectrum efficiency,energy efficiency,and satisfaction index as the optimization goal,the dynamically changing system environments and user arrival model were built by the proposed algorithm,and the optimization of the accumulative performance in satellite systems based on DRL and MOP was realized.Simulation results show that the proposed algorithm can solve the MOP problem with rapid convergence ability and low complexity,and it is obviously superior to other algorithms in terms of system performance and user satisfaction optimization.

Key words: multi-beam satellite system, resource allocation, sequential decision-making, deep reinforcement learning, multi-objective optimization

CLC Number: 

No Suggested Reading articles found!