Journal on Communications ›› 2023, Vol. 44 ›› Issue (7): 207-217.doi: 10.11959/j.issn.1000-436x.2023130

• Correspondences • Previous Articles    

Dynamic task scheduling method for relay satellite networks based on hierarchical reinforcement learning

Runzi LIU1, Tianci MA1, Weihua WU2, Chenhong YAO1, Qinghai YANG3   

  1. 1 School of Information and Control Engineering, Xi’an University of Architecture and Technology, Xi’an 710399, China
    2 School of Physics and Information Technology, Shaanxi Normal University, Xi’an 710119, China
    3 School of Telecommunications Engineering, Xidian University, Xi’an 710071, China
  • Revised:2023-07-04 Online:2023-07-01 Published:2023-07-01
  • Supported by:
    The National Natural Science Foundation of China(61701365);The National Natural Science Foundation of China(61801365);The National Natural Science Foundation of China(61971327);Natural Science Basic Research Program of Shaanxi(2023-JC-YB-566);Natural Science Basic Research Program of Shaanxi(2023-JC-YB-542);Key Research and Development Program of Shaanxi Province(2021GY-066);Young Talent Fund of University Association for Science and Technology of Shaanxi Province(20200112);Postdoctoral Foundation of Shaanxi Province(2018BSHEDZZ47)

Abstract:

In recent years, with the increasing number of various emergency tasks, how to control the impact on common tasks while ensuring system revenue has become a huge challenge for the dynamic scheduling of relay satellite networks.Aiming at this problem, with the goal of maximizing the total revenue of emergency tasks and minimizing the damage to common tasks, a dynamic task scheduling method for relay satellite networks based on hierarchical reinforcement learning was proposed.Specifically, in order to take into account the long-term and short-term performance of the system at the same time, a two-layer scheduling framework implemented by upper-level and lower-level DQN was designed.The upper-level DQN was responsible for determining the temporary optimization goal based on long-term performance, and the lower-level DQN determined the scheduling strategy for current task according to the optimization goal.Simulation results show that compared with traditional deep learning methods and the heuristic methods dealing with dynamic scheduling problems, the proposed method can improve the total revenue of urgent tasks while reducing the damage to common tasks.

Key words: relay satellite networks, task scheduling, deep reinforcement learning, multi-objective optimization, dynamic scheduling

CLC Number: 

No Suggested Reading articles found!