Space-Integrated-Ground Information Networks ›› 2023, Vol. 4 ›› Issue (1): 12-22.doi: 10.11959/j.issn.2096-8930.2023002

Special Issue: 大规模星座组网及测控关键技术

• Special Issue: Key Technologies for Networking and TT&C in Large Scale Constellations • Previous Articles     Next Articles

Resource Scheduling Method for Integration of TT&C and Observation Based on Multi-Agent Deep Reinforcement Learning

Siyue CHENG, Haoran LI, Weigang BAI, Di ZHOU, Yan ZHU   

  1. School of Telecommunications Engineering, Xidian University, Xi’an 710071,China
  • Revised:2023-03-01 Online:2023-03-20 Published:2023-03-01
  • Supported by:
    National Key Research&Development Program of China(2020YFB1806100);National Natural Science Foundation of China(62101410);The Foundation of Shaanxi Province(QCYRCXM-2022-228)

Abstract:

With the development of satellite communication technology and the continuous expansion of the constellation scale, the integration of TT&C and observation technology has become the mainstream trend.The large constellation scale, many scheduling objects and complex operation joint control bring great challenges to the integrated resource scheduling of satellite network TT&C and observation.Subject to the low solution effi ciency and complex constraints of scheduling algorithms, the traditional TT&C resource scheduling technology adopts the advance injection TT&C instructions to perform tasks according to the fi xed deployment, which is diffi cult to meet the scheduling needs of emergencies and emergency tasks.Therefore, a kind of resource scheduling method based on multi-agent actor-Agent Actor-Critic Deterministic Policy Gradient Algorithms (MADDPG) was presented.With centralized training and distributed execution, the multi-agent model of integrated task of TT&C and observation was established.By analyzed the scheduling strategy of neighbor agent, the response speed of local information was improved.According to the model and constraints in the integrated resource scheduling problem of TT&C and observation, selected signifi cant and interpretable constraints, then established the multi-agent resource scheduling reinforcement learning model, and carried on the simulation test.The simulation results showed that the task benefi t of this method was 22% higher than the traditional method.

Key words: integration of TT&C and observation, large-scale constellation system, resources scheduling, multi-agent deep reinforcement learning, tasks reward

CLC Number: 

No Suggested Reading articles found!