Chinese Journal on Internet of Things ›› 2022, Vol. 6 ›› Issue (3): 103-112.doi: 10.11959/j.issn.2096-3750.2022.00284

• Theory and Technology • Previous Articles     Next Articles

A distributed strategy for the multi-target rescue using a UAV swarm under communication constraints

Hanqing YU1, Yan LIN1,2, Linqiong JIA1, Qiang LI3, Yijin Zhang1   

  1. 1 School of Electronic and Optical Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
    2 National Mobile Communications Research Laboratory, Southeast University, Nanjing 210096, China
    3 Peng Cheng Laboratory, Shenzhen 518000, China
  • Revised:2022-06-15 Online:2022-08-05 Published:2022-08-08
  • Supported by:
    The National Natural Science Foundation of China(62071236);The National Natural Science Foundation of China(62001225);The Fundamental Research Funds for the Central Universities of China(30920021127);The Natural Science Foundation of Jiangsu Province(BK20190454);The Major Key Project of PCL(PCL2021A15);The Open Research Fund of National Mobile Communications Research Laboratory, Southeast University(2022D07)

Abstract:

The current designs of the cooperative decision-making of an unmanned aerial vehicle (UAV) swarm usually adopt unreasonable assumptions on the communication ability between UAVs.Focusing on a multi-target rescue problem of a UAV swarm under constraints of energy, load and path, the limitation on the information sharing due to the communication constraints and the flight path of UAVs were taken into account.Firstly, the problem was formulated as a partially observable Markov decision process (POMDP).Then, a recurrent neural network was used to propose a deep-reinforcement-learning-based distributed rescue strategy, which is able to adapt to the changeable communication topology.Simulation results show that the proposed strategy outperforms other strategies under communication constraints, and further show that a careful joint setting of the size and communication ability of a UAV swarm is needed to achieve the best compromise between the UAV swarm rescue performance and the cost.

Key words: unmanned aerial vehicle, multi-target rescue, Markov decision process, distributed strategy, reinforcement learning

CLC Number: 

No Suggested Reading articles found!