物联网学报 ›› 2022, Vol. 6 ›› Issue (3): 103-112.doi: 10.11959/j.issn.2096-3750.2022.00284

• 理论与技术 • 上一篇    下一篇

面向多目标救援的通信受限无人机集群分布式策略

俞汉清1, 林艳1,2, 贾林琼1, 李强3, 张一晋1   

  1. 1 南京理工大学电子工程与光电技术学院,江苏 南京 210094
    2 东南大学移动通信国家重点实验室,江苏 南京 210096
    3 鹏城实验室,广东 深圳 518000
  • 修回日期:2022-06-15 出版日期:2022-08-05 发布日期:2022-08-08
  • 作者简介:俞汉清(1999- ),男,南京理工大学电子工程与光电技术学院在读,CCF 学生会员,主要研究方向为深度学习、强化学习在无线网络的应用
    林艳(1990- ),女,博士,南京理工大学副教授,主要研究方向为6G无线资源分配、强化学习等
    贾林琼(1989- ),女,博士,南京理工大学讲师,主要研究方向为可见光通信与移动通信
    李强(1973- ),男,博士,鹏城实验室正高级工程师,主要研究方向为物联网与5G/B5G
    张一晋(1982- ),博士,南京理工大学教授,主要研究方向为序列设计、无线网络与人工智能
  • 基金资助:
    国家自然科学基金资助项目(62071236);国家自然科学基金资助项目(62001225);中央高校基本科研业务费资助项目(30920021127);江苏省自然科学基金资助项目(BK20190454);鹏城实验室重大攻关项目(PCL2021A15);东南大学移动通信国家重点实验室开放研究基金资助项目(2022D07)

A distributed strategy for the multi-target rescue using a UAV swarm under communication constraints

Hanqing YU1, Yan LIN1,2, Linqiong JIA1, Qiang LI3, Yijin Zhang1   

  1. 1 School of Electronic and Optical Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
    2 National Mobile Communications Research Laboratory, Southeast University, Nanjing 210096, China
    3 Peng Cheng Laboratory, Shenzhen 518000, China
  • Revised:2022-06-15 Online:2022-08-05 Published:2022-08-08
  • Supported by:
    The National Natural Science Foundation of China(62071236);The National Natural Science Foundation of China(62001225);The Fundamental Research Funds for the Central Universities of China(30920021127);The Natural Science Foundation of Jiangsu Province(BK20190454);The Major Key Project of PCL(PCL2021A15);The Open Research Fund of National Mobile Communications Research Laboratory, Southeast University(2022D07)

摘要:

现有无人机集群的协同决策设计所依据的信息共享缺乏对无人机之间通信能力的合理假设。针对电量、载荷和路线约束下的无人机集群多目标救援问题,结合无人机飞行路线,考虑通信能力对无人机之间信息共享的限制。首先,将问题建模成部分可观测马尔可夫决策过程;然后,利用循环神经网络提出基于深度强化学习的能够适应通信拓扑结构不断变化的分布式救援策略。仿真结果表明,所提策略相较于其他策略在通信受限的情况下具有更佳的分布式救援性能,无人机数量和无人机通信能力需要依据救援场景进行联合设置方能达到无人机集群救援性能和使用成本的最佳折中。

关键词: 无人机, 多目标救援, 马尔可夫决策过程, 分布式策略, 强化学习

Abstract:

The current designs of the cooperative decision-making of an unmanned aerial vehicle (UAV) swarm usually adopt unreasonable assumptions on the communication ability between UAVs.Focusing on a multi-target rescue problem of a UAV swarm under constraints of energy, load and path, the limitation on the information sharing due to the communication constraints and the flight path of UAVs were taken into account.Firstly, the problem was formulated as a partially observable Markov decision process (POMDP).Then, a recurrent neural network was used to propose a deep-reinforcement-learning-based distributed rescue strategy, which is able to adapt to the changeable communication topology.Simulation results show that the proposed strategy outperforms other strategies under communication constraints, and further show that a careful joint setting of the size and communication ability of a UAV swarm is needed to achieve the best compromise between the UAV swarm rescue performance and the cost.

Key words: unmanned aerial vehicle, multi-target rescue, Markov decision process, distributed strategy, reinforcement learning

中图分类号: 

No Suggested Reading articles found!