通信学报 ›› 2024, Vol. 45 ›› Issue (2): 137-149.doi: 10.11959/j.issn.1000-436x.2024032

• 学术论文 • 上一篇    

无人机集群联合拓扑控制的智能路由规划方法

颜志, 易正伦, 欧阳博, 王耀南   

  1. 湖南大学电气与信息工程学院,湖南 长沙 410082
  • 修回日期:2023-11-07 出版日期:2024-02-01 发布日期:2024-02-01
  • 作者简介:颜志(1986− ),男,湖南衡东人,博士,湖南大学副教授、博士生导师,主要研究方向为机器人通信与组网
    易正伦(1998− ),男,广东惠州人,湖南大学硕士生,主要研究方向为无人机集群通信与组网
    欧阳博(1987− ),男,湖南耒阳人,博士,湖南大学副教授、博士生导师,主要研究方向为机器学习、多机器人系统、复杂系统分析与控制、无线通信技术
    王耀南(1957− ),男,云南龙陵人,博士,湖南大学教授、博士生导师,主要研究方向为机器人感知与控制技术
  • 基金资助:
    国家自然科学基金资助项目(62293511);湖南省科技重大专项基金资助项目(2021GK1010);网络与交换技术国家重点实验室(北京邮电大学)开放课题基金资助项目(SKLNST-2021-2-03)

Intelligent route planning method with jointing topology control of UAV swarm

Zhi YAN, Zhenglun YI, Bo OUYANG, Yaonan WANG   

  1. College of Electrical and Information Engineering, Hunan University, Changsha 410082, China
  • Revised:2023-11-07 Online:2024-02-01 Published:2024-02-01
  • Supported by:
    The National Natural Science Foundation of China(62293511);Special Funding Support for the Construction of Innovative Provinces in Hunan Province(2021GK1010);The State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, China(SKLNST-2021-2-03)

摘要:

针对现有无人机集群路由协议拓扑适变能力弱,易产生包重传、能量空洞和高时延,严重恶化了数据路由性能的问题,针对无人机集群中集群拓扑与路由的耦合特性,提出了一种联合拓扑控制的智能路由规划(IRPJTC)方法。该方法由基于虚拟力的自适应拓扑控制(VFATC)和基于近端策略优化的地理路由规划(PPO-GRP)组成。其中,VFATC 使各无人机根据邻居运动状态信息自适应调整与邻居的距离,保证集群中链路的稳定连接;进一步,PPO-GRP引入VFATC中的链路稳定性指标,并结合端到端时延与能耗指标,设计多目标奖励函数,采用深度强化学习中的近端策略优化算法训练路由策略。仿真实验结果表明,IRPJTC 相比于现有路由方法,能在保证分组传输成功率的同时,使端到端时延降低12.11%,无人机集群能耗降低4.56%,且具备更强的能耗均衡能力。

关键词: 无人机集群, 路由协议, 拓扑控制, 近端策略优化, 深度强化学习

Abstract:

Existing routing protocols without awareness of the topology causes excessive retransmissions, energy holes, and long delay, data routing performance was seriously deteriorated.Considering the relation of topology and routing, an intelligent route planning with jointing topology control (IRPJTC) method was proposed.IRPJTC consisted of two part, the virtual force-based adaptive topology control (VFATC), and the PPO-based geographic routing protocol (PPO-GRP).Based on neighbor’s mobility information, the distance between UAVs was adaptively adjusted by VFATC to provide stable links between UAVs.Combined with link stability metric in VFATC, end-to-end delay and energy consumption, a multi-objective reward function was designed by PPO-GRP to train optimal routing strategy.According to the performance study, the proposed IRPJTC reduces existing routing protocols by 12.11% of end-to-end delay, and 4.56% of energy consumption, and has a better energy balance ability.

Key words: UAV swarm, routing protocol, topology control, proximal policy optimization, deep reinforcement learning

中图分类号: 

No Suggested Reading articles found!