通信学报 ›› 2023, Vol. 44 ›› Issue (9): 12-24.doi: 10.11959/j.issn.1000-436x.2023179

• 学术论文 • 上一篇    

智能电网中基于多智能体强化学习的频谱分配算法

燕锋1, 林晓薇2, 李正浩3, 徐霞4, 夏玮玮1, 沈连丰1   

  1. 1 东南大学移动通信全国重点实验室,江苏 南京 210096
    2 东南大学软件学院,江苏 南京 211100
    3 国网山东省电力公司信息通信公司,山东 济南 250001
    4 国网山东省电力公司济南供电公司,山东 济南 250012
  • 修回日期:2023-09-04 出版日期:2023-09-01 发布日期:2023-09-01
  • 作者简介:燕锋(1983- ),男,湖北天门人,博士,东南大学副教授,主要研究方向为无人机自组网、卫星互联网、无线传感器网络等
    林晓薇(1999- ),女,广西桂林人,东南大学硕士生,主要研究方向为无线网络资源管理、强化学习应用等
    李正浩(1991- ),男,山东济南人,国网山东省电力公司信息通信公司工程师,主要研究方向为云计算、5G通信、数字化等方面
    徐霞(1986- ),女,山东成武人,国网山东省电力公司济南供电公司高级工程师,主要研究方向为电力系统智慧物联网、网络资源优化等
    夏玮玮(1975- ),女,江苏句容人,博士,东南大学副研究员,主要研究方向为无线网络资源管理、边缘计算、泛在网络与短距离无线通信等
    沈连丰(1952- ),男,江苏邳州人,东南大学教授、博士生导师,主要研究方向为宽带移动通信、短距离无线通信和泛在网络等
  • 基金资助:
    国家电网有限公司科技基金资助项目(520601220022)

Spectrum allocation algorithm based on multi-agent reinforcement learning in smart grid

Feng YAN1, Xiaowei LIN2, Zhenghao LI3, Xia XU4, Weiwei XIA1, Lianfeng SHEN1   

  1. 1 National Mobile Communications Research Laboratory, Southeast University, Nanjing 210096, China
    2 School of Software, Southeast University, Nanjing 211100, China
    3 State Grid Shandong Information and Telecommunication Company, Jinan 250001, China
    4 State Grid Jinan Power Supply Company, Jinan 250012, China
  • Revised:2023-09-04 Online:2023-09-01 Published:2023-09-01
  • Supported by:
    The Science and Technology Project of State Grid Corporation of China(520601220022)

摘要:

针对智能电网中利用5G网络承载多样化电力终端的业务需求,提出了一种基于多智能体强化学习的频谱分配算法。首先,基于智能电网中部署的集成接入回程系统,考虑智能电网中轻量化和非轻量化终端业务的不同通信需求,将频谱分配问题建模为最大化系统总能效的非凸混合整数规划。其次,将前述问题构建为一个部分可观测的马尔可夫决策过程并转换为完全协作的多智能体问题,进而提出了一种集中训练分布执行框架下基于多智能体近端策略优化的频谱分配算法。最后,通过仿真验证了所提算法的性能。仿真结果表明,所提算法具有更快的收敛速度,通过有效减少层内与层间干扰、平衡接入与回程链路速率,可以将系统总速率提高25.2%。

关键词: 智能电网, 集成接入回程, 频谱分配, 多智能体强化学习

Abstract:

In view of the fact that 5G networks are used to meet the service requirements of various power terminals in smart grid, a spectrum allocation algorithm based on multi-agent reinforcement learning was proposed.Firstly, for the integrated access backhaul system deployed in smart grid, considering the different communication requirements of services in lightweight and non-lightweight terminal, the spectrum allocation problem was formulated as a non-convex mixed-integer programming aiming to maximize the overall energy efficiency.Secondly, the above problem was modeled as a partially observable Markov decision process and transformed into a fully cooperative multi-agent problem, then a spectrum allocation algorithm was proposed which was based on multi-agent proximal policy optimization under the framework of centralized training and distributed execution.Finally, the performance of the proposed algorithm was verified by simulation.The results show that the proposed algorithm has a faster convergence speed and can increase the overall transmission rate by 25.2% through effectively reducing intra-layer and inter-layer interference and balancing the access and backhaul link rates.

Key words: smart grid, integrated access and backhaul, spectrum allocation, multi-agent reinforcement learning

中图分类号: 

No Suggested Reading articles found!