通信学报 ›› 2023, Vol. 44 ›› Issue (12): 86-98.doi: 10.11959/j.issn.1000-436x.2023235

• 学术论文 • 上一篇    

基于多智能体强化学习的异构网络CRE偏置动态优化算法

张铖1,2, 朱家烨1, 刘泽宁2, 黄永明1,2   

  1. 1 东南大学移动通信全国重点实验室,江苏 南京 211111
    2 网络通信与安全紫金山实验室,江苏 南京 211111
  • 修回日期:2023-11-13 出版日期:2023-12-01 发布日期:2023-12-01
  • 作者简介:张铖(1988- ),男,安徽望江人,博士,东南大学副教授、博士生导师,主要研究方向为无线通信系统的空时信号处理、机器学习辅助的无线通信智能优化技术等
    朱家烨(1998- ),女,江苏无锡人,东南大学硕士生,主要研究方向为无线通信网络中的多小区干扰协调
    刘泽宁(1993- ),男,江苏淮安人,博士,网络通信与安全紫金山实验室研究员,主要研究方向为边缘计算、智能干扰优化和资源分配技术
    黄永明(1977- ),男,江苏吴江人,博士,网络通信与安全紫金山实验室研究员,东南大学博士生导师,主要研究方向为智能5G/6G移动通信、毫米波无线通信等
  • 基金资助:
    国家自然科学基金资助项目(62225107);国家自然科学基金资助项目(62271140);江苏省前沿引领技术基础研究重大基金资助项目(BK20222001);江苏省创新创业人才计划基金资助项目(JSSCBS20211332)

Multi-agent reinforcement learning based dynamic optimization algorithm of CRE offset for heterogeneous networks

Cheng ZHANG1,2, Jiaye ZHU1, Zening LIU2, Yongming HUANG1,2   

  1. 1 National Mobile Communication Research Laboratory, Southeast University, Nanjing 211111, China
    2 Purple Mountain Laboratories: Networking, Communications and Security, Nanjing 211111, China
  • Revised:2023-11-13 Online:2023-12-01 Published:2023-12-01
  • Supported by:
    The National Natural Science Foundation of China(62225107);The National Natural Science Foundation of China(62271140);The Natural Science Foundation on Frontier Leading Technology Basic Research Project of Jiangsu(BK20222001);The Jiangsu Innovative and Entrepreneurial Talent Program(JSSCBS20211332)

摘要:

为应对无线网络用户激增导致的高吞吐量需求,针对宏微异构网络干扰场景,提出一种基于多智能体强化学习的小区范围扩展(CRE)偏置动态优化算法。基于协作多智能体强化学习的值分解网络框架,通过合理利用并在微微基站间交互系统内用户分布及其所受干扰水平,实现所有微微基站的个性化 CRE 偏置值在线本地化决策。仿真结果表明,与CRE=5 dB、分布式Q-Learning算法相比,所提算法在提高系统吞吐量、均衡各基站吞吐量及改善边缘用户吞吐量方面具有明显优势。

关键词: 异构网络, 小区范围扩展, 多智能体强化学习, 值分解网络算法

Abstract:

To cope with the high throughput demand caused by the proliferation of wireless network users, a multi-agent reinforcement learning based dynamic optimization algorithm of cell range expansion (CRE) offset was proposed for interference scenarios in macro-pico heterogeneous networks.Based on the value decomposition network framework of collaborative multi-agent reinforcement learning, a personalized online local decision of CRE offset for all pico-base stations was achieved by reasonably utilizing and interacting the intra-system user distribution and their interference levels among pico-base stations.Simulation results show that the proposed algorithm has significant advantages in increasing system throughput, balancing the throughput of each base station and improving edge-user throughput, compared to CRE=5 dB and distributed Q-learning algorithms.

Key words: heterogeneous network, cell range expansion, multi-agent reinforcement learning, value decomposition network algorithm

中图分类号: 

No Suggested Reading articles found!