智能科学与技术学报 ›› 2020, Vol. 2 ›› Issue (4): 394-400.doi: 10.11959/j.issn.2096-6652.202042

• 专刊:深度强化学习 • 上一篇    下一篇

异构多智能体系统的输出同步:一个基于数据的强化学习方法

刘莹莹, 王占山   

  1. 东北大学信息科学与工程学院,辽宁 沈阳 110819
  • 修回日期:2020-12-01 出版日期:2020-12-15 发布日期:2020-12-01
  • 作者简介:刘莹莹(1995- ),女,东北大学信息科学与工程学院博士生,主要研究方向为数据驱动的系统无模型控制、基于强化学习的系统最优控制、多智能体系统最优控制等。
    王占山(1971- ),男,博士,东北大学信息科学与工程学院教授、博士生导师,主要研究方向为神经动力系统稳定性理论、复杂网络同步性与一致性以及数据驱动的智能故障诊断与自适应容错控制等。
  • 基金资助:
    国家自然科学基金资助项目(61973070);辽宁省“兴辽英才计划”资助项目(XLYC1802010);流程工业综合自动化基础研究基金资助项目(2018ZCX22)

Output synchronization of heterogeneous multi-agent system:a reinforcement learning approach based on data

Yingying LIU, Zhanshan WANG   

  1. College of Information Science and Engineering, Northeastern University, Shenyang 110819, China
  • Revised:2020-12-01 Online:2020-12-15 Published:2020-12-01
  • Supported by:
    The National Natural Science Foundation of China(61973070);Liaoning Revitalization Talents Program(XLYC1802010);SAPI Fundamental Research Funds(2018ZCX22)

摘要:

通过强化学习研究了异构多智能体系统的输出同步问题。根据多智能体系统的拓扑结构,定义一个具有邻居控制输入的性能指标和价值函数。为克服已有控制方法需要系统模型的弊端,提出一个基于系统数据的强化学习算法,使输出同步控制器也可以被应用于模型未知的情况。此外,通过调节价值函数中的权重矩阵,可以减少每个智能体的控制成本。最后,通过一个仿真示例验证了该方法的有效性和定义的价值函数的优越性。

关键词: 多智能体系统, 强化学习, 输出同步, 基于数据

Abstract:

The output synchronization of heterogeneous multi-agent system was studied by reinforcement learning.According to the topology of multi-agent system, the performance index and value function with neighbor control input were defined.To overcome the disadvantage of existing control methods that require system model, a reinforcement learning algorithm based on system data was proposed.Hence, the output synchronization controller can also be applied when the system model was unknown.In addition, by adjusting the weight matrix in value function, the control cost of each agent can be reduced.Finally, a simulation example was given to illustrate the effectiveness of proposed method and the superiority of defined value function.

Key words: multi-agent system, reinforcement learning, output synchronization, based on data

中图分类号: 

No Suggested Reading articles found!