[1] |
LI X W , SOH Y C , XIE L H ,et al. Cooperative output regulation of heterogeneous linear multi-agent networks via H∞performance allocation[J]. IEEE Transactions on Automatic Control, 2019,64(2): 683-696.
|
[2] |
孙长银, 穆朝絮 . 多智能体深度强化学习的若干关键科学问题[J]. 自动化学报, 2020,7(46): 1301-1312.
|
|
SUN C Y , MU C X . Important scientific problems of multi-agent deep reinforcement learning[J]. Acta Automatica Sinica, 2020,7(46): 1301-1312.
|
[3] |
裴国旭, 杜晓明, 薛昭 ,等. 多智能体系统在军事仿真领域的应用现状[J]. 飞航导弹, 2017(2): 46-49,73.
|
|
PEI G X , DU X M , XUE Z ,et al. Application status of multi-agent system in military simulation field[J]. Aerodynamic Missile Journal, 2017(2): 46-49,73.
|
[4] |
ABDELKADER A , ABDELHAMID T . Distributed output regulation of heterogeneous linear multi-agent systems with communication constraints[J]. Automatica, 2018,91: 152-158.
|
[5] |
史乐, 李辉, 原江波 . 基于消息通信的多智能体系统的应用[J]. 计算机应用, 2008,28(2): 531-534.
|
|
SHI L , LI H , YUAN J B . Multi-agent system based on the message communicaiton[J]. Journal of Computer Applications, 2008,28(2): 531-534.
|
[6] |
QIN J H , MA Q C , SHI Y ,et al. Recent advances in consensus of multi-agent systems:a brief survey[J]. IEEE Transactions on Industrial Electronics, 2017,64(6): 4972-4983.
|
[7] |
ZHANG H G , JIANG H , LUO Y H ,et al. Data-driven optimal consensus control for discrete-time multi-agent systems with unknown dynamics using reinforcement learning method[J]. IEEE Transactions on Industrial Electronics, 2017,64(5): 4091-4100.
|
[8] |
WANG X , YANG G H . Fault-tolerant consensus tracking control for linear multi-agent systems under switching directed network[J]. IEEE Transactions on Cybernetics, 2020,50(5): 1921-1930.
|
[9] |
ZHANG J C , ZHU F L . Observer-based output consensus of a class of heterogeneous multi-agent systems with unmatched disturbances[J]. Communications in Nonlinear Science and Numerical Simulation, 2018,56: 240-251.
|
[10] |
LIU L . Adaptive cooperative output regulation for a class of nonlinear multi-agent systems[J]. IEEE Transactions on Automatic Control, 2015,60(6): 1677-1682.
|
[11] |
MA Q , QIN J , ZHENG W X ,et al. Output group synchronization for networks of heterogeneous linear systems under internal model principle[J]. IEEE Transactions on Circuits and Systems I:Regular Papers, 2018,65(5): 1684-1695.
|
[12] |
YAN Y M , HUANG J . Cooperative output regulation of discrete-time linear time-delay multi-agent systems[J]. IET Control Theory & Applications, 2016,10(16): 2019-2026.
|
[13] |
王飞跃, 曹东璞, 魏庆来 . 强化学习:迈向知行合一的智能机制与算法[J]. 智能科学与技术学报, 2020,2(2): 101-106.
|
|
WANG F Y , CAO D P , WEI Q L . Reinforcement learning:toward actionknowledge merged intelligent mechanisms and algorithms[J]. Chinese Journal of Intelligent Science and Technology, 2020,2(2): 101-106.
|
[14] |
LEWIS F L , VRABIE D . Reinforcement learning and adaptive dynamic programming for feedback control[J]. IEEE Circuits and Systems Magazine, 2009,9(3): 32-50.
|
[15] |
LIU Y Y , WANG Z S , SHI Z . H∞ tracking control for linear discrete-time systems via reinforcement learning[J]. International Journal of Robust and Nonlinear Control, 2020,30(1): 282-301.
|
[16] |
MODARES H , NAGESHRAO S P , LOPES G A D ,et al. Optimal model-free output synchronization of heterogeneous systems using off-policy reinforcement learning[J]. Automatica, 2016,71: 334-341.
|
[17] |
MODARES H , LEWIS F L , KANG W ,et al. Optimal synchronization of heterogeneous nonlinear systems with unknown dynamics[J]. IEEE Transactions on Automatic Control, 2018,63(1): 117-131.
|
[18] |
KIUMARSI B , LEWIS F L , MODARES H ,et al. Reinforcement Q-learning for optimal tracking control of linear discrete-time systems with unknown dynamics[J]. Automatica, 2014,50: 1167-1175.
|
[19] |
KIUMARSI B , LEWIS F L . Output synchronization of heterogeneous discrete-time systems:A model-free optimal approach[J]. Automatica, 2017,84: 86-94.
|
[20] |
YANG Y L , MODARES H , WUNSCH D C ,et al. Leader-follower output synchronization of linear heterogeneous systems with active leader using reinforcement learning[J]. IEEE Transactions on Neural Networks and Learning Systems, 2018,29(6): 2139-2153.
|