[1] |
LIU D R , WEI Q L . Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems[J]. IEEE Transactions on Neural Networks and Learning Systems, 2014,25(3): 621-634.
|
[2] |
LIU D R , WEI Q L , WANG D ,et al. Adaptive dynamic programming for optimal residential energy management[M]. Adaptive dynamic programming with applications in optimal control. Cham: Springer International Publishing, 2017: 483-535.
|
[3] |
WANG D , HA M M , QIAO J F . Self-learning optimal regulation for discrete-time nonlinear systems under event-driven formulation[J]. IEEE Transactions on Automatic Control, 2020,65(3): 1272-1279.
|
[4] |
WANG D , QIAO J F , CHENG L . An approximate neuro-optimal solution of discounted guaranteed cost control design[J]. IEEE Transactions on Cybernetics, 2020: 1-10.
|
[5] |
LI Y M , LIU Y J , TONG S C . Observer-based neuro-adaptive optimized control of strict-feedback nonlinear systems with state constraints[J]. IEEE Transactions on Neural Networks and Learning Systems, 2021: 1-15.
|
[6] |
YANG X , HE H B , ZHONG X N . Approximate dynamic programming for nonlinear-constrained optimizations[J]. IEEE Transactions on Cybernetics, 2021,51(5): 2419-2432.
|
[7] |
WANG D , HA M M , CHENG L . Neuro-optimal trajectory tracking with value iteration of discrete-time nonlinear dynamics[J]. IEEE Transactions on Neural Networks and Learning Systems, 2021: 1-12.
|
[8] |
WANG D , LIU D R , WEI Q L . Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach[J]. Neurocomputing, 2012,78(1): 14-22.
|
[9] |
KIUMARSI B , LEWIS F L . Actor-critic-based optimal tracking for partially unknown nonlinear discrete-time systems[J]. IEEE Transactions on Neural Networks and Learning Systems, 2015,26(1): 140-151.
|
[10] |
HA M M , WANG D , LIU D R . Data-based nonaffine optimal tracking control using iterative DHP approach[J]. IFAC-PapersOnLine, 2020,53(2): 4246-4251.
|
[11] |
WANG D , HA M M , QIAO J F . Data-driven iterative adaptive critic control toward an urban wastewater treatment plant[J]. IEEE Transactions on Industrial Electronics, 2021,68(8): 7362-7369.
|
[12] |
WEI Q L , LIAO Z H , SONG R Z ,et al. Self-learning optimal control for ice-storage air conditioning systems via data-based adaptive dynamic programming[J]. IEEE Transactions on Industrial Electronics, 2021,68(4): 3599-3608.
|
[13] |
李金娜, 程薇燃 . 基于强化学习的数据驱动多智能体系统最优一致性综述[J]. 智能科学与技术学报, 2020,2(4): 327-340.
|
|
LI J N , CHENG W R . An overview of optimal consensus for data driven multi-agent system based on reinforcement learning[J]. Chinese Journal of Intelligent Science and Technology, 2020,2(4): 327-340.
|
[14] |
李涛, 魏庆来 . 基于深度强化学习的智能暖气温度控制系统[J]. 智能科学与技术学报, 2020,2(4): 348-353.
|
|
LI T , WEI Q L . Intelligent heating temperature control system based on deep reinforcement learning[J]. Chinese Journal of Intelligent Science and Technology, 2020,2(4): 348-353.
|
[15] |
王金予, 魏欣然, 石文磊 ,等. 强化学习在资源优化领域的应用[J]. 大数据, 2021,7(5): 131-149.
|
|
WANG J Y , WEI X R , SHI W L ,et al. Applications of reinforcement learning in the field of resource optimization[J]. Big Data Research, 2021,7(5): 131-149.
|
[16] |
LUO B , LIU D R , HUANG T W ,et al. Model-free optimal tracking control via critic-only Q-learning[J]. IEEE Transactions on Neural Networks and Learning Systems, 2016,27(10): 2134-2144.
|
[17] |
LI S , DING L , GAO H B ,et al. ADP-based online tracking control of partially uncertain time-delayed nonlinear system and application to wheeled mobile robots[J]. IEEE Transactions on Cybernetics, 2020,50(7): 3182-3194.
|