Actor-critic algorithm with incremental dual natural policy gradient
Peng ZHANG, Quan LIU, Shan ZHONG, Jian-wei ZHAI, Wei-sheng QIAN
Journal on Communications. 2017, (4): 166-177. DOI: 10.11959/j.issn.1000-436x.2017089