Enhanced deep deterministic policy gradient algorithm (增强型深度确定策略梯度算法)
Jianping CHEN (陈建平), Chao HE (何超), Quan LIU (刘全), Hongjie WU (吴宏杰), Fuyuan HU (胡伏原), Qiming FU (傅启明)
Journal on Communications (通信学报). 2018, (11): 106-115. DOI: 10.11959/j.issn.1000-436x.2018238