Gradient descent Sarsa(?)algorithm based on the adaptive potential function shaping reward mechanism
Fei XIAO,Quan LIU,Qi-ming FU,Hong-kun SUN,Long GAO
Journal on Communications . 2013, (1): 77 -89 .  DOI: 1000-436X(2013)01-0077-12