[1] |
ZHANG L J , QI X , PANG Y J . Adaptive output feedback control based on DRFNN for AUV[J]. Ocean Engineering, 2009,36(9-10): 716-722.
|
[2] |
SUTTON R , BARTO A . Reinforcement learning:an introduction[M]. Cambridge: MIT Press, 1998.
|
[3] |
TESAURO G . TD-Gammon,a self-teaching backgammon program,achieves master-level play[J]. Neural Computation, 1944,6(2): 215-219.
|
[4] |
MNIH V , KAVUKCUOGLU K , SILVER D ,et al. Playing atari with deep reinforcement learning[J]. Computer Science, 2013.
|
[5] |
SILVER D , HUANG A , MADDISON C J ,et al. Mastering the game of go with deep neural networks and tree search[J]. Nature, 2016,529(7587): 484-489.
|
[6] |
SILVER D , TECHNOLOGIES D , LEVER G ,et al. Deterministic policy gradient algorithms[C]// International Conference on Machine Learning. New York:ACM Press, 2014.
|
[7] |
LILLICRAP T P , HUNT J J , PRITZEL A ,et al. Continuous control with deep reinforcement learning[J]. Computer Science, 2015,6(6): A187.
|
[8] |
HWANGBO J , LEE J , DOSOVITSKIY A ,et al. Learning agile and dynamic motor skills for legged robots[J]. Science Robotics, 2019,4(26).
|
[9] |
严卫生 . 鱼雷航行力学[M]. 西安: 西北工业大学出版社, 2005.
|
|
YAN W S . Torpedo navigation mechanics[M]. Xi’an: Northwestern Polytechnical University Press, 2005.
|
[10] |
GOODFELLOW I , BENGIO Y , COURVILLE A . Deep learning[M]. Cambridge: MIT Press, 2016.
|
[11] |
KINGMA D , BA J . ADAM:a method for stochastic optimization[J]. Computer Science, 2014.
|
[12] |
MNIH V , KAVUKCUOGLU K , SILVER D ,et al. Human-level control through deep reinforcement learning[J]. Nature, 2015, 518-529.
|
[13] |
KONDA V , TSITSIKLIS J . Actor-critic algorithms[J]. In Advances in Neural Information Processing Systems, 2003: 1008-1014.
|
[14] |
UHLENBECK G E , ORNSTEIN L S . On the theory of the Brownian motion[J]. Revista Latinoamericana De Microbiología, 1973,15(1): 29.
|
[15] |
ABADI M , BARHAM P , CHEN P ,et al. TensorFlow:a system for large-scale machine learning[J]. Google Brain, 2016.
|
[16] |
NAIR V , HINTON G . Rectified linear units improve restricted Boltzmann machines[C]// International Conference on Machine Learning.[S.l.:s.n.], 2010.
|