1 |
SUTTON R S , BARTO A G . Reinforcement Learning:An Introduction[M]. Cambridge:MIT Press, 1998.
|
2 |
LIN C S , KIM H . Selection of learning parameters for CMAC-based adaptive critic learning[J]. IEEE Trans Neural Networks, 1999,6(3):642-647.
|
3 |
PELLEG D , MOORE A , SHROFF N B . X-means:extending K-Means with efficient estimation of the number of clusters[A]. Proc of the 17th International Conf on Machine Learning[C]. Boston:Morgan Kaufmann Press, 2000.727-734.
|
4 |
PELLEG D , MOORE A . Accelerating exact k-means algorithms with geometric reasoning[A]. Proc of the fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining[C]. 1999.277-281.
|
5 |
陈宗海, 文锋, 聂建斌 等. 基于节点生长 k-均值聚类算法的强化学习方法[J]. 计算机研究与发展, 2006,43(4):661-666. CHEN Z H , WEN F , NIE J B , et al. A reinforcement learning method based on node-growing k-means cluster algorithm[J]. Journal of Computer Research and Development, 2006,43(4):661-666.
|
6 |
文锋, 陈宗海, 卓睿 等. 连续状态自适应离散化基于K-均值聚类的强化学习方法[J]. 控制与决策, 2006,21(2):143-147. WEN F , CHEN Z H , ZHUO R , et al. Reinforcement learning method of continuous state adaptively discretized based on K-means clustering[J]. Control and Decision, 2006,21(2):143-147.
|
7 |
顾冬雷, 陈卫东, 席裕庚 . 一种基于增强学习的自适应控制方法[J]. 控制与决策, 2002,17(4):473-479. GU D L , CHEN W D , XI Y G . A novel adaptive control algorithm based on reinforcement learning[J]. Control and Decision, 2002,17(4):473-479.
|
8 |
MOORE A W , ATKESON C G . The parti-game algorithm for variable resolution reinforcement learning in multidimensional state spaces[J]. Machine Learning, 1995,21(3):199-233.
|
9 |
UTHER W T B , VELOSO M M . Tree based discretization for continuous state space reinforcement learning[A]. AAAI’98[C]. Madison,Wisconsin,United States, 1998
|
10 |
SHERSTOV A A , STONE P . Function Approximation Via Tile Coding:Automating Parameter Choice Abstraction,Reformulation and Approximation[M]. Springer Berlin Heidelberg, 2005:194-205.
|
11 |
WHITESON S , TAYLOR M E , STONE P . Adaptive tile Coding for Value Function Approximation[M]. Computer Science Department,University of Texas at Austin, 2007.
|
12 |
WHITESON S , STONE P . Evolutionary function approximation for reinforcement learning[J]. The Journal of Machine Learning Research, 2006,7:877-917.
|
13 |
NOKHBEH-ZAEEM M , KHASHABI D , TALEBI H A , et al. Adaptive tiled neural networks[A]. 2011 IEEE International Conference on Systems,Man,and Cybernetics (SMC)[C]. New Orleans,LA,USA, 2011.2543-2548.
|
14 |
LIN S , WRIGHT R . Evolutionary tile coding:an automated state abstraction algorithm for reinforcement learning[A]. AAAI Workshops[C]. 2010.
|