大数据

• •    

基于动态动作覆盖的深度强化学习新闻推荐

董相宏,安俊秀   

  1. 成都信息工程大学软件工程学院,成都 610000

Deep reinforcement learning news recommendation based on dynamic action coverage

DONG Xianghong, AN Junxiu   

  1. School of Software Engineering, Chengdu University of Information Technology, Chengdu 610000,China

摘要: 新闻推荐系统对于新媒体新闻传播有着重要作用,本文提出了一种以深度强化学习为基础的旨在结合神经网络的表征能力和强化学习的策略选择能力以提升新闻推荐效果的推荐系统。本文使用动态动作掩码加强对用户短期兴趣的判断能力,使用优化缓存机制提升经验缓存的使用效率,通过区域遮蔽性质的奖励设计加快模型训练,提高推荐系统在新闻推荐领域的表现。实验表明,本文所提模型在新闻数据集上推荐准确率与现今主流的神经网络推荐方法相当,且在排序性能上优于当前先进的推荐算法。

关键词: 新闻推荐, 强化学习, 动态掩码, 优势缓存, 内在奖励

Abstract: News recommendation system plays an important role in news dissemination of new media. This paper proposes a recommendation system based on deep reinforcement learning, which aims to combine the representation ability of neural network and the strategy selection ability of reinforcement learning to improve the effect of news recommendation. This paper uses dynamic action masks to enhance the ability to judge users' short-term interests, uses the optimization cache mechanism to improve the efficiency of experience cache use, and accelerates model training through the reward design of regional masking nature to improve the performance of the recommendation system in the field of news recommendation. Experimental results show that the accuracy of the proposed model in news data sets is comparable to the current mainstream neural network recommendation methods, and its ranking performance is more than others.

Key words:  , news recommendation, reinforcement learning, dynamic mask, advantage cache, internal reward

中图分类号: 

No Suggested Reading articles found!