Big Data Research ›› 2022, Vol. 8 ›› Issue (5): 88-105. doi: 10.11959/j.issn.2096-0271.2022033

• STUDY •

Knowledge-enhanced policy-guided interactive reinforcement recommendation system

Yuqi ZHANG1,2, Xiaowen HUANG1,2, Jitao SANG1,2   

  1. School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China
  2. Beijing Key Lab of Traffic Data Analysis and Mining, Beijing 100044, China
  • Online: 2022-09-15 Published: 2022-09-01
  • Supported by:
    The Fundamental Research Funds for the Central Universities (2021RC217)

Abstract:

Recommendation systems are an important means of addressing information overload in social media. Because traditional recommendation systems cannot optimize the long-term user experience, researchers have proposed interactive recommendation systems and used deep reinforcement learning to optimize the recommendation policy. However, reinforcement recommendation algorithms face three problems: sparse feedback, learning from scratch (which damages the user experience), and a large item space. To address these problems, an improved interactive reinforcement recommendation model, KGP-DQN, is proposed. First, a behavioral knowledge graph representation module combines users' historical behavior with a knowledge graph to alleviate sparse feedback. Second, a policy initialization module provides the reinforcement recommendation system with an initial policy based on users' historical behavior, addressing the problem of learning from scratch. Third, a candidate selection module generates candidates by dynamically clustering item representations on the behavioral knowledge graph, addressing the large action space. Experiments were conducted on three real-world datasets. The results show that KGP-DQN trains the reinforcement recommendation system quickly and effectively, and that its recommendation accuracy on the three datasets exceeds 80%.
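The abstract gives no implementation details, so the following is only a rough sketch of the candidate selection idea: items are clustered by their representations, and the Q-function is evaluated only on the cluster nearest the current user state. Everything here is an assumption for illustration: the function names (`kmeans`, `select_candidates`, `recommend`), the use of plain k-means with evenly spaced initial centroids, the Euclidean nearest-cluster rule, and the stand-in `q_fn`. It is not the KGP-DQN implementation.

```python
import numpy as np

def kmeans(items, k, iters=20):
    """Plain k-means over item vectors; returns (centroids, labels).

    Assumption: centroids start from k evenly spaced items so the
    sketch is deterministic; a real system would likely differ.
    """
    init_idx = np.linspace(0, len(items) - 1, k).astype(int)
    centroids = items[init_idx].astype(float)
    labels = np.zeros(len(items), dtype=int)
    for _ in range(iters):
        # Distance from every item to every centroid: shape (n_items, k).
        dists = np.linalg.norm(items[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for c in range(k):
            members = items[labels == c]
            if len(members) > 0:  # keep the old centroid if a cluster empties
                centroids[c] = members.mean(axis=0)
    return centroids, labels

def select_candidates(state, centroids, labels):
    """Return indices of items in the non-empty cluster nearest to the state."""
    sizes = np.bincount(labels, minlength=len(centroids))
    dists = np.linalg.norm(centroids - state, axis=1)
    dists[sizes == 0] = np.inf  # never pick an empty cluster
    return np.flatnonzero(labels == dists.argmin())

def recommend(state, items, candidates, q_fn):
    """Score only the candidates with q_fn and return the best item index."""
    scores = np.array([q_fn(state, items[i]) for i in candidates])
    return int(candidates[scores.argmax()])

# Hypothetical toy data: six 2-D item vectors forming two groups.
items = np.array([[0, 0], [0, 1], [1, 0],
                  [10, 10], [10, 11], [11, 10]], dtype=float)
state = np.array([10.0, 12.0])           # hypothetical user state vector
q_fn = lambda s, a: float(s @ a)         # stand-in for a learned Q-network

centroids, labels = kmeans(items, k=2)
candidates = select_candidates(state, centroids, labels)
best = recommend(state, items, candidates, q_fn)
```

The point of the design is that the Q-function is evaluated on one cluster rather than on the full item set, which is how a candidate step can shrink a large action space; re-running the clustering as item representations change would make it dynamic.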

Key words: interactive recommendation system, deep reinforcement learning, knowledge graph, policy initialization, candidate select

