电信科学 ›› 2022, Vol. 38 ›› Issue (2): 71-83.doi: 10.11959/j.issn.1000-0801.2022035
申情1,2, 郭文宾1, 楼俊钢1,3, 余强国2
修回日期:
2021-10-26
出版日期:
2022-02-20
发布日期:
2022-02-01
作者简介:
申情(1982- ),女,湖州学院副教授,主要研究方向为个性化推荐、多目标优化、智能决策等基金资助:
Qing SHEN1,2, Wenbin GUO1, Jungang LOU1,3, Qiangguo YU2
Revised:
2021-10-26
Online:
2022-02-20
Published:
2022-02-01
Supported by:
摘要:
个性化推荐已成为解决信息过载的最有效手段之一,也是海量数据挖掘研究领域的热点技术。然而传统推荐算法往往只使用用户对物品的评分信息,而缺少对用户与物品潜在特征的综合考虑。基于因子分解机、宽神经网络、交叉网络和深度神经网络的融合,提出一种新的考虑多层次潜在特征的模型,可以提取用户与物品的浅层潜在特征、低阶非线性潜在特征、线性交叉潜在特征以及高阶非线性潜在特征。在 4 个常用的数据集上的实验结果表明,考虑用户与物品多层次潜在特征可以有效提高个性化推荐的预测精度。最后,研究了嵌入层维度以及神经元数量等因素对新模型预测性能的影响。
中图分类号:
申情, 郭文宾, 楼俊钢, 余强国. 考虑多层次潜在特征的个性化推荐模型[J]. 电信科学, 2022, 38(2): 71-83.
Qing SHEN, Wenbin GUO, Jungang LOU, Qiangguo YU. Personalized recommendation model with multi-level latent features[J]. Telecommunications Science, 2022, 38(2): 71-83.
表3
数据集MovieLens-100k训练集占比不同(不同数据稀疏度)情况的MAE"
训练集占比 | 80% | 70% | 50% | 30% |
UB | 0.825 4±0.022 7 | 0.838 5±0.013 2 | 0.842 9±0.017 9 | 0.882 3±0.027 2 |
FM | 0.744 9±0.002 3 | 0.745 8±0.005 4 | 0.747 7±0.007 9 | 0.761 3±0.009 4 |
DNN | 0.732 5±0.009 3 | 0.741 4±0.019 9 | 0.745 8±0.013 1 | 0.760 5±0.017 2 |
DeepFM | 0.719 9±0.005 3 | 0.729 5±0.006 9 | 0.742 2±0.004 5 | 0.759 1±0.006 4 |
DCN | 0.723 2±0.007 3 | 0.730 1±0.005 7 | 0.744 9±0.007 8 | 0.758 3±0.008 9 |
WD | 0.719 4±0.005 4 | 0.728 9±0.004 9 | 0.7411±0.009 5 | 0.759 3±0.010 1 |
本文模型 | 0.710 2±0.004 3 | 0.724 9±0.006 7 | 0.735 2±0.007 9 | 0.754 7±0.005 3 |
表4
数据集MovieLens-100k训练集占比不同(不同数据稀疏度)情况的RMSE"
训练集占比 | 80% | 70% | 50% | 30% |
UB | 1.028 3±0.034 1 | 1.043 3±0.017 3 | 1.054 2±0.021 5 | 1.094 3±0.029 5 |
FM | 0.944 1±0.003 3 | 0.945 3±0.007 7 | 0.948 5±0.009 4 | 0.967 5±0.013 1 |
DNN | 0.936 3±0.011 1 | 0.942 7±0.014 9 | 0.947 4±0.006 6 | 0.966 1±0.009 2 |
DeepFM | 0.920 1±0.006 9 | 0.928 9±0.006 1 | 0.938 6±0.008 6 | 0.959 2±0.011 7 |
DCN | 0.922 9±0.009 4 | 0.931 1±0.008 3 | 0.941 6±0.010 9 | 0.959 7±0.009 1 |
WD | 0.919 4±0.006 8 | 0.927 3±0.009 1 | 0.937 9±0.014 2 | 0.958 1±0.007 4 |
本文模型 | 0.909 8±0.005 3 | 0.926 1±0.004 3 | 0.931 2±0.005 5 | 0.952 4±0.008 5 |
表5
数据集MovieLens-1m训练集占比不同(不同数据稀疏度)情况的MAE"
训练集占比 | 80% | 70% | 50% | 30% |
UB | 0.821 3±0.008 1 | 0.830 3±0.012 4 | 0.851 4±0.015 8 | 0.876 9±0.019 3 |
FM | 0.674 5±0.018 2 | 0.675 6±0.019 6 | 0.706 8±0.021 4 | 0.734 2±0.025 1 |
DNN | 0.692 6±0.019 7 | 0.701 8±0.020 3 | 0.719 4±0.024 6 | 0.738 5±0.026 8 |
DeepFM | 0.672 8±0.007 8 | 0.672 7±0.009 1 | 0.688 4±0.012 6 | 0.706 8±0.013 4 |
DCN | 0.673 6±0.014 1 | 0.673 3±0.015 7 | 0.689 5±0.017 3 | 0.714 5±0.016 2 |
WD | 0.674 7±0.009 3 | 0.673 9±0.010 6 | 0.691 7±0.011 4 | 0.718 1±0.013 6 |
本文模型 | 0.663 5±0.011 7 | 0.668 1±0.012 9 | 0.685 2±0.013 5 | 0.702 3±0.015 1 |
表6
数据集MovieLens-1m训练集占比不同(不同数据稀疏度)情况的RMSE"
训练集占比 | 80% | 70% | 50% | 30% |
UB | 1.019 8±0.007 2 | 1.028 7±0.009 1 | 1.047 6±0.012 7 | 1.073 7±0.015 3 |
FM | 0.862 3±0.017 8 | 0.864 7±0.019 4 | 0.885 4±0.021 3 | 0.915 8±0.023 8 |
DNN | 0.877 4±0.015 4 | 0.886 4±0.018 3 | 0.897 2±0.020 6 | 0.923 8±0.026 8 |
DeepFM | 0.856 5±0.008 7 | 0.862 3±0.009 5 | 0.873 4±0.010 7 | 0.899 1±0.011 3 |
DCN | 0.857 2±0.005 7 | 0.863 7±0.008 6 | 0.875 9±0.007 4 | 0.903 4±0.009 1 |
WD | 0.858 6±0.017 5 | 0.863 1±0.013 2 | 0.879 3±0.014 4 | 0.908 9±0.012 8 |
本文模型 | 0.849 3±0.019 7 | 0.857 2±0.014 5 | 0.872 6±0.015 9 | 0.891 2±0.013 7 |
表7
数据集MovieLens-latest-small训练集占比不同(不同数据稀疏度)情况的MAE"
训练集占比 | 80% | 70% | 50% | 30% |
UB | 0.924 7±0.053 3 | 0.935 0±0.046 2 | 0.951 7±0.041 7 | 0.968 4±0.038 9 |
FM | 0.692 6±0.015 5 | 0.699 8±0.017 6 | 0.711 8±0.019 4 | 0.728 3±0.021 3 |
DNN | 0.711 7±0.023 5 | 0.723 9±0.031 6 | 0.735 6±0.033 4 | 0.741 2±0.036 1 |
DeepFM | 0.684 3±0.012 7 | 0.688 2±0.014 2 | 0.701 8±0.016 7 | 0.711 3±0.017 8 |
DCN | 0.680 2±0.007 6 | 0.679 5±0.009 4 | 0.695 7±0.010 3 | 0.708 4±0.012 7 |
WD | 0.682 2±0.011 8 | 0.684 7±0.013 7 | 0.697 3±0.015 6 | 0.709 8±0.017 5 |
本文模型 | 0.671 4±0.013 6 | 0.673 3±0.014 6 | 0.690 2±0.016 2 | 0.701 5±0.018 4 |
表8
数据集MovieLens-latest-small训练集占比不同(不同数据稀疏度)情况的RMSE"
训练集占比 | 80% | 70% | 50% | 30% |
UB | 1.016 0±0.068 3 | 1.022 6±0.053 9 | 1.036 4±0.058 6 | 1.057 6±0.049 3 |
FM | 0.893 4±0.013 6 | 0.897 9±0.015 7 | 0.905 2±0.018 7 | 0.917 3±0.020 4 |
DNN | 0.901 3±0.027 2 | 0.916 3±0.032 8 | 0.928 3±0.037 1 | 0.937 6±0.036 8 |
DeepFM | 0.878 9±0.017 5 | 0.886 1±0.018 3 | 0.902 5±0.020 9 | 0.921 6±0.021 7 |
DCN | 0.877 8±0.009 4 | 0.884 7±0.010 9 | 0.899 4±0.012 4 | 0.919 6±0.013 4 |
WD | 0.875 9±0.013 3 | 0.881 4±0.014 7 | 0.901 6±0.016 2 | 0.920 2±0.017 9 |
本文模型 | 0.867 2±0.015 8 | 0.870 4±0.016 4 | 0.893 7±0.017 8 | 0.914 5±0.019 2 |
表9
数据集FilmTrust训练集占比不同(不同数据稀疏度)情况的MAE"
训练集占比 | 80% | 70% | 50% | 30% |
UB | 0.685 9±0.012 6 | 0.698 5±0.013 9 | 0.729 9±0.016 4 | 0.794 2±0.019 2 |
FM | 0.639 1±0.013 4 | 0.645 9±0.014 7 | 0.660 8±0.017 3 | 0.689 3±0.018 6 |
DNN | 0.651 2±0.022 7 | 0.659 2±0.025 4 | 0.678 4±0.026 2 | 0.691 5±0.029 3 |
DeepFM | 0.630 9±0.011 9 | 0.634 5±0.013 5 | 0.661 3±0.014 9 | 0.674 1±0.016 1 |
DCN | 0.633 8±0.012 5 | 0.637 3±0.013 9 | 0.665 9±0.015 7 | 0.678 6±0.017 3 |
WD | 0.631 7±0.009 6 | 0.640 9±0.010 4 | 0.667 4±0.013 2 | 0.670 3±0.014 5 |
本文模型 | 0.621 4±0.011 3 | 0.634 8±0.012 6 | 0.655 0±0.014 8 | 0.665 3±0.015 8 |
表10
数据集FilmTrust训练集占比不同(不同数据稀疏度)情况的RMSE"
训练集占比 | 80% | 70% | 50% | 30% |
UB | 0.953 9±0.018 7 | 0.978 4±0.021 4 | 1.038 9±0.024 3 | 1.151 0±0.027 9 |
FM | 0.831 6±0.017 3 | 0.846 2±0.019 8 | 0.857 7±0.020 7 | 0.889 4±0.023 2 |
DNN | 0.839 3±0.023 1 | 0.843 8±0.025 5 | 0.866 8±0.027 1 | 0.898 7±0.029 6 |
DeepFM | 0.810 3±0.011 7 | 0.823 1±0.012 8 | 0.838 9±0.013 5 | 0.846 7±0.015 2 |
DCN | 0.811 7±0.010 7 | 0.825 6±0.011 6 | 0.841 9±0.012 9 | 0.854 6±0.014 7 |
WD | 0.809 8±0.008 4 | 0.817 1±0.009 3 | 0.836 5±0.010 4 | 0.843 9±0.011 5 |
本文模型 | 0.800 2±0.010 8 | 0.812 4±0.011 2 | 0.827 4±0.012 6 | 0.837 6±0.013 9 |
[1] | 项亮 . 推荐系统实践[M]. 北京: 人民邮电出版社, 2012. |
XIANG L . The Development of recommendation system[M]. Beijing: Posts & Telecom Press, 2012. | |
[2] | BPRATTOL , CARTAS , FENUG ,et al. Semantics-aware content-based recommender systems:Design and architecture guidelines[J]. Neurocomputing, 2017(254): 79-85. |
[3] | YANG S , KORAYEM M , ALJADDA K ,et al. Combining content-based and collaborative filtering for job recommendation system:a cost-sensitive statistical relational learning approach[J]. Knowledge-Based Systems, 2017(136): 37-45. |
[4] | RENDLE S , . Factorization machines[C]// Proceedings of 2010 IEEE International Conference on Data Mining. Piscataway:IEEE Press, 2010: 995-1000. |
[5] | ZHANG L B , LUO T J , ZHANG F ,et al. A recommendation model based on deep neural network[J]. IEEE Access, 2018,6: 9454-9463. |
[6] | BI J W , LIU Y , FAN Z P . A deep neural networks based recommendation algorithm using user and item basic data[J]. International Journal of Machine Learning and Cybernetics, 2020,11(4): 763-777. |
[7] | CHENG H T , KOC L , HARMSEN J ,et al. Wide & deep learning for recommender systems[C]// DLRS 2016:Proceedings of the 1st Workshop on Deep Learning for Recommender Systems.[S.l.:s.n.], 2016: 7-10. |
[8] | GUO H F , TANG R M , YE Y M ,et al. DeepFM:a factorization-machine based neural network for CTR prediction[C]// Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence. California:International Joint Conferences on Artificial Intelligence Organization, 2017. |
[9] | WANG R X , FU B , FU G ,et al. Deep & cross network for ad click predictions[C]// ADKDD'17:Proceedings of the ADKDD'17.[S.l.:s.n.], 2017: 1-7. |
[10] | GUILHERME BRANDAO MARTINS , JOAO PAULO PAPA , HOJJAT ADELI . Deep learning techniques for recommender systems based on collaborative filtering[J]. Expert Systems, 2020,37(6): 1-21. |
[11] | MYLAVARAPU B K . Collaborative filtering and artificial neural network based recommendation system for advanced applications[J]. Journal of Computer and Communications, 2018,06(12): 1-14. |
[12] | TANG H , LEI M , GONG Q ,et al. A BP neural network recommendation algorithm based on cloud model[J]. IEEE Access, 2019(7): 35898-35907. |
[13] | 薛峰, 刘凯, 王东 ,等. 基于深度神经网络和加权隐反馈的个性化推荐[J]. 模式识别与人工智能, 2020,33(4): 295-302. |
XUE F , LIU K , WANG D ,et al. Personalized recommendation algorithm based on deep neural network and weighted implicit feedback[J]. Pattern Recognition and Artificial Intelligence, 2020,33(4): 295-302. | |
[14] | 杨洁, 朱咸军, 周献中 ,等. 基于混杂社会网络的个性化Web服务推荐方法[J]. 电子学报, 2020,48(2): 341-349. |
YANG J , ZHU X J , ZHOU X Z ,et al. Personalized web service recommendation based on heterogeneous social network[J]. Acta Electronica Sinica, 2020,48(2): 341-349. | |
[15] | HUI B , ZHANG L , ZHOU X ,et al. Personalized recommendation system based on knowledge embedding and historical behavior[J]. Applied Intelligence, 2022(52): 954-966. |
[16] | ZHEN W , ALLAM M , LIANG M B . Research on e-commerce personalized recommendation system based on big data technology[C]// Proceedings of 2021 IEEE 2nd International Conference on Information Technology,Big Data and Artificial Intelligence (ICIBA),Piscataway:IEEE Press, 2021, 909-913. |
[17] | 邵英玮, 张敏, 马为之 ,等. 融合商品潜在互补性发现的个性化推荐方法[J]. 软件学报, 2020,31(4): 1090-1100. |
SHAO Y W , ZHANG M , MA W Z ,et al. Integrating latent item-item complementarity with personalized recommendation systems[J]. Journal of Software, 2020,31(4): 1090-1100. | |
[18] | FU M S , QU H , YI Z ,et al. A novel deep learning-based collaborative filtering model for recommendation system[J]. IEEE Transactions on Cybernetics, 2019,49(3): 1084-1096. |
[19] | YAN W J , WANG D , CAO M J ,et al. Deep auto encoder model with convolutional text networks for video recommendation[J]. IEEE Access, 2019(7): 40333-40346. |
[20] | TANG J X , WANG K . Personalized top-N sequential recommendation via convolutional sequence embedding[C]// Proceedings of WSDM '18:Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining.[S.l.:s.n.], 2018: 565-573. |
[21] | 邓路佳, 刘平山 . 基于GMM-FMs的广告点击率预测研究[J]. 计算机工程, 2019,45(5): 122-126. |
DENG L J , LIU P S . Research on click-through rate prediction of advertisement based on GMM-FMs[J]. Computer Engineering, 2019,45(5): 122-126. | |
[22] | YANG D Q , CHEN L H , LIANG J Q ,et al. Social tag embedding for the recommendation with sparse user-item interactions[C]// Proceedings of 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM). Piscataway:IEEE Press, 2018: 127-134. |
[23] | 郁豹, 李振华, 张凯 ,等. 基于DeepFM模型的广告推荐系统研究[J]. 计算机应用与软件, 2019,36(7): 307-310,316. |
YU B , LI Z H , ZHANG K ,et al. Advertisement recommendation system based on deepfm model[J]. Computer Applications and Software, 2019,36(7): 307-310,316. | |
[24] | LIAN J X , ZHOU X H , ZHANG F Z ,et al. xDeepFM:combining explicit and implicit feature interactions for recommender systems[C]// Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. New York:ACM Press, 2018. |
[25] | NAUMOV M , MUDIGERE D , SHI H J M ,et al. Deep learning recommendation model for personalization and recommendation systems[EB]. 2019. |
[26] | SONG W P , SHI C C , XIAO Z P ,et al. AutoInt:automatic feature interaction learning via self-attentive neural networks[C]// Proceedings of CIKM '19:Proceedings of the 28th ACM International Conference on Information and Knowledge Management.[S.l.:s.n.], 2019: 1161-1170. |
[27] | HONGTAO X , QINGSHENG Z , HONGCHUN Q ,et al. User-based collaborative recommendation filtering algorithm using extremely valued ratings[J]. International Journal of Digital Content Technology and Its Applications, 2011,5(9): 47-54. |
[1] | 卢敏, 胡娟, 张先超, 丁伟健, 乐光学. 基于用户多特征融合的个性化推荐模型[J]. 电信科学, 2023, 39(5): 101-115. |
[2] | 诸葛斌, 尹正虎, 斯文学, 颜蕾, 董黎刚, 蒋献. 基于学生知识追踪的多指标习题推荐算法[J]. 电信科学, 2022, 38(9): 129-143. |
[3] | 周杰, Esono Mikue Bernardo Esono, 王学英, 周惠婷, 罗宏. 基于SLM-PTS算法融合的NC-OFDM峰均比优化[J]. 电信科学, 2022, 38(7): 63-74. |
[4] | 李攀攀, 谢正霞, 乐光学, 刘鑫. 基于深度学习的无线通信接收方法研究进展与趋势[J]. 电信科学, 2022, 38(2): 1-17. |
[5] | 陈志宏, 王明晓. 计算机视觉在智慧安防中的应用[J]. 电信科学, 2021, 37(8): 142-147. |
[6] | 唐博恒, 柴鑫刚. 基于云边协同的计算机视觉推理机制[J]. 电信科学, 2021, 37(5): 72-81. |
[7] | 孙姝君, 彭盛亮, 姚育东, 杨喜. 基于深度学习的调制识别综述[J]. 电信科学, 2021, 37(5): 82-90. |
[8] | 彭双, 王晓东, 彭宗举, 陈芬. 基于深度学习的快速QTMT划分[J]. 电信科学, 2021, 37(4): 73-81. |
[9] | 胡道允, 齐进, 陆钱春, 李锋, 房红强. 基于深度学习的流量工程算法研究与应用[J]. 电信科学, 2021, 37(2): 107-114. |
[10] | 张捷, 杨丽花, 王增浩, 呼博, 聂倩. 一种新型的基于深度学习的时变信道预测方法[J]. 电信科学, 2021, 37(1): 39-47. |
[11] | 李远宁,宁柏锋,董召杰. 电网机巡图像分析框架与深度学习方法[J]. 电信科学, 2020, 36(8): 167-174. |
[12] | 张婷婷,章坚武,郭春生,陈华华,周迪,王延松,徐爱华. 基于深度学习的图像目标检测算法综述[J]. 电信科学, 2020, 36(7): 92-106. |
[13] | 郭锐,冉凡春. 基于卷积神经网络的极化码译码算法[J]. 电信科学, 2020, 36(6): 119-124. |
[14] | 盛剑涛,陈茂飞,刘东鑫,汪来富,史国水,金华敏. 基于组合分类器的恶意域名检测技术[J]. 电信科学, 2020, 36(5): 47-55. |
[15] | 汪少敏,王铮,任华. 利用深度学习融合模型提升文本内容安全的研究[J]. 电信科学, 2020, 36(5): 25-30. |
阅读次数 | ||||||
全文 |
|
|||||
摘要 |
|
|||||
|