通信学报 ›› 2022, Vol. 43 ›› Issue (5): 214-225.doi: 10.11959/j.issn.1000-436x.2022095

• 学术通信 • 上一篇    

隐马尔可夫模型的异质网络链接预测方法研究

钱榕1,2, 许建婷2, 张克君1,2, 董宏宇1, 邢方远1   

  1. 1 北京电子科技学院网络空间安全系,北京 100070
    2 西安电子科技大学计算机科学与技术学院,陕西 西安 710071
  • 修回日期:2022-04-07 出版日期:2022-05-25 发布日期:2022-05-01
  • 作者简介:钱榕(1970- ),男,福建福州人,博士,北京电子科技学院副教授、硕士生导师,主要研究方向为复杂网络、数据挖掘、云计算安全等
    许建婷(1997- ),女,河北石家庄人,西安电子科技大学硕士生,主要研究方向为复杂网络、数据挖掘等
    张克君(1972- ),男,山东临沂人,博士,北京电子科技学院教授、博士生导师,主要研究方向为智能计算、信息安全等
    董宏宇(1994- ),女,内蒙古乌兰察布人,北京电子科技学院硕士生,主要研究方向为复杂网络、数据挖掘等
    邢方远(1998- ),男,辽宁宽甸人,北京电子科技学院硕士生,主要研究方向为复杂网络、数据挖掘等
  • 基金资助:
    国家重点研发计划基金资助项目(2018YFB1004101)

Research on HMM based link prediction method in heterogeneous network

Rong QIAN1,2, Jianting XU2, Kejun ZHANG1,2, Hongyu DONG1, Fangyuan XING1   

  1. 1 Department of Cyberspace Security, Beijing Electronic Science and Technology Institute, Beijing 100070, China
    2 College of Computer Science and Technology, Xidian University, Xi’an 710071, China
  • Revised:2022-04-07 Online:2022-05-25 Published:2022-05-01
  • Supported by:
    The National Key Research and Development Program of China(2018YFB1004101)

摘要:

为了解决异质网络的结构信息和语义信息挖掘不全面的问题,针对异质网络的链接预测,提出了将基于元路径的分析方式与隐马尔可夫模型相结合的链接预测方法。考虑到聚簇可以有效地捕获异质网络的结构信息,将k-means算法进行改进得到基于距离均方差最小的初始聚簇中心方法,并将其应用到隐马尔可夫模型(HMM)中,设计了基于聚簇的一阶隐马尔可夫模型(C-HMM(1))的链接预测方法,同时提出基于聚簇的二阶隐马尔可夫模型(C-HMM(2))的异质网络的链接预测方法。进一步考虑数据的特征信息,提出了将最大熵模型和二阶隐马尔可夫模型相结合的链接预测方法ME-HMM。实验结果表明,ME-HMM比C-HMM方法的链接预测精确度更高,且ME-HMM因充分考虑到数据的特征信息比C-HMM的性能更加优异。

关键词: 异质网络, 链接预测, 隐马尔可夫模型, 聚簇, 最大熵

Abstract:

In order to solve the problem that incomplete mining of structural information and semantic information in heterogeneous networks, a link prediction method combining meta-path-based analysis and hidden Markov model was proposed for link prediction of heterogeneous network.Considering that clustering could effectively capture the structural information of heterogeneous network, the k-means algorithm was improved to obtain the initial clustering center method based on the minimum distance mean square error, and it was applied to the hidden Markov model, first-order cluster hidden markov model (C-HMM(1)) link prediction method, and a link prediction method for heterogeneous network with second-order cluster hidden Markov model (C-HMM(2)) were designed.Further, considering the feature information of the data, a link prediction method called ME-HMM that combined the maximum entropy model and the second-order Markov model was proposed.The experimental results show that the ME-HMM has higher link prediction accuracy than the C-HMM, and the ME-HMM method has better performance than the C-HMM method because it fully considers the feature information of the data.

Key words: heterogeneous network, link prediction, hidden Markov model, clustering, maximum entropy

中图分类号: 

No Suggested Reading articles found!