通信学报 ›› 2022, Vol. 43 ›› Issue (11): 225-232.doi: 10.11959/j.issn.1000-436x.2022221

• 学术通信 • 上一篇    下一篇

基于多视角图神经网络的欺诈检测算法

陈卓, 朱淼, 杜军威   

  1. 青岛科技大学信息科学技术学院,山东 青岛 266061
  • 修回日期:2022-09-30 出版日期:2022-11-25 发布日期:2022-11-01
  • 作者简介:陈卓(1978− ),女,山东青岛人,博士,青岛科技大学副教授、硕士生导师,主要研究方向为自然语言处理、推荐系统等
    朱淼(1998− ),女,安徽六安人,青岛科技大学硕士生,主要研究方向为图神经网络、异常检测等
    杜军威(1974−),男,山东威海人,博士,青岛科技大学教授、博士生导师,主要研究方向为数据挖掘、知识图谱与知识工程等
  • 基金资助:
    国家自然科学基金资助项目(62172249);国家自然科学基金资助项目(61973180);国家自然科学基金资助项目(62202253);山东省自然科学基金资助项目(ZR2021MF092)

Multi-view graph neural network for fraud detection algorithm

Zhuo CHEN, Miao ZHU, Junwei DU   

  1. School of Information Science and Technology, Qingdao University of Science and Technology, Qingdao 266061, China
  • Revised:2022-09-30 Online:2022-11-25 Published:2022-11-01
  • Supported by:
    The National Natural Science Foundation of China(62172249);The National Natural Science Foundation of China(61973180);The National Natural Science Foundation of China(62202253);The Natural Science Foundation of Shandong Province(ZR2021MF092)

摘要:

针对欺诈检测领域样本标签不平衡、欺诈节点之间缺乏必要连接,导致欺诈检测任务不符合图神经网络同质性假设的问题,提出了基于多视角图神经网络的欺诈检测(MGFD)算法。首先,利用结构无关的编码器对网络中节点进行属性编码,以学习欺诈节点与正常节点之间的差异,使用层次注意力机制对网络中多视角信息进行融合,在学习差异的基础上充分利用网络中不同视角之间的交互信息对节点进行建模;然后,基于数据不平衡比采样子图,依据欺诈节点连接特性构建样本进行分类学习,解决样本标签不平衡的问题;最后,预测标签判别节点是否为欺诈节点。在公开数据集上的实验表明,MGFD算法在基于图的欺诈检测领域检测效果优于对比方法。

关键词: 欺诈检测, 异常检测, 注意力机制, 图表示学习, 不平衡学习

Abstract:

Aiming at the problem that in the field of fraud detection, imbalance labels and lack of necessary connections between fraud nodes, resulting in fraud detection tasks not conforming to the hypothesis of homogeneity of graph neural networks, multi-view graph neural network for fraud detection (MGFD) algorithm was proposed.First, A structure-independent encoder was used to encode the attributes of nodes in the network to learn the difference between the fraud node and the normal node.The hierarchical attention mechanism was designed to integrate the multi-view information in the network, and made full use of the interaction information between different perspectives in the network to model the nodes on the basis of learning differences.Then, based on the data imbalance ratio sampled subgraph, the sample was constructed according to the connection characteristics of fraud nodes for classification, which solved the problem of imbalance sample labels.Finally, the prediction label was used to identify whether a node is fraudulent.Experiments on real-world datasets have shown that the MGFD algorithm outperforms the comparison method in the field of graph-based fraud detection.

Key words: fraud detection, anomaly detection, attention mechanism, graph representation learning, imbalance learning

中图分类号: 

No Suggested Reading articles found!