电信科学 ›› 2021, Vol. 37 ›› Issue (4): 54-61.doi: 10.11959/j.issn.1000-0801.2021044

• 研究与开发 • 上一篇    下一篇

基于混合高斯变分自编码网络的异常检测算法

陈华华, 陈哲, 郭春生, 应娜, 叶学义, 章坚武   

  1. 杭州电子科技大学通信工程学院,浙江 杭州 310018
  • 修回日期:2021-03-06 出版日期:2021-04-20 发布日期:2021-04-01
  • 作者简介:陈华华(1975- ),男,博士,杭州电子科技大学副教授,主要研究方向为数字图像处理、计算机视觉和模式识别
    陈哲(1995- ),男,杭州电子科技大学硕士生,主要研究方向为异常检测
    郭春生(1971- ),男,杭州电子科技大学副教授,主要研究方向为视频分析与模式识别
    应娜(1978- ),女,杭州电子科技大学副教授,主要研究方向为信号处理、语音信号处理及其应用
    叶学义(1973- ),男,博士,杭州电子科技大学副教授,主要研究方向为图像处理、模式识别、信息隐藏
    章坚武(1961- ),男,博士,杭州电子科技大学教授,博士生导师,主要研究方向为移动通信、多媒体信号处理与人工智能、通信网络与信息安全

Anomaly detection algorithm based on Gaussian mixture variational auto encoder network

Huahua CHEN, Zhe CHEN, Chunsheng GUO, Na YING, Xueyi YE, Jianwu ZHANG   

  1. School of Communication Engineering, Hangzhou Dianzi University, Hangzhou 310018, China
  • Revised:2021-03-06 Online:2021-04-20 Published:2021-04-01

摘要:

异常数据是指偏离大量正常数据点的数据,往往会对各类系统产生负面影响,存在较大风险。异常检测作为一种有效的防护手段,能够检测数据中的异常,为各类系统的正常运转提供重要支撑,具有重要的现实意义。提出了一种基于混合高斯变分自编码网络的异常检测算法,该算法首先使用混合高斯先验构建变分自编码器,对输入数据进行特征提取,然后以混合高斯变分自编码器构建深度支持向量网络,压缩特征空间,并寻找最小超球体分离正常数据和异常数据,通过计算数据特征到超球体中心的欧氏距离衡量数据的异常分数,并以此进行异常检测。最后在基准数据集MNIST和Fashion-MNIST上评估了该算法,平均AUC分别达到了0.954和0.937。实验结果表明,所提出的算法取得了较好的异常检测效果。

关键词: 异常检测, 变分自编码器, 混合高斯分布, 超球体

Abstract:

Anomalous data, which deviates from a large number of normal data, has a negative impact and contains a risk on various systems.Anomaly detection can detect anomalies in the data and provide important support for the normal operation of various systems, which has important practical significance.An anomaly detection algorithm based on Gaussian mixture variational auto encoder network was proposed, in which a variational autoencoder was built to extract the features of the input data based on Gaussian mixture distribution, and using this variational autoencoder to construct a deep support vector network to compress the feature space and find the minimum hyper sphere to separate the normal data and the abnormal data.Anomalies can be detected by the score from the Euclidean distance from the feature of data to the center of the hypersphere.The proposed algorithm was evaluated on the benchmark datasets MNIST and Fashion-MNIST, and the corresponding average AUC are 0.954 and 0.937 respectively.The experimental results show that the proposed algorithm achieves preferable effects.

Key words: anomaly detection, variational autoencoder, Gaussian mixture distribution, hypersphere

中图分类号: 

No Suggested Reading articles found!