电信科学 ›› 2022, Vol. 38 ›› Issue (12): 65-77.doi: 10.11959/j.issn.1000-0801.2022238

• 研究与开发 • 上一篇    下一篇

基于钉板分布稀疏变分自编码器的异常检测算法研究

陈华华, 陈哲   

  1. 杭州电子科技大学通信工程学院,浙江 杭州 310018
  • 修回日期:2022-08-11 出版日期:2022-12-20 发布日期:2022-12-01
  • 作者简介:陈华华(1975- ),男,博士,杭州电子科技大学副教授,主要研究方向为数字图像处理、计算机视觉和模式识别
    陈哲(1995- ),男,杭州电子科技大学硕士生,主要研究方向为异常检测
  • 基金资助:
    浙江省“领雁”研发攻关计划项目(2022C03065)

Research on anomaly detection algorithm based on sparse variational autoencoder using spike and slab prior

Huahua CHEN, Zhe CHEN   

  1. School of Communication Engineering, Hangzhou Dianzi University, Hangzhou 310018, China
  • Revised:2022-08-11 Online:2022-12-20 Published:2022-12-01
  • Supported by:
    The “Leading Goose” Technologies Research and Development Program of Zhejiang Province(2022C03065)

摘要:

异常检测由于其广泛的应用一直是数据挖掘中一个重要的研究分支,它有助于研究人员获得重要的信息进而对数据做出更好的决策。提出了一种基于钉板分布稀疏变分自编码器的异常检测模型。首先,使用离散-连续混合模型钉板分布作为变分自编码器的先验,模拟隐变量所在空间的稀疏性,得到数据特征的稀疏表示;其次,以所提出的自编码器构建深度支持向量网络,对特征空间进行压缩,并采用最优超球体区分正常数据和异常数据;再次,以数据特征和超球体中心之间的欧氏距离完成异常检测;最后,在基准数据集MNIST (modifiednational institute of standards and technology database)和Fashion-MNIST上的实验评估表明,与现存的异常检测算法相比,本文所提出的算法具有更好的检测效果。

关键词: 异常检测, 变分自编码器, 钉板分布, 深度支持向量网络

Abstract:

Anomaly detection remains to be an essential and extensive research branch in data mining due to its widespread use in a wide range of applications.It helps researchers to obtain vital information and make better decisions about data by detecting abnormal data.Considering that sparse coding can get more powerful features and improve the performance of other tasks, an anomaly detection model based on sparse variational autoencoder was proposed.Firstly, the discrete mixed modelspike and slab distribution was used as the prior of variational autoencoder, simulated the sparsity of the space where the hidden variables were located, and obtained the sparse representation of data characteristics.Secondly, combined with the deep support vector network, the feature space was compressed, and the optimal hypersphere was found to discriminate normal data and abnormal data.And then, the abnormal fraction of the data was measured by the Euclidean distance from the data feature to the center of the hypersphere, and then the abnormal detection was carried out.Finally, the algorithm was evaluated on the benchmark datasets MNIST and Fashion-MNIST, and the experimental results show that the proposed algorithm achieves better effects than the state-of-the-art methods.

Key words: anomaly detection, variational autoencoder, spike and slab distribution, deep support vector network

中图分类号: 

No Suggested Reading articles found!