通信学报 ›› 2023, Vol. 44 ›› Issue (8): 241-250.doi: 10.11959/j.issn.1000-436x.2023151

• 学术通信 • 上一篇    

基于全局-局部自注意力网络的视频异常检测方法

杨静1,2, 吴成茂3, 周流平1   

  1. 1 广州铁路职业技术学院信息工程学院,广东 广州 510430
    2 菲律宾圣保罗大学,土格加劳 3500
    3 西安邮电大学电子工程学院,陕西 西安 710121
  • 修回日期:2023-07-19 出版日期:2023-08-01 发布日期:2023-08-01
  • 作者简介:杨静(1986- ),女,陕西西安人,圣保罗大学博士生,广州铁路职业技术学院讲师,主要研究方向为智能视觉识别等
    吴成茂(1968- ),男,四川仪陇人,西安邮电大学高级工程师,主要研究方向为智能信息处理、非线性动力系统与混沌、信息安全等
    周流平(1973- ),男,湖南醴陵人,广州铁路职业技术学院高级工程师,主要研究方向为通信技术、智能信息处理等
  • 基金资助:
    广东省高校青年创新人才基金资助项目(2020KQNCX198);广州市基础研究计划基础与应用基础研究基金资助项目(104267483017)

Novel video anomaly detection method based on global-local self-attention network

Jing YANG1,2, Chengmao WU3, Liuping ZHOU1   

  1. 1 School of Information Engineering, Guang Zhou Railway Ploytechnic, Guangzhou 510430, China
    2 St.Paul University Phillippines, Tuguegarao 3500, Philippines
    3 School of Electronic Engineering, Xi’an University of Posts and Telecommunications, Xi’an 710121, China
  • Revised:2023-07-19 Online:2023-08-01 Published:2023-08-01
  • Supported by:
    The Young Innovative Talents Project of Guangdong Province(2020KQNCX198);Basic and Applied Basic Research Project of Guangzhou Basic Research Program(104267483017)

摘要:

为提升视频异常检测精度,提出一种基于全局-局部自注意力网络的视频异常检测方法。首先,融合视频序列与其对应的 RGB 序列凸显物体的运动变化;其次,通过膨胀卷积层捕获视频序列在局部区域的时序相关性,并利用自注意力网络计算视频全局时序的依赖性,同时,依靠增加基础网络U-Net的深度并结合相关运动和表征约束对网络模型进行端到端的训练学习,从而提升模型的检测精度和鲁棒性;最后,对公开数据集 UCSD Ped2、CUHK Avenue 和ShanghaiTech 进行测试并对所得结果进行可视化分析。实验结果表明,所提方法的检测精度AUC值分别达到了97.4%、86.8%和73.2%,其性能明显优于对比方法。

关键词: 视频异常检测, 自注意力, 预测, 重构

Abstract:

In order to improve the accuracy of video anomaly detection, a novel video anomaly detection method based on global-local self-attention network was proposed.Firstly, the video sequence and the corresponding RGB sequence were fused to highlight the motion change of the object.Secondly, the temporal correlation of the video sequence in the local area was captured by the expansion convolution layer, along with the self-attention network was utilized to compute the global temporal dependencies of the video sequence.Meanwhile, by deepening the basic network U-Net and combining the relevant motion and representation constraints, the network model was trained end-to-end to improve the detection accuracy and robustness of the model.Finally, experiments were carried out on the public data sets UCSD Ped2, CUHK Avenue and ShanghaiTech, as well as the test results were visually analyzed.The experimental results show that the detection accuracy AUC of the proposed method reaches 97.4%, 86.8% and 73.2% respectively, which is obviously better than that of the compared methods.

Key words: video anomaly detection, self-attention, prediction, reconstruction

中图分类号: 

No Suggested Reading articles found!