Journal on Communications ›› 2022, Vol. 43 ›› Issue (1): 217-226. doi: 10.11959/j.issn.1000-436x.2022016

• Correspondences •

HRDA-Net: image multiple manipulation detection and location algorithm in real scene

Ye ZHU1,2, Yilin YU1, Yingchun GUO1   

  1. 1 School of Artificial Intelligence and Data Science, Hebei University of Technology, Tianjin 300401, China
    2 Shenzhen Key Laboratory of Media Security, Shenzhen 518060, China
  • Revised: 2021-12-22 Online: 2022-01-25 Published: 2022-01-01
  • About the authors:
    Ye ZHU (1989- ), female, born in Heze, Shandong; Ph.D.; lecturer and master's supervisor at Hebei University of Technology. Her main research interests are image security forensics, image processing, and pattern recognition.
    Yilin YU (1998- ), male, born in Nanping, Fujian; master's student at Hebei University of Technology. His main research interest is image security forensics.
    Yingchun GUO (1970- ), female, born in Zhangjiakou, Hebei; Ph.D.; associate professor and master's supervisor at Hebei University of Technology. Her main research interests include image processing and pattern recognition, and artificial intelligence.
  • Supported by:
    The National Natural Science Foundation of China (62102129, 61806071, 91746207); The Natural Science Foundation of Hebei Province (F2021202030, F2020202025, F2019202381, F2019202464); The Sci-Tech Research Projects of Higher Education of Hebei Province (QN2019207, QN2020185)

Abstract:

Mainstream manipulation datasets contain only one type of tampering operation per image, and manipulation localization on real images suffers from artifacts. To address these problems, a multiple manipulation dataset (MM Dataset) was constructed for real scenes, in which each tampered image contains both splicing and removal operations. For the multiple manipulation detection and localization task, an end-to-end high-resolution representation dilation attention network (HRDA-Net) was proposed, which fuses the RGB-domain and SRM-domain features of an image through a top-down dilation convolutional attention (TDDCA) module. Mixed dilated convolution (MDC) modules then extract features separately for the splicing, removal, and manipulation detection tasks, yielding tampered-region localization and a manipulation confidence prediction. A cosine similarity loss was proposed as an auxiliary loss to improve the training efficiency of the network. Experimental results on MM Dataset indicate that HRDA-Net achieves better performance and stronger robustness than mainstream semantic segmentation methods. On the single-manipulation datasets CASIA and NIST, its F1 and AUC scores are higher than those of mainstream single-manipulation localization methods.

Key words: deep learning, multiple manipulation detection and location, MM Dataset, cosine similarity loss function
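
The abstract names a cosine similarity loss used as an auxiliary loss but does not give its formula. As a rough illustration only, the sketch below uses one common form, 1 - cos(pred, target), computed over a flattened predicted mask and a flattened ground-truth mask. The function names `cosine_similarity_loss` and `total_loss`, the `eps` guard, and the `weight` value are all assumptions made for this example and are not taken from the paper.

```python
import math
from typing import Sequence


def cosine_similarity_loss(pred: Sequence[float],
                           target: Sequence[float],
                           eps: float = 1e-8) -> float:
    """Return 1 - cos(pred, target) for two flattened masks of equal length.

    Hypothetical form: the paper's exact definition is not stated in the abstract.
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same number of pixels")
    dot = sum(p * t for p, t in zip(pred, target))
    norm_pred = math.sqrt(sum(p * p for p in pred))
    norm_target = math.sqrt(sum(t * t for t in target))
    # eps keeps the division defined when either mask is all zeros.
    cosine = dot / max(norm_pred * norm_target, eps)
    return 1.0 - cosine


def total_loss(main_loss: float,
               pred: Sequence[float],
               target: Sequence[float],
               weight: float = 0.5) -> float:
    """Add the auxiliary term to a main loss; `weight` is an assumed hyperparameter."""
    return main_loss + weight * cosine_similarity_loss(pred, target)


if __name__ == "__main__":
    predicted_mask = [0.9, 0.1, 0.8, 0.0]   # flattened per-pixel tamper probabilities
    ground_truth = [1.0, 0.0, 1.0, 0.0]     # flattened binary tamper mask
    print(cosine_similarity_loss(predicted_mask, ground_truth))
```

Under this form, the loss approaches 0 when the predicted mask points in the same direction as the ground truth and equals 1 when the two masks do not overlap at all, so it rewards agreement in the overall shape of the mask rather than per-pixel values alone.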

CLC number: 
