Journal on Communications ›› 2021, Vol. 42 ›› Issue (3): 23-35. doi: 10.11959/j.issn.1000-436x.2021025

• Papers

Generalized Grad-CAM attacking method based on adversarial patch

Nianwen SI¹, Wenlin ZHANG¹, Dan QU¹, Heyu CHANG², Shengxiang LI¹, Tong NIU¹

  1. Department of Information System Engineering, Information Engineering University, Zhengzhou 450001, China
  2. Department of Cryptogram Engineering, Information Engineering University, Zhengzhou 450001, China
  • Revised: 2020-12-22   Online: 2021-03-25   Published: 2021-03-01
  • Supported by:
    The National Natural Science Foundation of China (61673395)

Abstract:

To verify the fragility of Grad-CAM, a Grad-CAM attack method based on adversarial patches was proposed. By adding a constraint on the Grad-CAM to the classification loss function, an adversarial patch could be optimized and an adversarial image synthesized. The adversarial image steered the Grad-CAM interpretation result towards the patch area while the classification result remained unchanged, thereby attacking the interpretation. Meanwhile, batch training on the dataset and an added perturbation norm constraint improved the generalization and multi-scene usability of the adversarial patch. Experimental results on the ILSVRC2012 dataset showed that, compared with existing methods, the proposed method attacked the Grad-CAM interpretation results more simply and effectively while maintaining classification accuracy.
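
The objective described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact loss, so the form used here (cross-entropy plus a weighted penalty on Grad-CAM mass falling outside the patch) is an assumption, and the names `grad_cam`, `patch_mass_fraction`, `attack_loss`, and the weight `lam` are hypothetical. The toy inputs stand in for feature maps and gradients that a real network would supply. The perturbation norm constraint and batch training mentioned in the abstract are left out.

```python
import math

def grad_cam(activations, gradients):
    """Grad-CAM heatmap from K feature maps (K x H x W nested lists) and the
    gradients of the target class score with respect to those maps."""
    k = len(activations)
    h, w = len(activations[0]), len(activations[0][0])
    # Channel weights: global average pooling of the gradients.
    weights = [sum(sum(row) for row in g) / (h * w) for g in gradients]
    # Weighted sum of the feature maps, followed by ReLU.
    return [[max(0.0, sum(weights[c] * activations[c][i][j] for c in range(k)))
             for j in range(w)] for i in range(h)]

def patch_mass_fraction(cam, patch_mask):
    """Share of the total heatmap mass that lies inside the patch region."""
    total = sum(sum(row) for row in cam)
    if total == 0.0:
        return 0.0
    inside = sum(cam[i][j] * patch_mask[i][j]
                 for i in range(len(cam)) for j in range(len(cam[0])))
    return inside / total

def cross_entropy(logits, label):
    """Numerically stable softmax cross-entropy for one example."""
    m = max(logits)
    return m + math.log(sum(math.exp(z - m) for z in logits)) - logits[label]

def attack_loss(logits, label, cam, patch_mask, lam=1.0):
    """Assumed combined objective: keep the original label (first term) while
    pulling the Grad-CAM heatmap onto the patch (second term)."""
    return cross_entropy(logits, label) + lam * (1.0 - patch_mass_fraction(cam, patch_mask))

# Toy example: two 2x2 feature maps, patch in the bottom-right cell.
activations = [[[1.0, 0.0], [0.0, 0.0]], [[0.0, 0.0], [0.0, 2.0]]]
gradients = [[[1.0, 1.0], [1.0, 1.0]], [[0.5, 0.5], [0.5, 0.5]]]
patch_mask = [[0, 0], [0, 1]]

cam = grad_cam(activations, gradients)
loss = attack_loss([0.0, 0.0], 0, cam, patch_mask)
print(cam, loss)
```

In an actual attack, the patch pixels would be updated by gradient descent on such a loss inside an automatic differentiation framework, averaged over batches of images. This sketch only evaluates the objective for fixed inputs.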

Key words: convolutional neural network, interpretability, adversarial patch, class activation map, saliency map
