Journal on Communications ›› 2021, Vol. 42 ›› Issue (11): 133-144.doi: 10.11959/j.issn.1000-436x.2021215

• Papers • Previous Articles     Next Articles

Threat analysis and defense methods of deep-learning-based data theft in data sandbox mode

Hezhong PAN1, Peiyi HAN2, Xiayu XIANG1, Shaoming DUAN2, Rongfei ZHUANG2, Chuanyi LIU2,3   

  1. 1 School of Cyberspace Security, Beijing University of Posts and Telecommunications, Beijing 100876, China
    2 School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), Shenzhen 518055, China
    3 Cyberspace Security Research Center, Peng Cheng Laboratory, Shenzhen 518066, China
  • Revised:2021-08-31 Online:2021-11-25 Published:2021-11-01
  • Supported by:
    The National Natural Science Foundation of China(61872110)

Abstract:

The threat model of deep-learning-based data theft in data sandbox model was analyzed in detail, and the degree of damage and distinguishing characteristics of this attack were quantitatively evaluated both in the data processing stage and the model training stage.Aiming at the attack in the data processing stage, a data leakage prevention method based on model pruning was proposed to reduce the amount of data leakage while ensuring the availability of the original model.Aiming at the attack in model training stage, an attack detection method based on model parameter analysis was proposed to intercept malicious models and prevent data leakage.These two methods do not need to modify or encrypt data, and do not need to manually analyze the training code of deep learning model, so they can be better applied to data theft defense in data sandbox mode.Experimental evaluation shows that the defense method based on model pruning can reduce 73% of data leakage, and the detection method based on model parameter analysis can effectively identify more than 95% of attacks.

Key words: data sandbox, data theft, security of AI

CLC Number: 

No Suggested Reading articles found!