网络与信息安全学报 ›› 2021, Vol. 7 ›› Issue (4): 53-67.doi: 10.11959/j.issn.2096-109x.2021038

• 专栏Ⅰ:网络攻防技术 • 上一篇    下一篇

基于侧信道与量化推理缺陷的模型逆向攻击

李景海1, 唐明1,2, 黄诚轩1   

  1. 1 武汉大学国家网络安全学院空天信息安全与可信计算教育部重点实验室,湖北 武汉 430072
    2 密码科学技术国家重点实验室,北京 100878
  • 修回日期:2021-01-20 出版日期:2021-08-15 发布日期:2021-08-01
  • 作者简介:李景海(1996− ),男,重庆人,武汉大学硕士生,主要研究方向为侧信道与AI安全
    唐明(1976− ),女,湖北武汉人,武汉大学教授、博士生导师,主要研究方向为信息安全、密码学、密码芯片
    黄诚轩(1997− ),男,湖北武汉人,武汉大学硕士生,主要研究方向为侧信道与AI安全
  • 基金资助:
    国家自然科学基金(61972295);武汉市科技项目应用基础前沿专项(2019010701011407)

Using side-channel and quantization vulnerability to recover DNN weights

Jinghai LI1, Ming TANG1,2, Chengxuan HUANG1   

  1. 1 Key Laboratory of Aerospace Information Security and Trusted Computing, Ministry of Education, School of Cyber Science and Engineering, Wuhan University, Wuhan 430072, China
    2 State Key Laboratory of Cryptology, Beijing 100878, China
  • Revised:2021-01-20 Online:2021-08-15 Published:2021-08-01
  • Supported by:
    The National Natural Science Foundation of China(61972295);The Frontier Applied Basic Research Project of Science and Technology Department of Wuhan(2019010701011407)

摘要:

模型逆向攻击旨在恢复部署在推理终端的神经网络模型的结构和权重值,是 AI 安全中的基础问题,为对抗样本等高阶攻击提供数据支撑。提出了一种名为 Cluster-based SCA 的新型模型权重逆向方法,该方法不要求攻击者构造泄露模型。Cluster-based SCA方法以量化推理中存在的安全隐患为出发点,深入分析了量化推理过程,发现在量化推理中存在的输出序列分类不等价现象可以判断猜测权重的正确与否。Cluster-based SCA 将采集到的模型运行时产生的侧信道信息按照假设权重产生的中间值进行分类,以分类后的平均离散系数 σ ¯ 为评判标准,取 σ ¯ 最小时的权重为逆向权重。在仿真实验上验证了 Cluster-based SCA方法的有效性,实验使用汉明重模型来模拟AI芯片的泄露模型,对于目标CNN,Cluster-based SCA方法以52.66%的TOP2恢复率恢复了其第一层卷积层所有卷积核权重,对于取值位于显著区的权重,TOP2的恢复率均达到了100%。

关键词: AI安全, 模型逆向攻击, 量化推理缺陷, 侧信道分析, Cluster-basedSCA

Abstract:

Model extraction attack focuses on reverse engineering architecture and weights of DNN model deployed in edge.Model extraction attack is a basic security problem in AI security, it underlies advanced attacks as data provider, such as adversarial sample and data poisoning.A novel method named Cluster-based SCA was proposed,this method did not need leakage model.Cluster-based SCA was based on vulnerability of quantized inference.There exist a phenomenon in multiplication operation in quantized inference, which the output of different weights were not equivalent in respect of classification.It can be used to distinguish different weights.The proposed method computed output activations of each DNN layer with guessing weight.Then acquired side channel signal were classified into different class, the taxonomy was corresponding output activations' value.Average dispersion of all classes σ ¯ was used to decide whether guess was right.The effectiveness of Cluster-based SCA method was verified by simulation experiment and HW model was used as target leakage model.For all weights from first convolution layer of target CNN model, TOP2 recovery rate was 52.66%.And for large weights in significant interval,TOP2 recover rate was 100%.

Key words: AI security, model extraction attack, quantization vulnerability, side-channel analysis, Cluster-based SCA

中图分类号: 

No Suggested Reading articles found!