通信学报 ›› 2023, Vol. 44 ›› Issue (2): 198-209.doi: 10.11959/j.issn.1000-436x.2023028

• 学术通信 • 上一篇    下一篇

基于目标扰动的AdaBoost算法

张淑芬1,2,3, 董燕灵1,2,4, 徐精诚1,2,4, 王豪石1,2,4   

  1. 1 华北理工大学理学院,河北 唐山 063210
    2 河北省数据科学与应用重点实验室,河北 唐山 063210
    3 唐山市大数据安全与智能计算重点实验室,河北 唐山 063210
    4 唐山市数据科学重点实验室,河北 唐山 063210
  • 修回日期:2022-12-25 出版日期:2023-02-25 发布日期:2023-02-01
  • 作者简介:张淑芬(1972− ),女,河北唐山人,华北理工大学教授、硕士生导师,主要研究方向为云计算、智能信息处理、数据安全与隐私保护
    董燕灵(1998− ),女,浙江宁波人,华北理工大学硕士生,主要研究方向为数据安全与隐私保护
    徐精诚(1996− ),男,江苏常州人,华北理工大学硕士生,主要研究方向为数据安全与隐私保护
    王豪石(1997− ),男,河北邢台人,华北理工大学硕士生,主要研究方向为数据安全与隐私保护
  • 基金资助:
    国家自然科学基金资助项目(U20A20179)

AdaBoost algorithm based on target perturbation

Shufen ZHANG1,2,3, Yanling DONG1,2,4, Jingcheng XU1,2,4, Haoshi WANG1,2,4   

  1. 1 College of Science, North China University of Science and Technology, Tangshan 063210, China
    2 Hebei Key Laboratory of Data Science and Application, Tangshan 063210, China
    3 Tangshan Key Laboratory of Big Data Security and Intelligent Computing, Tangshan 063210, China
    4 Tangshan Key Laboratory of Data Science, Tangshan 063210, China
  • Revised:2022-12-25 Online:2023-02-25 Published:2023-02-01
  • Supported by:
    The National Natural Science Foundation of China(U20A20179)

摘要:

针对 AdaBoost 算法的多轮迭过程会放大为实现差分隐私保护而添加的噪声,从而导致模型收敛缓慢、数据可用性大幅降低的问题,提出了一种基于目标扰动的AdaBoost算法——DPAda,采用目标扰动的方式对样本权值进行加噪,精确计算其敏感度,并赋予其动态的隐私预算。为了解决噪声叠加过多的问题,提出基于摆动数列、随机响应和改进的随机响应3种噪声注入算法。实验结果表明,与DPAda_Random算法和DPAda_Swing算法相比,DPAda_Improved算法能实现数据的隐私保护,拥有更高的分类准确率,优于其他差分隐私AdaBoost算法,并能解决连续加噪带来的噪声过大的问题。

关键词: 差分隐私, 摆动数列, 随机响应, 隐私预算分配, AdaBoost算法

Abstract:

Aiming at the problem that the multi-round iteration process in the AdaBoost algorithm will amplify the noise added to achieve differential privacy protection, which leads to slow model convergence and greatly reduced data availability, an AdaBoost algorithm based on target perturbation—DPAda was proposed.Target perturbation was used to add noise to sample weights, accurately calculated their sensitivity, and a dynamic privacy budget was given.In order to solve the problem of excessive noise superposition, three noise injection algorithms based on swing sequence, random response and improved random response were proposed.The experimental results show that compared with DPAda_Random and DPAda_Swing, DPAda_Improved achieves the privacy protection of data, has higher classification accuracy, as well as better than other differential privacy AdaBoost algorithm, and can also solve the problem of excessive noise caused by continuous noise addition.

Key words: differential privacy, swing sequence, random response, privacy budget allocation, AdaBoost algorithm

中图分类号: 

No Suggested Reading articles found!