Chinese Journal of Network and Information Security ›› 2020, Vol. 6 ›› Issue (3): 39-49.doi: 10.11959/j.issn.2096-109x.2020035

• Special Column: New Technology Exploration on Privacy Protection • Previous Articles     Next Articles

Non-equal-width histogram publishing method based on differential privacy

Lei YANG1,2,Xiao ZHENG1,2,Wei ZHAO1,2   

  1. 1 School of Computer Science and Technology,Anhui University of Technology,Maanshan 243032,China
    2 Anhui Engineering Laboratory for Intelligent Applications and Security of Industrial Internet,Maanshan 243032,China
  • Revised:2019-12-23 Online:2020-06-01 Published:2020-07-01
  • Supported by:
    The Key R & D Program of Anhui Province,China(201904a05020071)

Abstract:

Existing histogram publishing technology based on differential privacy may show phenomenon of"retracting" and "zero bucket" when histogram is used to reflect the real distribution characteristics of data,and "too gentle" in the case of large data volume.In addition,the existing technology of the original histogram difference of privacy protection when not considering the amount of information of each group is different.In view of the above problems,a kind of non-equal-width histogram publishing method based on differential privacy was proposed.First of all,a non-isometric histogram based on the sparseness of the data should bereasonably constructed by empirical distribution function.Secondly,differential privacy protection technology should be applied to non-equal-width histogram to protect the privacy of the original non-equal-width histogram.Finally,the privacy budget should be set for each group according to the class widths of the non-equal-width histogram to improve the privacy of each group of data.The experimental results show that the sparseness of the data distribution is fully taken into account when using the proposed method to perform histogram publishing under differential privacy,effectively avoid the phenomenon of histogram with “retracting” and “zero barrels”,and the accuracy of the published histogram for reflecting the characteristics of the data distribution is guaranteed.Also,when adding noise in line with Laplace mechanism to each group,setting a reasonable privacy budget for each group according to the class widths to some extent increases the privacy of different data segments.

Key words: differential privacy, non-equal-width, histogram publishing, Laplace mechanism, privacy budget

CLC Number: 

No Suggested Reading articles found!