通信学报 ›› 2015, Vol. 36 ›› Issue (12): 172-177.doi: 10.11959/j.issn.1000-436x.2015326

• 隐私保护 • 上一篇    下一篇

基于多变量信源编码的隐私效用均衡方法

谷勇浩1,林九川2   

  1. 1 北京邮电大学 计算机学院 智能通信软件与多媒体北京市重点实验室,北京 100876
    2 公安部第三研究所,上海 201204
  • 出版日期:2015-12-25 发布日期:2017-07-17
  • 基金资助:
    国家自然科学基金资助项目;工信部通信软科学基金资助项目;工信部通信软科学基金资助项目;信息网络安全公安部重点实验室开放课题基金资助项目

Privacy-utility tradeoff method using multi-variable source coding

Yong-hao GU1,Jiu-chuan LIN2   

  1. 1 Beijing Key Laboratory of Intelligent Telecommunication Software and Multimedia,School of Computer Science,Beijing University of Posts and Telecommunications,Beijing 100876,China
    2 The Third Research Institute of Ministry of Public Security,Shanghai 201204,China
  • Online:2015-12-25 Published:2017-07-17
  • Supported by:
    The National Natural Science Foundation of China;Communication Soft Science Foundation of Ministry of Industry and Information;Communication Soft Science Foundation of Ministry of Industry and Information;Key Lab of Information Network Security Foundation of Ministry of Public Security

摘要:

在大数据时代,数据提供者需要保证自身隐私,数据分析者要挖掘数据潜在价值,寻找数据隐私性与数据可用性间的均衡关系成为研究热点。现有方法多数关注隐私保护方法本身,而忽略了隐私保护方法对数据可用性的影响。在对隐私效用均衡方法研究现状分析的基础上,针对数据集中不同公开信息对隐私保护需求不同的问题,提出基于多变量信源编码的隐私效用均衡方法,并给出隐私效用均衡区域。分析表明,隐私信息与公开信息的关联度越大,对公开信息扰动程度的增加会显著提高隐私保护效果。同时,方差较大的变量对应的公开信息,可选择较小的扰动,确保公开信息可用性较大。

关键词: 隐私保护, 隐私效用均衡, 信源编码, 率失真

Abstract:

In the age of big data,data providers need to ensure their privacy,while data analysts need to mine the value of data.So,how to find the privacy-utility tradeoff has become a research hotspot.Current works mostly focus on privacy preserving methods,ignoring the data utility.Based on the current research of privacy utility equilibrium methods,a privacy-utility tradeoff method using multi-variable source coding was proposed to solve the problem that different public datasets in the same database have different privacy requirements.Two results are obtained by simulations.The first result is that the greater the association degree between the private information and public information,the increase of the distortion degree of public information will significantly improve the effect of privacy preservation.The second result is that public information with larger variance should be less distorted to ensure more utility.

Key words: privacy preservation, privacy-utility tradeoff;, source coding, rate distortion

No Suggested Reading articles found!