通信学报 ›› 2016, Vol. 37 ›› Issue (5): 125-129.doi: 10.11959/j.issn.1000-436x.2016100

• 学术论文 • 上一篇    下一篇

基于聚类匿名化的差分隐私保护数据发布方法

刘晓迁,李千目   

  1. 南京理工大学计算机科学与工程学院,江苏 南京210094
  • 出版日期:2016-05-25 发布日期:2016-06-01
  • 基金资助:
    中央高校基本科研业务专项基金资助项目;国家自然科学基金资助项目;江苏省未来网络前瞻性研究基金资助项目;江苏省产学研前瞻性基金资助项目;江苏省产学研前瞻性基金资助项目;江苏省产学研前瞻性基金资助项目;江苏省普通高校研究生创新计划基金资助项目

Differentially private data release based on clustering anonymization

Xiao-qian LIU,Qian-mu LI   

  1. School of Computer Science and Engineering,Nanjing University of Science and Technology,Nanjing 210094,China
  • Online:2016-05-25 Published:2016-06-01
  • Supported by:
    The Fundational Research Funds for the Central Universities;The National Natural Science Foundation of China;The Future Network Prospective Study Project of Jiangsu Province;The Industry-University-Research Perspective Project of Jiangsu Province;The Industry-University-Research Perspective Project of Jiangsu Province;The Industry-University-Research Perspective Project of Jiangsu Province;Graduate Students Research Innovation Plan of Jiangsu Province

摘要:

基于匿名化技术的理论基础,采用DBSCAN聚类算法对数据记录进行聚类,实现将个体记录匿名化隐藏于一组记录中。为提高隐私保护程度,对匿名化划分的数据添加拉普拉斯噪声,扰动个体数据真实值,以实现差分隐私保护模型的要求。通过聚类,分化查询函数敏感性,提高数据可用性。对算法隐私性进行证明,并实验说明发布数据的可用性。

关键词: 差分隐私, 隐私保护, 聚类, 数据发布, 匿名化

Abstract:

Based on the theory of anonymization,the DBSCAN method was applied to divide all the data records into different groups to cover individuals.To provide priv enhancement,the Laplace noise was added to the anonymized partitioned data to perturb the real value of data record so that the requirements of differential privacy model were satis-fied.With the clustering operation,the sensitivity of the query function has been partitioned to improve data utility.The proof of privacy has been given and experimental results have been provided to evaluate the utility of the released data.

Key words: differential privacy, privacy preservation, clustering, data release, anonymization

No Suggested Reading articles found!