Journal on Communications (通信学报) ›› 2015, Vol. 36 ›› Issue (8): 125-134. doi: 10.11959/j.issn.1000-436x.2015204

• Academic paper •

Algorithm for k-anonymity based on projection area density partition

Chao WANG (王超), Jing YANG (杨静), Jian-pei ZHANG (张健沛), Gang LV (吕刚)

  1. College of Computer Science and Technology, Harbin Engineering University, Harbin 150001, China
  • Online: 2015-08-25 Published: 2015-08-25
  • Supported by:
    The National Natural Science Foundation of China (three grants); The Research Fund for the Doctoral Program of Higher Education of China (two grants); The Natural Science Foundation of Heilongjiang Province; The Harbin Special Funds for Technological Innovation Research (Outstanding Academic Leaders)

Abstract:

In privacy-preserving data publishing, existing algorithms do not consider the distance between adjacent data points within a temporary anonymous group when partitioning it. Partitioning therefore easily incurs unnecessary information loss, which reduces the utility of the published anonymous data set. To address this problem, the concepts of rectangular projection area, projection area density, and partition characterization coefficient are introduced. The aim is to partition temporary anonymous groups sensibly by increasing the projection area density of the record points, so that the resulting anonymous groups incur as little information loss as possible. A k-anonymity algorithm based on projection area density partition is then proposed. By optimizing the rounding partition function and the attribute dimension selection strategy, it reduces unnecessary information loss during partitioning without reducing the number of anonymous groups, further improving the utility of the published data set. Theoretical analysis and experiments verify the rationality and effectiveness of the algorithm.

Key words: privacy preserving, temporary anonymous group, rectangular projection area, projection area density, partition characterization coefficient
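
The idea of keeping nearby points together when splitting a temporary anonymous group can be illustrated with a small sketch. This is not the paper's algorithm: the abstract does not give the definitions of projection area density, the partition characterization coefficient, or the rounded partition function. The sketch assumes two numeric quasi-identifiers, takes "density" to mean the number of records divided by the area of their bounding rectangle, and substitutes an exhaustive search over axis-aligned cuts for the paper's partition function. All function names are hypothetical.

```python
from typing import List, Optional, Tuple

Point = Tuple[float, float]


def bounding_area(points: List[Point]) -> float:
    """Area of the axis-aligned bounding rectangle of the points."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return (max(xs) - min(xs)) * (max(ys) - min(ys))


def density(points: List[Point]) -> float:
    """Assumed density: records per unit of bounding-rectangle area."""
    area = bounding_area(points)
    return float("inf") if area == 0 else len(points) / area


def best_split(points: List[Point], k: int) -> Optional[Tuple[List[Point], List[Point]]]:
    """Try every cut on each dimension that leaves at least k records
    on both sides, and keep the cut with the smallest total bounding
    area (that is, the densest pair of groups)."""
    best_cost = None
    best_pair = None
    for dim in (0, 1):
        ordered = sorted(points, key=lambda p: p[dim])
        for cut in range(k, len(ordered) - k + 1):
            left, right = ordered[:cut], ordered[cut:]
            cost = bounding_area(left) + bounding_area(right)
            if best_cost is None or cost < best_cost:
                best_cost, best_pair = cost, (left, right)
    return best_pair


def partition(points: List[Point], k: int) -> List[List[Point]]:
    """Recursively split until no group can be cut into two groups of
    size >= k. Every returned group satisfies k-anonymity."""
    if len(points) < 2 * k:
        return [points]
    pair = best_split(points, k)
    if pair is None:
        return [points]
    left, right = pair
    return partition(left, k) + partition(right, k)


records = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
groups = partition(records, k=2)
```

On the six sample records, the cut falls in the wide gap between the two clusters rather than at a position that would stretch one rectangle across the gap, so each group stays compact. Minimizing raw area also favors collinear groups (zero area), which a real implementation would need to handle; that simplification is part of the sketch, not of the published method.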
