电信科学 ›› 2010, Vol. 26 ›› Issue (11): 85-88.doi: 10.3969/j.issn.1000-0801.2010.11.023

• 研究与开发 • 上一篇    下一篇

自中心网络生成的高效分布式设计与实现

金欣,王晶,沈奇威   

  1. 北京邮电大学网络与交换技术国家重点实验室 北京100876;东信北邮信息技术有限公司 北京100191
  • 出版日期:2010-11-15 发布日期:2010-11-15
  • 基金资助:
    国家杰出青年科学基金资助项目;国家“973”计划资助项目;国家“973”计划资助项目;国家自然科学基金资助项目;国家自然科学基金资助项目;中央高校基本科研业务费专项资金资助项目;电子信息产业发展基金资助项目(基于3G的移动业务应用系统)

High Efficiency Implementation of a Distributed Egocentric Network Generating Algorithm

Xin Jin,Jing Wang,Qiwei Shen   

  1. State Key Laboratory of Networking and Switching Technology,Beijing University of Posts and Telecommunications,Beijing 100876,China;EB Information Technology Co.,Ltd.,Beijing 100191,China
  • Online:2010-11-15 Published:2010-11-15

摘要:

近年来数据挖掘在电信领域的应用越来越广泛,而自中心网络从人与环境交互的角度为数据分析提出了新的思路,但是由于数据量、数据维度、计算复杂度等原因,传统的计算方式不能应付海量数据的自中心网络生成和分析的需求。本文首先给出了基于mapreduce模型的传统自中心网络生成算法的实现,然后提出了新的基于三角形提取自中心网络生成算法,并给出了基于mapreduce编程模型的实现,该算法针对mapreduce模型和真实的社会网络进行了优化并实现性能提升,最后对两种算法进行了运行时间和IO的比较。

关键词: 社会网络分析, 分布式, hadoop, mapreduce, 数据挖掘, 自中心网络

Abstract:

Basing on the researching of the relation between personal and the culture he is in and the interactive between he and the other persons,we can find a lot of character of the person by analysis of egocentric network.Recently,data mining is more and more widely used in the telecommunication area,and egocentric is a new idea that trade the person as a part of the whole network.But because of the size of data,dimensionality of data and the complication of the computation,traditional methods is not suitable for this kind of application.In this article,we give the implementation of the traditional egocentric algorithm based on mapreduce module.And then we propose a new egocentric network generating algorithm based on the discovery of triangles.After that,we give the implementation of the new algorithm based on the mapreduce module.In this new algorithm,we make some optimize in connection with the mapreduce module and the character of the real social network to enhance the efficiency.At last,we will compare these two algorithms by time consumed and IO.

Key words: social network analysis, distribute, hadoop, mapreduce, data mining, egocentric network

No Suggested Reading articles found!