Chinese Journal of Network and Information Security ›› 2022, Vol. 8 ›› Issue (2): 139-149.doi: 10.11959/j.issn.2096-109x.2021099

• Papers • Previous Articles     Next Articles

Two-stage community detection algorithm based on label propagation

Xueliang SUN1, Wei WANG2, Junheng HUANG2, Guodong XIN2, Bailing WANG2   

  1. 1 Faculty of Computing, Harbin Institute of Technology, Harbin 150001, China
    2 School of Computer Science and Technology, Harbin Institute of Technology, Weihai 264209, China
  • Revised:2021-10-08 Online:2022-04-15 Published:2022-04-01
  • Supported by:
    The National Key R&D Program of China(2020YFB2009502);The Fundamental Research Funds for the Central Universities(HIT.NSRIF.2020098);The Key R&D Program of Shandong Province(2017CXGC0706)

Abstract:

Community detection is an important research topic of complex network analysis.The detection results help to understand the community structure of complex networks and provide support for downstream tasks, such as content recommendation, link detection.Considering the challenge of community detection in complex networks, a two-stage community detection algorithm based on label propagation (TS-LPA) was proposed.The TS-LPA algorithm quantified the propagation capability of nodes based on the extended neighborhood.Then a new evaluation index was proposed to measure the probability of influence between nodes using the information of nodes and the weight of edges in the network.Based on the calculation of node centrality, the algorithm determined the updating sequence of node labels and the selection strategy of seed nodes, which eliminated the instability of the algorithm in the updating process.The TS-LPA algorithm used the breadth-first propagation idea and introduced the second-stage label propagation method to improve the quality of community detection.When label began to spread, all neighboring nodes had an influence on the label of the related node.Meanwhile, the influence of neighboring seed nodes was added to complete label updating, in order to reduce the dominance of neighboring nodes on the updated node.The experimental results of different real data sets and synthetic data sets show that TS-LPA algorithm can eliminate randomness and show strong stability while effectively improving the quality of community detection.

Key words: community detection, label propagation, node centrality, breadth-first search

CLC Number: 

No Suggested Reading articles found!