Journal on Communications ›› 2020, Vol. 41 ›› Issue (12): 72-81.doi: 10.11959/j.issn.1000-436X.2020229

• Papers • Previous Articles     Next Articles

Social media user geolocalization based on multiple mention relationships

Yaqiong QIAO1,2, Xiangyang LUO1,2, Jiangtao MA3, Chenliang LI4, Meng ZHANG1,2, Ruixiang LI1,2   

  1. 1 Information Engineering University, Zhengzhou 450001, China
    2 State Key Laboratory of Mathematical Engineering and Advanced Computing, Zhengzhou 450001, China
    3 School of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450001, China
    4 School of Cyber Science and Engineering, Wuhan University, Wuhan 430075, China
  • Revised:2020-07-20 Online:2020-12-25 Published:2020-12-01
  • Supported by:
    The National Natural Science Foundation of China(U1804263);The National Natural Science Foundation of China(U1636219);The National Natural Science Foundation of China(61872287);The National Natural Science Foundation of China(U1736214);The National Key Research and Development Program of China(2016QY01W0105);The National Key Research and Development Program of China(2016YFB0801303);Zhongyuan Talents Program-Zhongyuan Science and Technology Innovation Leading Talent Project(1052020KJLJ0025);The Plan for Scientific Inno-vation Talent of Henan Province(184200510018);The Scientific and Technological Project of Henan Province(202102310237)

Abstract:

Aiming at the problem that the existing joint user geolocalization methods based on social media text and social relationships do not sufficiently mine the location correlation between heterogeneous data in social media, a social media user geolocalization method based on multiple mention relationships was proposed.First, a heterogeneous network was constructed by comprehensively considering the mention relationship between users, the user's mention relationship with location indicative words, and the user's mention relationship with geographic nouns.Then, a network simplification strategy was proposed to construct a user-location heterogeneous network that connects users live nearby more closely based on the common mention relationship.After that, a biased random walk algorithm was proposed for the graph node sampling to fully explore the network structure and alleviate the sparsity problem of known locations.Finally, a neural network classifier based on a multilayer perceptron was used to infer the user's location.Experimental results on three representative Twitter data sets of GEOTEXT, TW-US and TW-WORLD show that the proposed method can significantly improve the user geolocalization accuracy.

Key words: social media, heterogeneous network, user geolocalization, mention relationship

CLC Number: 

No Suggested Reading articles found!