Chinese Journal of Network and Information Security ›› 2016, Vol. 2 ›› Issue (5): 30-38. doi: 10.11959/j.issn.2096-109x.2016.00049

• Papers

Micro-blog topic detection algorithm based on topic model

Hua-jun HUANG, Jun-shan TAN, Jiao-hua QIN

  1. College of Computer and Information Engineering, Central South University of Forestry & Technology, Changsha 410004, China
  • Revised: 2016-05-06 Online: 2016-05-15 Published: 2020-03-26
  • Supported by:
    The National Natural Science Foundation of China (61304208); The Natural Science Foundation of Hunan Province (13JJ2031); Youth Scientific Research Foundation of Central South University of Forestry & Technology (QJ2012009A)

Abstract:

Micro-blog data is real-time, high-volume, short-text, and noise-rich, which makes it a challenge for traditional topic detection techniques. A novel micro-blog topic detection algorithm based on a topic model is proposed. First, the micro-blog data is represented as a text-word matrix and a word relation matrix, and topic words are extracted from these two matrices. Second, the topic model is obtained by clustering. Finally, micro-blog topics are detected by clustering the texts against the topic model. Experimental results show that the proposed algorithm detects text topics effectively: under the best parameter group, measured by precision, recall, and F value, the F value is about 95%.
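
The three steps named in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the paper's formulas, so everything concrete here is an assumption made for illustration: the word relation matrix is taken to be a per-text co-occurrence count, topic words are kept by hypothetical thresholds `min_df` and `min_cooc`, the "clustering" step is approximated by linking topic words that co-occur often enough, and each text is assigned to the topic with the largest word overlap. It is not the authors' implementation.

```python
from collections import Counter
from itertools import combinations


def build_matrices(docs):
    """Build a text-word count matrix and a word co-occurrence (relation) matrix."""
    vocab = sorted({w for doc in docs for w in doc})
    index = {w: i for i, w in enumerate(vocab)}
    text_word = [[0] * len(vocab) for _ in docs]
    relation = [[0] * len(vocab) for _ in vocab]
    for row, doc in zip(text_word, docs):
        for w, count in Counter(doc).items():
            row[index[w]] = count
        # Assumption: two words are "related" once for each text containing both.
        for a, b in combinations(sorted(set(doc)), 2):
            relation[index[a]][index[b]] += 1
            relation[index[b]][index[a]] += 1
    return vocab, text_word, relation


def extract_topic_words(vocab, text_word, relation, min_df=2, min_cooc=2):
    """Assumed rule: keep words that are frequent and strongly related to another word."""
    words = []
    for j, w in enumerate(vocab):
        doc_freq = sum(1 for row in text_word if row[j] > 0)
        if doc_freq >= min_df and max(relation[j], default=0) >= min_cooc:
            words.append(w)
    return words


def cluster_topic_words(words, vocab, relation, min_cooc=2):
    """Group topic words into topics by following strong co-occurrence links."""
    index = {w: i for i, w in enumerate(vocab)}
    unseen = list(words)
    topics = []
    while unseen:
        stack = [unseen.pop(0)]
        group = set(stack)
        while stack:
            w = stack.pop()
            linked = [u for u in unseen if relation[index[w]][index[u]] >= min_cooc]
            for u in linked:
                unseen.remove(u)
                group.add(u)
                stack.append(u)
        topics.append(sorted(group))
    return topics


def assign_texts(docs, topics):
    """Label each text with the index of the best-overlapping topic, or None."""
    labels = []
    for doc in docs:
        overlaps = [len(set(doc) & set(topic)) for topic in topics]
        best = max(range(len(topics)), key=overlaps.__getitem__, default=None)
        labels.append(best if best is not None and overlaps[best] > 0 else None)
    return labels


def detect_topics(docs, min_df=2, min_cooc=2):
    """Run the three assumed steps on pre-tokenized texts."""
    vocab, text_word, relation = build_matrices(docs)
    words = extract_topic_words(vocab, text_word, relation, min_df, min_cooc)
    topics = cluster_topic_words(words, vocab, relation, min_cooc)
    return topics, assign_texts(docs, topics)
```

The sketch expects each micro-blog post as a list of already segmented words; word segmentation, stop-word removal, and the evaluation by precision, recall, and F value are outside its scope.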

Key words: topic detection, topic model, text word matrix, word relation matrix
