Telecommunications Science ›› 2017, Vol. 33 ›› Issue (10): 134-140. doi: 10.11959/j.issn.1000-0801.2017258

• Research and development •

A method of micro-blog topic discovery based on feature words selection and text similarity

Hongyang CHEN, Linlin WANG, Yingsheng CHEN, Jiangkun LU, Xue ZUO

  1. College of Computer Engineering, Chongqing College of Humanities Science and Technology, Chongqing 401524, China
  • Revised:2017-08-29 Online:2017-10-01 Published:2017-11-13
  • Supported by:
    The Commission of Science and Technology Plan Project of Chongqing (KJ1601601); The National Natural Science Foundation of China (61173184)

Abstract:

Some words in micro-blog short texts reduce the accuracy of text similarity calculation and, in turn, the quality of topic discovery. These words are the same in form or semantic meaning but remote from the topic. A novel feature word selection method based on micro-blog short text content and structural information is proposed, which can effectively select important feature words from the text. In addition, the computation of similarity between a text and a topic is improved. Finally, the two methods are combined and applied to micro-blog topic discovery. Experimental results show that the new topic discovery method effectively reduces the average miss rate and false detection rate and improves the quality of topic discovery.
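
The abstract gives no formulas, so the following is only an illustrative sketch of the general pipeline it describes (select feature words, compare each text with existing topics, assign or open a new topic), not the authors' method. Everything specific here is an assumption: the hashtag boost as a stand-in for "structured information", plain cosine similarity against an accumulated topic vector as a stand-in for the improved text-topic similarity, the single-pass assignment scheme, and the names `select_feature_words`, `cosine`, `discover_topics`, `boost`, `top_k`, and `threshold`.

```python
import math
from collections import Counter


def select_feature_words(tokens, hashtag_words, boost=2.0, top_k=5):
    """Keep the top_k highest-weighted words of one post.

    Weight = term frequency, multiplied by `boost` when the word also
    appears in a hashtag (an assumed structural signal, not from the paper).
    """
    counts = Counter(tokens)
    weights = {
        w: c * (boost if w in hashtag_words else 1.0)
        for w, c in counts.items()
    }
    # Sort by descending weight, then alphabetically for determinism.
    top = sorted(weights.items(), key=lambda kv: (-kv[1], kv[0]))[:top_k]
    return dict(top)


def cosine(a, b):
    """Cosine similarity between two sparse word-weight dictionaries."""
    dot = sum(v * b.get(w, 0.0) for w, v in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def discover_topics(posts, threshold=0.3):
    """Single-pass topic assignment (an assumed scheme).

    `posts` is a list of (tokens, hashtag_words) pairs. Each post joins the
    most similar existing topic if the similarity reaches `threshold`;
    otherwise it starts a new topic.
    """
    topics = []  # each topic: {"centroid": dict, "members": [post index]}
    for i, (tokens, tags) in enumerate(posts):
        vec = select_feature_words(tokens, tags)
        best, best_sim = None, 0.0
        for topic in topics:
            sim = cosine(vec, topic["centroid"])
            if sim > best_sim:
                best, best_sim = topic, sim
        if best is not None and best_sim >= threshold:
            best["members"].append(i)
            # Accumulate the post's weights into the topic vector.
            for w, v in vec.items():
                best["centroid"][w] = best["centroid"].get(w, 0.0) + v
        else:
            topics.append({"centroid": dict(vec), "members": [i]})
    return topics
```

Called on a list of tokenized posts, `discover_topics` returns one dictionary per topic, and the `members` list holds the indices of the posts assigned to it. The threshold trades off the two error types the abstract mentions: a higher value tends to split topics (more misses), a lower value tends to merge unrelated posts (more false detections).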

Key words: micro-blog, feature word, selection, similarity, topic discovery

