Telecommunications Science ›› 2017, Vol. 33 ›› Issue (1): 77-84.doi: 10.11959/j.issn.1000-0801.2017001

• research and development • Previous Articles     Next Articles

Blog screening and mining based on temporal features and hybrid search in big data

Lina ZHANG,Tai KUANG,Diqing JIANG   

  1. Department of Information Engineering,Zhejiang College of Security Technology,Wenzhou 325000,China
  • Revised:2016-09-14 Online:2017-01-01 Published:2017-06-04
  • Supported by:
    Educational Technology Research Prgram of Zhejiang Province in 2016(JB139)

Abstract:

Concerning that the correlation degree of the existing methods of blog screen and mining is loose and the information retrieval of the methods is deficient,a method based on temporal feature and hybrid search method was proposed.Considering the user reviews are important sources of evidence combination,the average number of reviews for blogs,the sources of BM25 relevance scores,the longest blog BM25 scores and time range between the latest related blog paper and the oldest related blog paper are being as the temporal feature sets.In addition,considering local search advantage of linear search(LS) and global search advantage of differential evolution(DE),the two kinds of information search methods were combined.BlogS06 data set was used in the experiment which was consists of blog home pages,XML source files and its blog portal pages,it was used for TREC 2007 and TREC 2008 blog mining experiments.Experimental results show that the proposed method can obtain satisfactory results in terms of running time and effectiveness.

Key words: blog screening and mining, temporal feature, linear search, differential evolution, big data, BM25

CLC Number: 

No Suggested Reading articles found!