Journal on Communications ›› 2015, Vol. 36 ›› Issue (Z1): 141-148.doi: 10.11959/j.issn.1000-436x.2015293

• Academic paper • Previous Articles     Next Articles

Efficient segment pattern based method for malicious URL detection

Hai-lun LIN1,Yan LI2,Wei-ping WANG1,Yin-liang YUE1,Zheng LIN1   

  1. 1 Institute of Information Engineering,Chinese Academy of Sciences,Beijing 100093,China
    2 National Computer Network Emergency Response and Coordination Center,Beijing 100029,China
  • Online:2015-11-25 Published:2015-12-29
  • Supported by:
    The National High Technology Research and Development Program of China (863 Program);The National Natural Science Foundation of China;The National Natural Science Foundation of China;The National Natural Science Foundation of China;The National Natural Science Foundation of China

Abstract:

An efficient segment based method for detecting malicious URL was proposed.Firstly it analyzed the annotated malicious URLs in terms of three semantic segments,i.e.,domain segment,path segment and file segment.Secondly it quickly calculated the common pattern of each semantic segment exploiting the tri-gram model based inverted index.Finally it decided whether a given URL was malicious based on the segment patterns returned by searching the inverted index.Moreover,this method also supported the Jaccard based random domain name identification technique for deciding malicious URLs with random domain name.Experimental results show that proposed method outperforms the state-of-the-art baseline methods,and can achieve good efficiency and scalability on malicious URL detection.

Key words: malicious URL, segment pattern, tri-gram, inverted index, random name

No Suggested Reading articles found!