Journal on Communications ›› 2015, Vol. 36 ›› Issue (10): 172-180.doi: 10.11959/j.issn.1000-436x.2015215

• academic paper • Previous Articles     Next Articles

HashTrie:a space-efficient multiple string matching algorithm

Ping ZHANG1,2,3,Yan-bing LIU1,3,Jing YU1,3,Jian-long TAN1,3   

  1. 1 Institute of Information Engineering,Chinese Academy of Sciences,Beijing 100093,China
    2 University of Chinese Academy of Sciences,Beijing 100049,China
    3 National Engineering Laboratory for Information Security Technologies,Beijing 100093,China
  • Online:2015-10-25 Published:2015-10-27
  • Supported by:
    The National Natural Science Foundation of China;The National High Technology Research and Development of China(863 Program);The Strategic Priority Research Program of the Chinese Academy of Sci-ences

Abstract:

The famous multiple string matching algorithm AC consumed huge memory when the string signatures were massive,thus unable to process high speed network traffic efficiently.To solve this problem,a space-efficient multiple string matching algorithm-HashTrie was proposed.This algorithm adopted recursive hash function to store the patterns in bit-vectors in place of the state transition table in order to reduce space consumption.Further more it made use of the rank operation for fast verification.Theoretic analysis shows that the space complexity of HashTrie is O(|P|),which is linear with the size of pattern set |P|and is independent of the alphabetsize σ.The space complexity is superior to the complexity O(|P|σlog|P|)of AC.Experiments on synthetic datasets and real-world datasets(such as Snort,ClamAV and URL)show that HashTrie saves up to 99.6% storage cost compared with AC,and in the meanwhile it runs at a matching speed that is about half of AC.HashTrie is a space-efficient multiple string matching algorithm that is appropriate to search large scale pattern strings with short lengths.

Key words: intrusion detection, multiple string matching, bit-vector; recursive hash function, space-efficient

No Suggested Reading articles found!