Telecommunications Science ›› 2017, Vol. 33 ›› Issue (9): 58-68.doi: 10.11959/j.issn.1000-0801.2017193

• research and development • Previous Articles     Next Articles

SLSB-forest:approximate k nearest neighbors searching on high dimensional data

Tu QIAN,Jiangbo QIAN,Yihong DONG,Huahui CHEN   

  1. College of Information Science and Engineering,Ningbo University,Ningbo 315211,China
  • Revised:2017-06-07 Online:2017-09-01 Published:2017-09-11
  • Supported by:
    The National Natural Science Foundation of China(61472194);The National Natural Science Foundation of China(61572266);Zhejiang Provincial Natural Science Foundation of China(LY16F020003)

Abstract:

The study of approximate k nearest neighbors query has attracted broad attention.Local sensitive hash is one of the mainstream ways to solve this problem.Local sensitive hash and its varients have noted the following problems:the uneven distribution of hashed data in the buckets,it cannot calculate the appropriate query range (for the parameter k) to build index.To tackle the above problem,a new data struct which called SLSB-forest was presented.The SLSB-forest combined the LSH and the B-tree to maintain the amount of bucket’s data in reasonable range.Two query algorithms were proposed:fast and accurate priority search.Theory and experimental results prove that query range can dynamic change during searching approximate k nearest neighbors.

Key words: approximate k nearest neighbor, locality sensitive hash, high dimensionality

CLC Number: 

No Suggested Reading articles found!