Telecommunications Science ›› 2017, Vol. 33 ›› Issue (6): 73-85.doi: 10.11959/j.issn.1000-0801.2017100

• research and development • Previous Articles     Next Articles

Nearest neighbor search algorithm for high dimensional data based on weighted self-taught hashing

Cong PENG,Jiangbo QIAN(),Huahui CHEN,Yihong DONG   

  1. College of Information Science and Engineering,Ningbo University,Ningbo 315211,China
  • Revised:2017-03-31 Online:2017-06-01 Published:2017-06-27
  • Supported by:
    The National Natural Science Foundation of China(61472194);The National Natural Science Foundation of China(61572266);Zhejiang Provincial Natural Science Foundation of China(LY16F020003)

Abstract:

Because of efficiency in query and storage,learning hash is applied in solving the nearest neighbor search problem.The learning hash usually converts high-dimensional data into binary codes.In this way,the similarities between binary codes from two objects are conserved as they were in the original high-dimensional space.In practical applications,a lot of data which have the same distance from the query point but with different code will be returned.How to reorder these candidates is a problem.An algorithm named weighted self-taught hashing was proposed.Experimental results show that the proposed algorithm can reorder the different binary codes with the same Hamming distances efficiently.Compared to the naive algorithm,the F1-score of the proposed algorithm is improved by about 2 times and it is better than the homologous algorithms,furthermore,the time cost is reduced by an order of magnitude.

Key words: nearest neighbor search, learning hash, weighted self-taught, high-dimensional data

CLC Number: 

No Suggested Reading articles found!