Efficient top-k string similarity query algorithms

Online: 2014-12-25 Published: 2014-12-15

Abstract

This paper studies top-k string similarity queries based on edit distance: given a query string σ and a string set S, find the k strings in S that are most similar to σ, i.e., those with the smallest edit distance to σ. First, two adaptive filtering strategies based on a length-skip index are proposed to reduce the number of edit distance computations between pairs of strings. Next, a lower bound on the edit distance between the query string and the set of unmatched strings is proposed, which further reduces the number of edit distance computations when processing strings that share no signature with the query string. Finally, efficient algorithms that return the top-k similar strings are proposed. Experimental results on three real datasets confirm the advantages of the proposed algorithms over the state-of-the-art algorithm.
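To make the problem statement concrete, the following is a minimal Python sketch of a top-k query under edit distance. It is not the paper's algorithm: it does not implement the length-skip index or the signature-based lower bound. It only uses the well-known fact that the edit distance between two strings is at least the difference of their lengths, so that some edit distance computations can be skipped. The function names `edit_distance` and `top_k_similar` are hypothetical and chosen for illustration.

```python
import heapq


def edit_distance(a: str, b: str) -> int:
    """Classic dynamic-programming Levenshtein distance, O(len(a) * len(b))."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution or match
        prev = cur
    return prev[-1]


def top_k_similar(query: str, strings: list[str], k: int) -> list[tuple[int, str]]:
    """Return up to k (distance, string) pairs with the smallest edit distance.

    Illustrative baseline only (an assumption of this sketch, not the paper's
    method): candidates are visited in order of increasing length difference,
    which is a lower bound on the edit distance.
    """
    if k <= 0:
        return []
    heap = []  # max-heap on distance via negation: (-distance, -index, string)
    candidates = sorted(enumerate(strings),
                        key=lambda p: abs(len(p[1]) - len(query)))
    for idx, s in candidates:
        lower_bound = abs(len(s) - len(query))
        # Once k results are held and the bound cannot beat the current k-th
        # distance, no remaining candidate can improve the answer.
        if len(heap) == k and lower_bound >= -heap[0][0]:
            break
        d = edit_distance(query, s)
        if len(heap) < k:
            heapq.heappush(heap, (-d, -idx, s))
        elif d < -heap[0][0]:
            heapq.heapreplace(heap, (-d, -idx, s))
    return sorted((-neg_d, s) for neg_d, _, s in heap)


if __name__ == "__main__":
    data = ["sitting", "mitten", "kitchen", "bitten", "a"]
    print(top_k_similar("kitten", data, 2))
```

In this sketch, ties at the k-th distance are broken arbitrarily, so any valid top-k answer may be returned. The early exit shows why tighter lower bounds matter: the sooner the bound for the remaining strings reaches the current k-th distance, the fewer full edit distance computations are needed.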
