Journal on Communications

Previous Articles     Next Articles

Efficient top-k string similarity query algorithms

  

  • Online:2014-12-25 Published:2014-12-15

Abstract: Computing top-k similar strings based on edit distance, i.e., given a query string σ and string set S, finding k similar strings to σ based on edit distance from S. Firstly, two adaptive filter strategies based on length-skip index are proposed, such that to reduce the times of edit distance computation between two strings. Then the lower bound of edit distance between query string and unmatched string set is proposed, such that to further reduce the times of edit distance computation when processing strings that do not have common signatures with the query string. Finally efficient algorithms to return top-k similar strings are proposed. Experimental results on three real datasets verify the benefits over the state-of-the-art algorithm.

No Suggested Reading articles found!