Journal on Communications ›› 2013, Vol. 34 ›› Issue (Z2): 157-162.doi: 10.3969/j.issn.1000-436x.2013.Z2.030

• Digital Campus Application • Previous Articles     Next Articles

Similar text positioning method based on slope-density cluster

Du ZOU1,Wen-jun TANG1,Wei-jiang LONG2,Ling ZHANG3   

  1. 1 Information Network Engineering and Research Center,South China University of Technology,Guangzhou 510640,China
    2 School of Science,South China University of Technology,Guangzhou 510640,China
    3 School of Computer Science &Engineering,South China University of Technology,Guangzhou 510640,China
  • Online:2013-12-25 Published:2017-06-16
  • Supported by:
    The National Natural Science Foundation of China

Abstract:

Similar text positioning is an important part of plagiarism detection.The existing positioning method directly merges text or fingerprint to obtain similar text.Due to the disturb information in the similar text,the positioning accuracy is poor.The semantic features of the match fingerprints were analyzed,and a cluster method based on slope density for similar text positioning was proposed,which converts the text merge problem into dense sample points clustering problem,and improves the efficiency and accuracy of the positioning.Through the experiment on the PAN public corpus,the result shows it performs better than the PAN10 top three.This method has been used in the South China University of Technology 's feature professional teaching platform to detect the plagiarism of homework.

Key words: plagiarism detection, similar text positioning, cluster, fingerprint

No Suggested Reading articles found!