Journal on Communications ›› 2020, Vol. 41 ›› Issue (8): 155-164.doi: 10.11959/j.issn.1000-436x.2020152

• Papers • Previous Articles     Next Articles

CMDC:an iterative algorithm for complementary multi-view document clustering

Ruizhang HUANG1,2,Ruina BAI1,Yanping CHEN1,2,Yongbin QIN1,2,Xinyu CHENG1,3,Youliang TIAN1,2   

  1. 1 College of Computer Science and Technology,Guizhou University,Guiyang 550025,China
    2 Guizhou Provincial Key Laboratory of Public Big Data,Guiyang 550025,China
    3 Guizhou Intelligent Human-Computer Interaction Engineering Technology Research Center,Guiyang 550025,China
  • Revised:2020-06-10 Online:2020-08-25 Published:2020-09-05
  • Supported by:
    The National Natural Science Foundation of China(61462011);The National Natural Science Foundation of China(91746116);The Joint Funds of the National Natural Science Foundation of China(U1836205);The Key Projects of Science and Technology of Guizhou([2020]1Z055)

Abstract:

In response to the problems traditional multi-view document clustering methods separate the multi-view document representation from the clustering process and ignore the complementary characteristics of multi-view document clustering,an iterative algorithm for complementary multi-view document clustering——CMDC was proposed,in which the multi-view document clustering process and the multi-view feature adjustment were conducted in a mutually unified manner.In CMDC algorithm,complementary text documents were selected from the clustering results to aid adjusting the contribution of view features via learning a local measurement metric of each document view.The complementary text document of the results among the dimensionality clusters was selected by CMDC,and used to promote the feature tuning of the clusters.The partition consistency of the multi-dimensional document clustering was solved by the measure consistency of the dimensions.Experimental results show that CMDC effectively improves multi-dimensional clustering performance.

Key words: multi-view document clustering, complementary text, constrained document clustering, metric calculation

CLC Number: 

No Suggested Reading articles found!