Journal on Communications ›› 2021, Vol. 42 ›› Issue (7): 231-237.doi: 10.11959/j.issn.1000-436x.2021133

• Correspondences • Previous Articles    

Speaker verification method based on deep information divergence maximization

Chen CHEN1,2, Yafeng RONG1, Chaoqun JI1, Deyun CHEN1,2, Yongjun HE1   

  1. 1 School of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150080, China
    2 Postdoctoral Research Station of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150080, China
  • Revised:2021-06-15 Online:2021-07-25 Published:2021-07-01
  • Supported by:
    The National Natural Science Foundation of China(61673142);The Natural Science Foundation of Heilongjiang Province(JJ2019JQ0013);Heilongjiang Postdoctoral Fund(LBH-Z20020);The Fundamental Research Founds for the Central Universities of Heilongjiang Province(2020-KYYWF-0341)

Abstract:

To solve the problem that the nonlinear relationship between speaker representations cannot be accurately captured in speaker verification, an objective function based on depth information divergence maximization was proposed.It could implicitly represent the nonlinear relationship between speaker representations by calculating the similarity between their distributions.Under the supervision of the optimization goal of maximizing the statistical correlation, the deep neural network was optimized towards the direction that the within-class data was more compact and the between-class data were far away from each other, and finally the discrimination of deep speaker representation space could be effectively improved.Experimental results show that compared with other deep learning methods, the relative EER of the proposed method is reduced by 15.80% at most, which significantly improves the system performance.

Key words: speaker verification, objective function, deep information divergence, representation learning

CLC Number: 

No Suggested Reading articles found!