Telecommunications Science ›› 2017, Vol. 33 ›› Issue (3): 59-66.doi: 10.11959/j.issn.1000-0801.2017046

• research and development • Previous Articles     Next Articles

Continuous speech speaker recognition based on CNN

Zhendong WU,Shucheng PAN,Jianwu ZHANG   

  1. Hangzhou Dianzi University, Hangzhou 310018, China
  • Revised:2017-02-13 Online:2017-03-01 Published:2017-04-05
  • Supported by:
    Zhejiang Natural Science Foundation of China(LY16F020016);National Key Research and Development Program of China(2016YFB0800201);Zhejiang Province Science and Technology Innovation Program(2013TD03)

Abstract:

In the last few years, with the constant improvement of the social life level, the requirement for speech recognition is getting higher and higher. GMM-HMM (Gaussian mixture-hidden Markov model) have been the main method for speaker recognition. Because of the bad modeling capability of big data and the bad performance of robustness, the development of this model meets the bottleneck.In order to solve this question, researchers began to focus on deep learning technologies. CNN deep learning model for continuous speech speaker recognition was introduced and CSR-CNN model was put forward. The model extracts fixed-length and right-order phonetic fraction to form an ordered sound spectrograph. Then input the voiceprint extract from CNN model to a reward-penalty function to continuous measurement. Experimental results show that CSR-CNN model has very good recognition effectin continuous speech speaker recognition field.

Key words: continuous speech, sound spectrograph, GMM-HMM, deep learning

CLC Number: 

No Suggested Reading articles found!