Telecommunications Science, 2018, Vol. 34, Issue (5): 90-98. doi: 10.11959/j.issn.1000-0801.2018020

• research and development •

A playback speech detection algorithm based on log inverse Mel-frequency spectral coefficient

Lang LIN, Rangding WANG, Diqun YAN, Can LI

  1. Ningbo University, Ningbo 315211, China
  • Revised: 2017-12-07 Online: 2018-05-01 Published: 2018-05-30
  • Supported by:
    The National Natural Science Foundation of China (61672302); The National Natural Science Foundation of China (61300055); The Natural Science Foundation of Zhejiang Province of China (LZ15F020002); The Natural Science Foundation of Zhejiang Province of China (LY17F020010); The Scientific Research Foundation of Ningbo University (XKXL1405); The Scientific Research Foundation of Ningbo University (XKXL1420); The Scientific Research Foundation of Ningbo University (XKXL1509); The Scientific Research Foundation of Ningbo University (XKXL1503); K. C. Wong Magna Fund in Ningbo University

Abstract:

The popularity and portability of high-fidelity recording and playback equipment pose a serious challenge to speaker recognition systems, which must withstand playback attacks. Exploiting the differences between original speech and playback speech in the high-frequency region, the proposed algorithm reverses the Mel filter bank used in Mel-frequency cepstral coefficient (MFCC) calculation and takes the coefficients obtained before the DCT as its features. An SVM serves as the classifier. Experimental results show that the algorithm can effectively detect playback speech. In addition, integrating the algorithm into a GMM-UBM speaker recognition system significantly improves the system's resistance to playback attacks.
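The feature extraction summarized in the abstract can be illustrated with a minimal sketch. It assumes the common inverted-Mel construction, in which a standard triangular Mel filter bank is mirrored along both the filter and the frequency axis so that the narrow filters fall in the high-frequency region; the abstract does not give the paper's exact filter count, FFT size, or frame settings, so the function names and parameters below are hypothetical. The sketch stops at the log filter-bank energies (the values before the DCT) and leaves out framing, the FFT, and the SVM classifier.

```python
import math


def hz_to_mel(f):
    # Widely used Mel-scale formula (an assumption; the paper may use a variant).
    return 2595.0 * math.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_filter_bank(n_filters, n_fft, sample_rate):
    """Triangular Mel filters over the n_fft // 2 + 1 one-sided FFT bins."""
    n_bins = n_fft // 2 + 1
    max_mel = hz_to_mel(sample_rate / 2.0)
    # n_filters + 2 edge points, equally spaced on the Mel scale.
    edges_hz = [mel_to_hz(max_mel * i / (n_filters + 1))
                for i in range(n_filters + 2)]
    edges_bin = [hz * n_fft / sample_rate for hz in edges_hz]
    bank = []
    for j in range(1, n_filters + 1):
        left, center, right = edges_bin[j - 1], edges_bin[j], edges_bin[j + 1]
        row = []
        for k in range(n_bins):
            if left < k <= center:
                row.append((k - left) / (center - left))
            elif center < k < right:
                row.append((right - k) / (right - center))
            else:
                row.append(0.0)
        bank.append(row)
    return bank


def inverse_mel_filter_bank(n_filters, n_fft, sample_rate):
    """Mirror the Mel bank in filter order and in frequency, so that
    resolution is finest at high frequencies (hypothetical helper)."""
    bank = mel_filter_bank(n_filters, n_fft, sample_rate)
    return [row[::-1] for row in reversed(bank)]


def log_filter_bank_energies(power_spectrum, bank, floor=1e-10):
    """Log energies per filter for one frame: the values before the DCT."""
    return [math.log(max(sum(w * p for w, p in zip(row, power_spectrum)), floor))
            for row in bank]


# Toy frame whose energy lies only in the top FFT bins.
bank = inverse_mel_filter_bank(n_filters=8, n_fft=64, sample_rate=16000)
frame_power = [0.0] * 28 + [1.0] * 5
features = log_filter_bank_energies(frame_power, bank)
```

In a complete detector, one such feature vector would be computed per frame from the frame's power spectrum and the resulting features passed to the classifier.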

Key words: speaker recognition, playback speech detection, log Mel-frequency spectrum, inverse Mel-filter group
