Telecommunications Science ›› 2023, Vol. 39 ›› Issue (11): 107-115.doi: 10.11959/j.issn.1000-0801.2023187

• Research and Development • Previous Articles    

A method of synthetic speech spoofing detection using constant Q modulation envelope

Jia XU1, Zhihua JIAN1,2, Honghui JIN1, Chao WU1   

  1. 1 School of Communication Engineering, Hangzhou Dianzi University, Hangzhou 310018, China
    2 Key Laboratory of Data Storage and Transmission Technology of Zhejiang Province, Hangzhou 310018, China
  • Revised:2023-09-30 Online:2023-11-01 Published:2023-11-01
  • Supported by:
    The National Natural Science Foundation of China(61201301);The National Natural Science Foundation of China(61772166);The National Natural Science Foundation of China(61901154)

Abstract:

In response to the low accuracy of synthetic speech spoofing detection based on traditional acoustic feature parameters, poor detection performance for unknown types of synthetic speech, and performance degradation in noisy environments, a method for detecting spoofing synthetic speech was proposed using constant Q modulation envelope (CQME) .The motivation of the method was from the fact that the temporal envelope of speech contained abundant information and there was a big difference in detail between the envelope of synthetic speech and genuine speech.The modulation envelope spectrum of speech was obtained by employing constant Q transform (CQT), and the root mean square of each frequency component was calculated to derive the CQME feature vector.And then the CQME feature vector was used to train the random forest classifier for discriminating genuine speech from spoofing synthetic speech.Experimental results demonstrate that the random forest trained with CQME features achieves high detection performance on the ASVspoof 2019 dataset and exhibites good detection efficacy for unknown types of synthetic speech.Furthermore, the proposed method shows high detection performance even under various noise conditions, having excellent noise robustness.

Key words: synthetic speech, spoofing speech detection, constant Q modulation envelope, random forest

CLC Number: 

No Suggested Reading articles found!