[1] |
TAN C B , HIJAZI M H A , KHAMIS N ,et al. A survey on presentation attack detection for automatic speaker verification systems:state-of-the-art,taxonomy,issues and future direction[J]. Multimedia Tools and Applications, 2021,80(21-23): 32725-32762.
|
[2] |
徐嘉, 简志华, 金宏辉 ,等. 基于中心对称局部二值模式的合成伪装语音检测方法[J]. 电信科学, 2023,39(1): 72-78.
|
|
XU J , JIAN Z H , JIN H H ,et al. A method for synthetic spoofing speech detection based on center-symmetric local binary pattern[J]. Telecommunications Science, 2023,39(1): 72-78.
|
[3] |
MITTAL A , DUA M . Automatic speaker verification systems and spoof detection techniques:review and analysis[J]. International Journal of Speech Technology, 2021,25(1): 105-134.
|
[4] |
ALZANTOT M , WANG Z , SRIVASTAVA M B . Deep residual neural networks for audio spoofing detection[C]// Proceedings of 20th Annual Conference of the International Speech Communication Association 2019 (INTERSPEECH 2019). Graz,Austria:ISCA, 2019: 1078-1082.
|
[5] |
NAGAKRISHNAN R , REVATHI A . Generic speech based person authentication system with genuine and spoofed utterances:different feature sets and models[J]. Multimedia Tools and Applications, 2021,81(1): 1179-1208.
|
[6] |
TODISCO M , HéCTOR D , EVANS N . Constant Q cepstral coefficients:a spoofing countermeasure for automatic speaker verification[J]. Computer Speech & Language, 2017(45): 516-535.
|
[7] |
RAJAN P , PARTHASARATHI S , MURTHY H A . Robustness of phase based features for speaker recognition[C]// Proceedings of 10th Annual Conference of the International Speech Communication Association 2009 (INTERSPEECH 2009). Brighton:ISCA, 2009: 2299-2302.
|
[8] |
SARATXAGA I , SANCHEZ J , WU Z ,et al. Synthetic speech detection using phase information[J]. Speech Communication, 2016(81): 30-41.
|
[9] |
DRULLMAN R , FESTEN J M , PLOMP R . Effect of temporal envelope smearing on speech reception[J]. The Journal of the Acoustical Society of America, 1994,95(2): 1053-1064.
|
[10] |
LU X , UNOKI M , NAKAMURA S . Sub-band temporal modulation envelopes and their normalization for automatic speech recognition in reverberant environments[J]. Computer Speech &Language, 2011,25(3): 571-584.
|
[11] |
DING N , PATEL A D , CHEN L ,et al. Temporal modulations in speech and music[J]. Neuroscience & Biobehavioral Reviews, 2017(81): 181-187.
|
[12] |
NING Y , HE S , WU Z ,et al. A review of deep learning based speech synthesis[J]. Applied Sciences, 2019,9(19): 4050.
|
[13] |
林朗, 王让定, 严迪群 ,等. 基于逆梅尔对数频谱系数的回放语音检测算法[J]. 电信科学, 2018,34(5): 90-98.
|
|
LIN L , WANG R D , YAN D Q ,et al. A playback speech detection algorithm based on log inverse Mel-frequency spectral coefficient[J]. Telecommunications Science, 2018,34(5): 90-98.
|
[14] |
BROWN J C . Calculation of a constant Q spectral transform[J]. Journal of the Acoustical Society of America, 1998,89(1): 425-434.
|
[15] |
HAMSA S , SHAHIN I , IRAQI Y ,et al. Emotion recognition from speech using wavelet packet transform cochlear filter bank and random forest classifier[J]. IEEE Access, 2020(8): 96994-97006.
|
[16] |
CHEN L , SU W , FENG Y ,et al. Two-layer fuzzy multiple random forest for speech emotion recognition in human-robot interaction[J]. Information Sciences, 2020(509): 150-163.
|
[17] |
RAMOSAJ B , PAULY M . Consistent estimation of residual variance with random forest out-of-bag errors[J]. Statistics &Probability Letters, 2019(151): 49-57.
|
[18] |
WANG X , YAMAGISHI J , TODISCO M ,et al. ASVspoof 2019:a large-scale public database of synthesized,converted and replayed speech[J]. Computer Speech & Language, 2020(64): 101114.
|
[19] |
KINNUNEN T , DELGADO H , EVANS N ,et al. Tandem assessment of spoofing countermeasures and automatic speaker verification:fundamentals[J]. IEEE/ACM Transactions on Au-dio,Speech,and Language Processing, 2020(28): 2195-2210.
|
[20] |
WANG X , TAKAKI S , YAMAGISHI J . Neural source-filterbased waveform model for statistical parametric speech synthesis[C]// 2019 IEEE International Conference on Acoustics,Speech and Signal Processing (ICASSP). Piscataway:IEEE Press, 2019: 5916-5920.
|