电信科学 ›› 2022, Vol. 38 ›› Issue (6): 91-99.doi: 10.11959/j.issn.1000-0801.2022089

• 研究与开发 • 上一篇    下一篇

基于联合特征与随机森林的伪装语音检测

于佳祺1, 简志华1, 徐嘉1, 游林2, 汪云路2, 吴超1   

  1. 1 杭州电子科技大学通信工程学院,浙江 杭州 310018
    2 杭州电子科技大学网络空间安全学院,浙江 杭州 310018
  • 修回日期:2022-05-15 出版日期:2022-06-20 发布日期:2022-06-01
  • 作者简介:于佳祺(1997- ),男,杭州电子科技大学通信工程学院硕士生,主要研究方向为语音伪装检测、特征提取与分析
    简志华(1978- ),男,博士,杭州电子科技大学通信工程学院副教授、硕士生导师,主要研究方向为语音转换、伪装语音检测、声纹识别等
    徐嘉(1998- ),女,杭州电子科技大学通信工程学院硕士生,主要研究方向为语音伪装及检测
    游林(1966- ),男,博士,杭州电子科技大学网络空间安全学院教授、硕士生导师,主要研究方向为生物信息处理、信息安全、密码学等
    汪云路(1980- ),女,博士,杭州电子科技大学网络空间安全学院讲师,主要研究方向为音频信息处理、信息隐藏
    吴超(1988- ),男,博士,杭州电子科技大学通信工程学院讲师,主要研究方向为导航信号处理及欺骗干扰检测
  • 基金资助:
    国家自然科学基金资助项目(61201301);国家自然科学基金资助项目(61772166);国家自然科学基金资助项目(61901154)

Spoofing speech detection algorithm based on joint feature and random forest

Jiaqi YU1, Zhihua JIAN1, Jia XU1, Lin YOU2, Yunlu WANG2, Chao WU1   

  1. 1 School of Communication Engineering, Hangzhou Dianzi University, Hangzhou 310018, China
    2 School of Cyberspace Security, Hangzhou Dianzi University, Hangzhou 310018, China
  • Revised:2022-05-15 Online:2022-06-20 Published:2022-06-01
  • Supported by:
    The National Natural Science Foundation of China(61201301);The National Natural Science Foundation of China(61772166);The National Natural Science Foundation of China(61901154)

摘要:

为了能较为全面地描述语音信号的特征信息,提高伪装检测率,提出了一种基于均匀局部二值模式纹理特征与常数Q倒谱系数声学特征相结合,并以随机森林为分类模型的伪装语音检测方法。利用均匀局部二值模式提取语音信号语谱图中的纹理特征矢量,并与常数Q倒谱系数构成联合特征,再用所获得的联合特征矢量训练随机森林分类器,从而实现了伪装语音检测。实验中,分别对其他特征参数以及支持向量机分类器模型所构建的几种伪装检测系统进行了性能对照,结果表明,所提联合特征与随机森林模型相结合的语音伪装检测系统具有最优的检测性能。

关键词: 伪装语音检测, 声学特征, 纹理特征, 均匀局部二值模式, 随机森林

Abstract:

In order to describe the characteristic information of the speech signal more comprehensively and improve the detection rate of camouflage, a spoofing speech detection method based on the combination of uniform local binary pattern texture feature and constant Q cepstrum coefficient acoustic feature was proposed, which used random forest as the classifier model.The texture feature vector in the speech signal spectrogram was extracted by using the uniform local binary mode, and the joint feature was formed with the constant Q cepstrum coefficient.Then, the obtained joint feature vector was used to train the random forest classifier, so as to realize the camouflage speech detection.In the experiment, the performances of several spoofing detection systems constructed by other feature parameters and the support vector machine classifier model were compared, and the results show that the proposed speech spoofing detection system combined with the joint feature and the random forest model has the best performance.

Key words: spoofing speech detection, acoustic feature, texture feature, uniform local binary pattern, random forest

中图分类号: 

No Suggested Reading articles found!