电信科学 ›› 2017, Vol. 33 ›› Issue (8): 145-154.doi: 10.11959/j.issn.1000-0801.2017126

• 研究与开发 • 上一篇    下一篇

基于相位谱的翻录语音攻击检测算法

李璨,王让定(),严迪群,陈亚楠   

  1. 宁波大学信息科学与工程学院,浙江 宁波 315211
  • 修回日期:2017-03-20 出版日期:2017-08-01 发布日期:2017-08-25
  • 作者简介:李璨(1992-),女,宁波大学信息科学与工程学院硕士生,主要研究方向为多媒体通信与信息安全等。|王让定(1962-),男,博士,宁波大学高等技术研究院教授、博士生导师,主要研究方向为多媒体通信与取证、信息隐藏与隐写分析、智能抄表及传感网络技术等。|严迪群(1979-),男,博士,宁波大学信息科学与工程学院副教授、硕士生导师,主要研究方向为多媒体通信、信息安全、基于深度学习的数字语音取证等。|陈亚楠(1990-),女,宁波大学信息科学与工程学院硕士生,主要研究方向为多媒体通信与信息安全等。
  • 基金资助:
    国家自然科学基金资助项目(61672302);国家自然科学基金资助项目(61300055);浙江省自然科学基金资助项目(LZ15F020010);浙江省自然科学基金资助项目(Y17F020051);宁波大学科研基金资助项目(XKXL1405);宁波大学科研基金资助项目(XKXL1420);宁波大学科研基金资助项目(XKXL1509);宁波大学科研基金资助项目(XKXL1503);宁波大学王宽诚幸福基金资助项目

Recapture voice replay detection based on phase spectrum

Can LI,Rangding WANG(),Diqun YAN,Yanan CHEN   

  1. College of Information Science and Engineering,Ningbo University,Ningbo 315211,China
  • Revised:2017-03-20 Online:2017-08-01 Published:2017-08-25
  • Supported by:
    The National Natural Science Foundation of China(61672302);The National Natural Science Foundation of China(61300055);Natural Science Foundation of Zhejiang Province of China(LZ15F020010);Natural Science Foundation of Zhejiang Province of China(Y17F020051);The Scientific Research Foundation of Ningbo University(XKXL1405);The Scientific Research Foundation of Ningbo University(XKXL1420);The Scientific Research Foundation of Ningbo University(XKXL1509);The Scientific Research Foundation of Ningbo University(XKXL1503);K.C.Wong Magna Fund in Ningbo University

摘要:

因与原始语音具有高度相似性,经高保真设备回放的翻录语音常被不法分子用于对说话人认证(ASV)系统进行攻击,以达到非法认证的目的。为提高系统抵抗翻录语音攻击的顽健性,通过研究原始语音与翻录语音产生的实际过程,发现两者在频率域相位上有明显差异,并在此基础上提出了一种基于相位谱的翻录语音检测方法。分析讨论了FFT和不同偷录、回放设备对翻录语音检测率的影响。实验结果表明,该方法能够准确地判断待测语音是否为翻录语音,其检测率达到了99.04%。并且,将该算法加载到说话人识别系统中,使系统的等错误概率(EER)降低了约22%,有效提高了系统抵抗翻录语音攻击的性能。

关键词: 说话人认证系统, 翻录语音检测, 相位谱

Abstract:

Due to a high similarity between the recaptured voice recorded by high-fidelity ripping equipment and the original voice,the automatic speaker verification(ASV)system used to be attacked illegally by the recaptured voice.In order to improve the ability of resisting the attack,a recaptured voice detection method was proposed based on the difference of phase spectrum between original and recaptured voices for the ASV system.In addition,the effects of different recording and replay devices,the FFT were discussed.Experimental results show that the proposed method can accurately recognize the recording voice,of which detection rate is 99.04%。Meanwhile,the equal error rate (EER) of the ASV system has dropped about 22% with this method being integrated,which indicates that the system’s ability of resisting playback attack is enhanced.

Key words: ASV system, recaptured voice detection, phase spectrum

中图分类号: 

No Suggested Reading articles found!