Journal on Communications (通信学报) ›› 2014, Vol. 35 ›› Issue (2): 87-94. doi: 10.3969/j.issn.1000-436x.2014.02.012

• Academic paper

基于多任务稀疏表达的二元麦克风小阵列话音增强算法

杨立春1,2,叶敏超1,钱沄涛1   

  1. 浙江大学 计算机科学与技术学院,浙江 杭州 310027;
  2. 浙江万里学院 智能控制技术研究所,浙江 宁波
  • 出版日期:2014-02-25 发布日期:2017-07-25
  • 基金资助:
    国家自然科学基金资助项目;国家重点基础研究发展计划(“973”计划)基金资助项目;国家科技支撑计划基金资助项目

Speech enhancement based on multi-task sparse representation for dual small microphone arrays

Li-chun YANG1,2, Min-chao YE1, Yun-tao QIAN1

  1. College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China;
  2. Intelligent Control Research Institute, Zhejiang Wanli University, Ningbo 315101, China
  • Online: 2014-02-25 Published: 2017-07-25
  • Supported by:
    The National Natural Science Foundation of China; The National Basic Research Program of China (973 Program); The National Key Technology R&D Program of China

摘要:

针对常规二元麦克风小阵列话音增强算法通常需要话音活动检测技术支持,并且难以有效抑制第一帧含目标信号的噪声的问题,提出了一种基于多任务稀疏表达的二元麦克风小阵列话音增强算法。首先利用字典学习方法分别获得目标信号和噪声信号的过完备字典,然后利用 ℓ2/ℓ1 混合范数对信号在其字典上的表示系数进行正则化稀疏约束,使得2个阵元接收到信号中的噪声信号被抑制,而话音信号尽量保持不变,从而达到话音增强的目标。仿真和实验数据表明,无论开始位置是否含有目标话音信号,所提出的非话音活动检测支持的二元麦克风小阵列话音增强算法均能有效实现话音增强的目标。

关键词: 麦克风小阵列, 话音增强, 字典学习, 多任务稀疏表达

Abstract:

Speech enhancement algorithms for dual small microphone arrays usually rely on voice activity detection (VAD), and they may fail to suppress noise effectively when the target speech signal is present in the first frame. A speech enhancement algorithm based on multi-task sparse representation is proposed. First, over-complete dictionaries for the target signal and the noise are learned separately via dictionary learning. Then the representation coefficients of the signals received by the two microphones are regularized with an ℓ2/ℓ1 mixed norm on these dictionaries, so that the noise is suppressed while the target speech is largely preserved, and the speech is thereby enhanced. Results on synthetic and real-world data show that the proposed algorithm, which does not require VAD, enhances speech effectively whether or not the target speech signal is present in the first frame.

Key words: small microphone arrays, speech enhancement, dictionary learning, multi-task sparse representation
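
The two-step idea in the abstract (learned dictionaries, then a jointly sparse code shared by both microphone channels) can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not give the objective or the solver, so the sketch assumes the common row-sparse formulation min 0.5*||Y - D A||_F^2 + lam*||A||_{2,1} with D = [D_speech, D_noise], solved by plain proximal gradient descent. The function names, the parameter `lam`, and the choice to rebuild the speech from the speech-dictionary coefficients only are all assumptions made for illustration, and the dictionaries are taken as already learned.

```python
import numpy as np


def row_soft_threshold(A, tau):
    """Proximal operator of tau * sum_i ||A[i, :]||_2 (shrinks each row as a group)."""
    norms = np.linalg.norm(A, axis=1, keepdims=True)
    scale = np.maximum(1.0 - tau / np.maximum(norms, 1e-12), 0.0)
    return A * scale


def multitask_sparse_code(Y, D, lam, n_iter=200):
    """Approximately minimise 0.5*||Y - D A||_F^2 + lam*||A||_{2,1} by proximal gradient.

    Y: (frame_len, n_channels) observations, one column per microphone (task).
    D: (frame_len, n_atoms) dictionary shared by all channels.
    """
    # Step size 1/L, where L = ||D||_2^2 is the Lipschitz constant of the gradient.
    step = 1.0 / (np.linalg.norm(D, 2) ** 2)
    A = np.zeros((D.shape[1], Y.shape[1]))
    for _ in range(n_iter):
        grad = D.T @ (D @ A - Y)
        A = row_soft_threshold(A - step * grad, step * lam)
    return A


def enhance_frame(Y, D_speech, D_noise, lam=0.1, n_iter=200):
    """Hypothetical helper: code Y on [D_speech, D_noise], keep only the speech part."""
    D = np.hstack([D_speech, D_noise])
    A = multitask_sparse_code(Y, D, lam, n_iter)
    k = D_speech.shape[1]
    return D_speech @ A[:k]
```

Because the penalty acts on whole rows of A, an atom is either used by both channels or by neither, which is one way to express the multi-task coupling between the two microphones. In practice `D_speech` and `D_noise` would come from a dictionary learning step on training data, and `lam` would need tuning.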
