通信学报 ›› 2014, Vol. 35 ›› Issue (1): 183-190.doi: 10.3969/j.issn.1000-436x.2014.01.021

• 学术通信 • 上一篇    下一篇

复杂环境下基于时延估计的声源定位技术研究

张大威,鲍长春(),夏丙寅   

  1. 北京工业大学 电子信息与控制工程学院 话音与音频信号处理研究室,北京 100124
  • 出版日期:2014-01-25 发布日期:2017-06-17
  • 基金资助:
    北京市教育委员会科技发展计划重点基金资助项目;国家自然科学基金资助项目

Source localization based on time delay estimation in complex environment

Da-wei ZHANG,Chang-chun BAO(),Bing-yin XIA   

  1. Speech and Audio Signal Processing Lab,School of Electronic Information and Control Engineering,Beijing University of Technology,Beijing 100124,China
  • Online:2014-01-25 Published:2017-06-17
  • Supported by:
    The National Natural Science Foundation of China

摘要:

为了改善在复杂环境下声源定位算法的性能,提出了一种新的时延估计(TDE)方法,即基于传递函数比的统计模型方法(ATFR-SM)。该方法采用统计模型去除噪声对传递函数(ATF)的影响,在计算传递函数时对功率谱密度(PSD)进行平滑和“白化”,以去除混响对传递函数的影响。同时,算法中引入话音激活检测(VAD)去除对求取传递函数无用的噪声段,以提高时延估计的准确性。此外,将所提时延估计方法与线性定位法相结合,构成一套完整的声源定位方法。实验结果表明,在复杂环境下,时延估计方法具有更低的异常点百分比(PAP)和均方根误差(RMSE),且明显优于传统的参考算法,同时声源定位方法具有更高的定位精度。

关键词: 时延估计, 传递函数比, VAD, 统计模型, 声源定位

Abstract:

In order to improve the performance of source localization in noisy and reverberant environments,a novel time delay estimation (TDE) method was proposed.This method is called acoustical transfer function ratio based on statistical model (ATFR-SM).In the proposed algorithm,the noise reduction method based on the statistical model was adopted to reduce the effect of noise on acoustical transfer Function (ATF).In the ATF method,the power spectral density (PSD) was smoothed and whitened to reduce the effect of reverberations.voice activity detection (VAD) was used to distinguish the speech period from the noise period,and the TDE was performed in the speech period to improve the estimation accuracy.Moreover,the proposed TDE method and the linear closed-form method for source localization were combined to constitute a source localization system.The results of performance evaluation show that,in both the noisy and reverberant conditions,the lower percentage of abnormal points (PAP) and lower root mean square error (RMSE) can be achieved by the proposed TDE method than those of the reference methods.Meanwhile,the source localization has higher accuracy than the reference methods.

Key words: TDE, ATF ratio, VAD, statistical model, source localization

No Suggested Reading articles found!