Journal on Communications ›› 2023, Vol. 44 ›› Issue (3): 105-116.doi: 10.11959/j.issn.1000-436x.2023043

• Papers • Previous Articles     Next Articles

Objective assessment of communication speech interference effect based on feature fusion

Yun LIN1, Huaitao XU1, Sen WANG1, Sicheng ZHANG1, Long ZHUANG2   

  1. 1 College of Information and Communication Engineering, Harbin Engineering University, Harbin 150001, China
    2 School of Integrated Circuits, Anhui University, Hefei 230039, China
  • Revised:2022-11-30 Online:2023-03-25 Published:2023-03-01
  • Supported by:
    The National Natural Science Foundation of China(62201172);The Fundamental Research Funds for the Central Universities(3072022CF0804);The Fundamental Research Funds for the Central Universities(3072022CF0601)

Abstract:

In view of the objective assessment problem of the effect of communication speech interference, methods based on multi-measurements and multimodal fusion were proposed.First, the interfered speech was preprocessed by the endpoint detection algorithm and time warping algorithm.Then, the content of speech was extracted and performed measurement calculated with the standard speech to obtain five kinds of measure.After the fusion of five measures, random forest model was used to assessed the quality level.Finally, a neural network model based on residual structure was designed combined multimodal fusion technique, which fused the graph domain and measure domain features of the interfered speech data and performed quality level assessment.Experimental results show that the accuracy of two methods have reached more than 90%.Among them, the multimodal assessment method improves the accuracy by about 3.269% compared with the existing research methods, which proves that it has a better performance.

Key words: speech quality assessment, speech signal processing, multimodal fusion, deep neural network

CLC Number: 

No Suggested Reading articles found!