Telecommunications Science ›› 2024, Vol. 40 ›› Issue (1): 35-47.doi: 10.11959/j.issn.1000-0801.2024015

• Research and Development • Previous Articles    

Deep learning-based prediction of multi-level just noticeable distortion

Haifeng XU1, Hongkui WANG1, Haibing YIN1, Chuqiao CHEN2   

  1. 1 College of Communication Engineering, Hangzhou Dianzi University, Hangzhou 310018, China
    2 College of Cyberspace Security, Hangzhou Dianzi University, Hangzhou 310018, China
  • Revised:2024-01-12 Online:2024-01-01 Published:2024-01-01
  • Supported by:
    The National Natural Science Foundation of China(62202134);The National Natural Science Foundation of China(62031009);The National Natural Science Foundation of China(61972123);Ministry of Science and Technology Key Research and Development Project Funding Program(2023YFB4502800);Zhejiang Provincial “Pioneer” and“Leading Goose” Research and Development Project(2023C01149);Zhejiang Provincial “Pioneer” and“Leading Goose” Research and Development Project(2022C01068);The Natural Science Foundation of Zhejiang Province(LDT23F01014F01);The Natural Science Foundation of Zhejiang Province(LDT23F01011F01)

Abstract:

Visual just noticeable distortion (JND) directly reflects the sensitivity of the human visual system to visual signal noise, and is widely used in image and video processing.Aiming at the multilevel prediction problem of video JND threshold, it was transformed into the prediction problem of satisfied user ratio (SUR) curve, and a feature fusion-based SUR curve prediction model was proposed.The model was mainly divided into key frame extraction module, feature extraction and fusion module, and SUR score regression module.In the key frame extraction module, according to the visual perception mechanism, the spatial-temporal domain perception complexity was proposed and used as the video key frame judgment index.In the feature extraction and fusion module, a multi-scale dense residual network was proposed based on dense residual block (RDB) to realize image feature extraction and multi-scale fusion.The experimental results show that the proposed SUR curve prediction model is overall better than the existing models in terms of JND prediction accuracy and reduces the time cost by 8.1% on average in terms of operational efficiency.Meanwhile, the model can also be used to predict other layers of JND thresholds, which can be directly applied to video multilevel perceptual coding optimization.

Key words: just noticeable distortion, deep learning, quality evaluation

CLC Number: 

No Suggested Reading articles found!