Journal on Communications (通信学报) ›› 2023, Vol. 44 ›› Issue (10): 58-71. doi: 10.11959/j.issn.1000-436x.2023197

• Academic Paper •

Panoramic video coding based on spherical distortion with spatio-temporal dependency

Xu YANG1,2, Ce ZHU3, Hongwei GUO3,4, Lei LUO1,3

  1. School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
  2. School of Physics and Engineering Technology, Chengdu Normal University, Chengdu 611130, China
  3. School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
  4. School of Engineering, Honghe University, Mengzi 661100, China
  • Revised: 2023-09-28 Online: 2023-10-01 Published: 2023-10-01
  • About the authors:
    Xu YANG (杨栩, 1983- ), male, from Cangxi, Sichuan, is a Ph.D. candidate at Chongqing University of Posts and Telecommunications and a lecturer at Chengdu Normal University. His main research interests are panoramic video coding and communication.
    Ce ZHU (朱策, 1969- ), male, from Zigong, Sichuan, Ph.D., is a professor and doctoral supervisor at the University of Electronic Science and Technology of China. His main research interests are video coding and communication, and video analysis and processing.
    Hongwei GUO (郭红伟, 1980- ), male, of Yi ethnicity, from Jinping, Yunnan, Ph.D., is a professor at Honghe University and a postdoctoral researcher at the University of Electronic Science and Technology of China. His main research interests are video coding and communication.
    Lei LUO (罗雷, 1986- ), male, from Chongqing, Ph.D., is an associate professor at Chongqing University of Posts and Telecommunications and a postdoctoral researcher at the University of Electronic Science and Technology of China. His main research interests are video coding and communication.
  • Supported by:
    The National Natural Science Foundation of China (62020106011); The National Natural Science Foundation of China (U19A2052); The National Natural Science Foundation of China (62061015); The Project of the Science and Technology Department in Sichuan Province (2022ZHCG0116); The Chongqing Natural Science Foundation (2023NSCQ-MSX2930); Youth Innovation Group Support Program of ICE Discipline of CQUPT (SCIE-QN-2022-05)

Abstract:

In panoramic video, the planar coding distortion and the spherical distortion perceived by the viewer lie in different domains, so objective and subjective quality assessments are inconsistent, which degrades coding performance. In addition, independent rate-distortion optimization does not account for the temporal dependency of spherical distortion, leaving room for further coding gains. To address these issues, a spherical distortion model with spatio-temporal dependency was proposed for optimizing panoramic video coding. Firstly, a spatial mapping model from spherical distortion to coding distortion was proposed to bring subjective and objective quality assessments closer to agreement. Secondly, a temporal propagation model for spherical distortion was proposed to improve the overall coding performance of all coding units in the propagation chain. Finally, the spatial mapping weights and temporal propagation weights of spherical distortion were computed and used to adjust the coding parameters. Experimental results show that, under the low-delay coding configuration, the proposed algorithm achieves an average bitrate saving of 7.4% (up to 22.1%) and a 9% reduction in encoding time compared with the versatile video coding benchmark VTM14.0.
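The abstract does not give the authors' weight formulas, so the following is only a rough illustration of the general idea of weight-driven parameter adjustment, not the proposed model. It assumes an equirectangular projection (ERP), uses the widely known WS-PSNR latitude weight as a stand-in for a spatial mapping weight, and assumes the common HEVC/VVC relation in which the Lagrange multiplier scales as 2^((QP - 12) / 3). The function names and the way the temporal factor is combined with the spatial weight are hypothetical.

```python
import math


def erp_row_weight(row: int, height: int) -> float:
    """WS-PSNR-style area weight for one pixel row of an ERP frame.

    Rows near the equator get a weight close to 1; rows near the poles,
    which are oversampled by ERP, get a weight close to 0.
    """
    return math.cos((row + 0.5 - height / 2) * math.pi / height)


def combined_weight(spatial: float, temporal_factor: float) -> float:
    """Hypothetical combination of a spatial weight with a temporal
    propagation factor (>= 0): distortion that propagates to later
    frames is counted more heavily. This is an assumption, not the
    formula used in the paper."""
    return spatial * (1.0 + temporal_factor)


def qp_offset(weight: float) -> float:
    """QP change equivalent to dividing the Lagrange multiplier by `weight`.

    Minimizing weight * D + lambda * R is the same as minimizing
    D + (lambda / weight) * R; with lambda proportional to
    2^((QP - 12) / 3), this corresponds to QP - 3 * log2(weight).
    """
    return -3.0 * math.log2(weight)


if __name__ == "__main__":
    height = 8  # tiny frame height, for illustration only
    for row in range(height):
        w = erp_row_weight(row, height)
        print(f"row {row}: weight {w:.3f}, QP offset {qp_offset(w):+.2f}")
```

Under these assumptions, low-weight regions (near the poles) receive a positive QP offset and are coded more coarsely, while regions whose distortion propagates strongly to later frames receive a smaller or negative offset.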

Key words: panoramic video coding, temporal dependent rate-distortion optimization, projection, spherical distortion, coding distortion

