电信科学 ›› 2023, Vol. 39 ›› Issue (7): 99-108.doi: 10.11959/j.issn.1000-0801.2023132

• 研究与开发 • 上一篇    下一篇

一种基于深度可分离卷积的VVC帧内编码快速块划分算法

叶振1, 王国相1, 宋俊锋1, 刘昊坤2, 黎天送2   

  1. 1 丽水学院,浙江 丽水 323000
    2 重庆师范大学,重庆 401331
  • 修回日期:2023-06-13 出版日期:2023-07-20 发布日期:2023-07-01
  • 作者简介:叶振(1984- ),男,博士,丽水学院计算机系讲师,主要研究方向为人工智能、视频图像处理
    王国相(1989- ),男,丽水学院助教,主要研究方向为信号处理、人工智能、视频图像处理
    宋俊锋(1984- ),男,丽水学院高级实验师,主要研究方向为虚拟现实、人工智能
    刘昊坤(1999- ),男,重庆师范大学计算机与信息科学学院硕士生,主要研究方向为H.266/VVC视频编码、深度学习
    黎天送(1987- ),男,博士,重庆师范大学计算机与信息科学学院讲师,主要研究方向为图像/视频编码、多视点视频编码、多媒体信号处理、人工智能
  • 基金资助:
    重庆市科技局自然基金项目(CSTB2022NSCQ-MSX1231);重庆市教委青年项目(KJQN202200519);重庆师范大学人才基金项目(21XLB031)

A fast block partitioning algorithm for VVC intra coding based on depthwise separable convolution

Zhen YE1, Guoxiang WANG1, Junfeng SONG1, Haokun LIU2, Tiansong LI2   

  1. 1 Lishui University, Lishui 323000, China
    2 Chongqing Normal University, Chongqing 401331, China
  • Revised:2023-06-13 Online:2023-07-20 Published:2023-07-01
  • Supported by:
    The Natural Science Foundation Project of Chongqing Science and Technology Bureau(CSTB2022NSCQ-MSX1231);The Science and Technology Research Program of Chongqing Municipal Education Commission(KJQN202200519);The Talents Fund Project of Chongqing Normal University(21XLB031)

摘要:

最近,联合视频探索工作组(JVET)将通用视频编码(VVC)作为新一代视频编码标准,它利用复杂的四叉树加多类型树(QTMTT)划分结构有效地提升了编码性能,但也导致编码复杂度急剧攀升,大幅地增加了编码时间。为解决上述问题,提出了一种基于深度可分离卷积的VVC帧内编码快速块划分算法,将编码单元(CU)的原始像素值作为输入,利用轻量化的深度可分离卷积神经网络提取CU纹理信息特征指导CU的划分模式选择,实现精准的划分模式预测。该方案通过跳过低概率的划分模式,减少CU划分模式的遍历,大幅地降低编码器的复杂度。实验结果表明,所提算法在VTM 15.2平台上实现了18%~48%的编码时间节省,仅仅带来了平均0.15%的性能损失,并且轻量化的深度可分离卷积计算带来的额外复杂性也可以忽略不计。

关键词: 视频编码, 深度学习, 帧内编码, 编码单元划分

Abstract:

The joint video exploration team (JVET) proposed versatile video coding (VVC) as a new video coding standard, and its quadtree plus multi-type tree (QTMTT) partition structure brings effective coding performance improvements.However, it brings about a sharp increase in encoding complexity, which greatly increases the encoding time.In order to solve the above problems, a fast block partitioning algorithm for VVC intra coding based on depthwise separable convolution was proposed.The pixel of coding unit (CU) was used as input, and the texture information feature of CU was extracted through depth-separable convolution.Therefore, accurate partition mode prediction was realized in the QTMT structure in VVC, and the complexity of the encoder was reduced by skipping low-probability partition modes.Experimental results show that the proposed algorithm saves 18% to 48% of encoding time on the VTM 15.2, and only brings an average performance loss of 0.15%.And the additional complexity brought by the lightweight depthwise separable convolution calculation is also negligible.

Key words: video coding, deep learning, intra coding, coding unit division

中图分类号: 

No Suggested Reading articles found!