通信学报 ›› 2016, Vol. 37 ›› Issue (11): 181-188.doi: 10.11959/j.issn.1000-436x.2016235

• 学术通信 • 上一篇    下一篇

基于多类支持向量机的3D-HEVC深度视频帧内编码快速算法

刘晟,彭宗举,陈嘉丽,陈芬,郁梅,蒋刚毅   

  1. 宁波大学信息科学与工程学院,浙江 宁波315211
  • 出版日期:2016-11-25 发布日期:2016-11-30
  • 基金资助:
    国家高技术研究发展计划(“863”计划)基金资助项目;国家自然科学基金资助项目;国家自然科学基金资助项目;国家自然科学基金资助项目;浙江省自然科学基金资助项目;浙江省自然科学基金资助项目;浙江省自然科学基金资助项目;宁波市自然科学基金资助项目;宁波市自然科学基金资助项目;宁波大学科研基金(理)/学科基金资助项目

Multi-class support vector machine-based fast algorithm for 3D-HEVC depth video intra coding

Sheng LIU,Zong-ju PENG,Jia-li CHEN,Fen CHEN,Mei YU,Gang-yi JIANG   

  1. Faculty of Information Science and Engineering, Ningbo University, Ningbo 315211, China
  • Online:2016-11-25 Published:2016-11-30
  • Supported by:
    The National High-Tech R&D Program of China (863 Program);The National Natural Science Foundation of China;The National Natural Science Foundation of China;The National Natural Science Foundation of China;The Natural Science Foundation of Zhejiang Prov-ince;The Natural Science Foundation of Zhejiang Prov-ince;The Natural Science Foundation of Zhejiang Prov-ince;The Natural Science Foundation of Ningbo;The Natural Science Foundation of Ningbo;Ningbo University Research Foundation (Science)/Discipline Project

摘要:

深度视频编码中最优深度划分和模式选择过程具有非常高的计算复杂度。提出了基于多类支持向量机(MSVM, multi-class support vector machine)的深度视频帧内编码快速算法。该算法包括离线模型训练和快速编码2个部分。在离线模型训练中,用深度视频最大编码单元(LCU, largest coding unit)的最优划分深度作为标签,当前LCU的空域复杂度、空域相邻LCU的最优划分深度和彩色视频对应LCU的最优划分深度作为特征去构造MSVM模型。在编码时,提取LCU的特征,根据MSVM模型得到划分深度的预测值。根据该预测值提前终止编码单元递归划分和模式选择过程。实验结果表明,提出的算法在几乎不影响虚拟视点质量的情况下,平均节省35.91%的总体编码时间和40.04%的深度编码时间。

关键词: 深度视频编码, 3D-HEVC, 帧内编码, 最大编码单元, 多类支持向量机

Abstract:

The recursive splitting process of largest coding unit (LCU) and the mode search process of coding unit imposed enormous computational complexity on encoder. A multi-class support vector machine-based (MSVM) fast coding unit (CU) size decision algorithm for 3D-HEVC depth video intra-coding was proposed. The algorithm included two steps: off-line training and fast CU size and mode decision. In the process of off-line training, a MSVM model was constructed, where the texture complexity of current LCU, the optimal partition depth of its spatial neighboring LCU and co-located LCU in texture video were treated as feature vectors, and the optimal partition depth of LCU was utilized as corresponding class label. In the process of fast CU size and mode decision, features of LCU were extracted before cod-ing a LCU, then, a MSVM model was used to predict the class label. Finally, the class label that represents the largest parti-tion depth of the current LCU was employed to terminate the CU recursive splitting process and CU mode search process. Experimental results show that the proposed algorithm saves the encoding time of 3D-HEVC by 35.91% on average, and the encoding time of depth video by 40.04% on average, with negligible rendered virtual view image degradation.

Key words: depth video coding, 3D-HEVC, intra coding, largest coding unit, multi-class support vector machine

No Suggested Reading articles found!