电信科学 ›› 2017, Vol. 33 ›› Issue (8): 16-25.doi: 10.11959/j.issn.1000-0801.2017248
周建同1,杨海涛1,刘东2,马祥1,王田1
修回日期:
2017-08-08
出版日期:
2017-08-01
发布日期:
2017-08-25
作者简介:
周建同(1980-),男,华为技术有限公司主任工程师,主要研究方向为多媒体应用系统和视频通信。|杨海涛(1983-),男,华为技术有限公司主任工程师,主要研究方向为图像视频处理、压缩和通信。|刘东(1983-),男,中国科学技术大学副教授,主要研究方向为图像视频压缩和多媒体数据挖掘。|马祥(1987-),男,华为技术有限公司工程师,主要研究方向为视频压缩。|王田(1967-),男,华为技术有限公司媒体技术实验室主任,主要研究方向为多媒体通信系统、虚拟/增强现实和计算机视觉。
Jiantong ZHOU1,Haitao YANG1,Dong LIU2,Xiang MA1,Tian WANG1
Revised:
2017-08-08
Online:
2017-08-01
Published:
2017-08-25
摘要:
现有视频编码采用基于块的混合编码架构,利用预测、变换、量化和熵编码技术实现对视频信号的高效压缩。在现有架构基础上进一步优化,提供针对视频图像信号局部特性的更加灵活的处理和编码。基于机器学习的视频编码技术有望部分或全面地改变现有的混合编码框架,给视频编码带来新的研究思路。未来视频除了现有的二维平面视频,还需要编码面向AR/VR应用的球面视频数据和体视频数据,这些新的视频源数据格式也给视频编码技术研究带来新的机会和挑战。
中图分类号:
周建同,杨海涛,刘东,马祥,王田. 视频编码的技术基础及发展方向[J]. 电信科学, 2017, 33(8): 16-25.
Jiantong ZHOU,Haitao YANG,Dong LIU,Xiang MA,Tian WANG. Trends and technologies of video coding[J]. Telecommunications Science, 2017, 33(8): 16-25.
[1] | 施唯佳, 蒋力, 贾立鼎 . OTT TV和IPTV的技术比较分析[J]. 电信科学, 2014,30(5): 15-19,26. |
SHI W J , JIANG L , JIA L D . Technique comparative analysis of OTT TV and IPTV[J]. Telecommunications Science, 2014,30(5): 15-19,26. | |
[2] | 魏峥, 施唯佳, 祝谷乔 . 互联网视频中多屏互动技术的应用[J]. 电信科学, 2014,30(5): 27-32,39. |
WEI Z , SHI W J , ZHU G Q . Multi-screen interaction technologies on internet streaming video[J]. Telecommunications Science, 2014,30(5): 27-32,39. | |
[3] | 张敏, 宋杰, 刘晓峰 . 电信运营商面对 OTT 的战略选择[J]. 电信科学, 2014,30(2): 142-146,151. |
ZHANG M , SONG J , LIU X F . Strategic selection of telecom operators to counter OTT[J]. Telecommunications Science, 2014,30(2): 142-146,151. | |
[4] | MPEG. Presentations of the brainstorming session of the future of video coding standardization:MPEG-w15050[S]. 2014. |
[5] | MPEG. Steps towards a future video compression standard:MPEG-w15272[S]. 2015. |
[6] | MPEG. Requirements for a future video coding standard:MPEG-w15090[S]. 2015. |
[7] | MPEG. Request for contributions on future video compression technology:MPEG-w15273[S]. 2015. |
[8] | JVET. Joint call for evidence on video compression with capability beyond HEVC:JVET-F1002[S]. 2017. |
[9] | MPEG. Joint group on future video coding technology exploration (JVET):MPEG-w15897[S]. 2015. |
[10] | ITU. Coding tools investigation for next generation video coding:ITU-T SG16-C806[S]. 2015. |
[11] | JVET. JVET common test conditions and software reference configurations:JVET-B1010[S]. 2016. |
[12] | JVET. Algorithm description of joint exploration test model 6:JVET-F1001[S]. 2017. |
[13] | YUAN Y , KIM I K , ZHENG X ,et al. Quadtree based nonsquare block structure for inter frame coding in high efficiency video coding[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2012,22(12): 1707-1719. |
[14] | AN J , CHEN Y W , ZHANG K ,et al. Block partitioning structure for next generation video coding:COM 16–C966[S]. 2015. |
[15] | JVET. Multi-type-tree:JVET-D0117[S]. 2016. |
[16] | YANG H , FU J , LIN S ,et al. Description of video coding technology proposal by Huawei Technologies & Hisilicon Technologies[C]// ISO/IEC JTC1/SC29/WG11,JCTVC-A111,April 15-23,2010,Dresden,Germany.[S.1.:s.n]. 2010. |
[17] | KAMP S , WIEN M . Description of video coding technology proposal by RWTH Aachen University[C]// JVT on Video Coding of ITU-T VCEG and ISO/IEC MPEG 1st Meeting,JCTVC,JCTVC-A112,April 15-23,2010,Dresden,Germany.[S.1.:s.n]. 2010. |
[18] | KAMP S , WIEN M . Decoder-side motion vector derivation for block-based video coding[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2012,22(12): 1732-1745. |
[19] | CHIU Y , XU L , ZHANG W ,et al. Description of video coding technology proposal:self derivation of motion estimation and adaptive (Wiener) loop filtering[C]// JCT-VC 1st Meeting,JCTVC-A106,April 15-23,2010,Dresden,Germany.[S.1.:s.n]. 2010. |
[20] | CHEN J , CHIEN W J , KARCZEWICZ M ,et al. Further improvements to HMKTA-1.0[J]. Doc VECG-AZO7, 2015. |
[21] | LIN S , CHEN H , ZHANG H ,et al. Affine transform prediction for next generation video coding[J]. ITU-T SG16 Doc COM16-C1016, 2015. |
[22] | CHEN H , LIANG F , LIN S . Affine SKIP and MERGE modes for video coding[C]// 2015 IEEE 17th International Workshop on Multimedia Signal Processing (MMSP),Oct 19-21,2015,Xiamen,China. New Jersey:IEEE Press, 2015: 1-5. |
[23] | LI L , LI H , LIU D ,et al. An efficient four-parameter affine motion model for video coding[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2017. |
[24] | WITTMANN S , WEDI T . Transmission of post-filter hints for video coding schemes[C]// 2007 IEEE International Conference on Image Processing,Sept 16-Oct 19,San Antonio,TX,USA. New Jersey:IEEE Press, 2007: 81-84. |
[25] | ITU. Adaptive (Wiener) filter for video compression:ITU-T SG16 Contribution C,VCEG-C437[S]. 2008. |
[26] | ITU. Adaptive loop filter for improving coding efficiency:ITU-T SG16 Contribution C,VCEG-C402[S]. 2008. |
[27] | TSAI C Y , CHEN C Y , YAMAKAGE T ,et al. Adaptive loop filtering for video coding[J]. IEEE Journal of Selected Topics in Signal Processing, 2013,7(6): 934-945. |
[28] | JIANG J . Image compression with neural networks–a survey[J]. Signal Processing:Image Communication, 1999,14(9): 737-760. |
[29] | TODERICI G , O'MALLEY S M , HWANG S J ,et al. Variable rate image compression with recurrent neural networks[J]. arXiv preprint arXiv:1511.06085, 2015. |
[30] | TODERICI G , VINCENT D , JOHNSTON N ,et al. Full resolution image compression with recurrent neural networks[J]. arXiv preprint arXiv:1608.05148, 2016. |
[31] | DUMAS T , ROUMY A , GUILLEMOT C . Image compression with stochastic winner-take-all auto-encoder[C]// 2017 IEEE International Conference on Acoustics (ICASSP 2017),March 5-9,2017,New Orleans,USA. New Jersey:IEEE Press, 2017: 1512-1516. |
[32] | PRAKASH A , MORAN N , GARBER S ,et al. Semantic perceptual image compression using deep convolution networks[J]. arXiv preprint arXiv:1612.08712, 2016. |
[33] | BALLé J , LAPARRA V , SIMONCELLI E P . End-to-end optimization of nonlinear transform codes for perceptual quality[J]. arXiv preprint arXiv:1607.05006, 2016. |
[34] | BALLé J , LAPARRA V , SIMONCELLI E P . End-to-end optimized image compression[J]. arXiv preprint arXiv:1611.01704, 2016. |
[35] | DONG C , DENG Y , CHANGE Loy C ,et al. Compression artifacts reduction by a deep convolutional network[C]// 2017 IEEE International Conference on Computer Vision (ICCV 2015),Dec 7-13,2015,Santiago,Chile. New Jersey:IEEE Press, 2017: 576-584. |
[36] | PARK W S , KIM M . CNN-based in-loop filtering for coding efficiency improvement[C]// 2016 IEEE Image,Video,and Multi dimensional Signal Processing Workshop (IVMSP),July 11-12,2016,Bordeaux,France. New Jersey:IEEE Press, 2016: 1-5. |
[37] | DAI Y , LIU D , WU F . A convolutional neural network approach for post-processing in HEVC intra coding[C]// 2017 International Conference on Multimedia Modeling (MMM 2017),January 4-6,2017,Reykjavik,Iceland. Heidelberg:Springer, 2017: 28-39. |
[38] | LIU Z , YU X , CHEN S ,et al. CNN oriented fast HEVC intra CU mode decision[C]// 2016 IEEE International Symposium on Circuits and Systems (ISCAS 2016),May 22-25,2016,Montreal,Canada. New Jersey:IEEE Press, 2016: 2270-2273. |
[39] | LAFRUIT G , QUACKENBUSH S , FOESSEL S ,et al. Technical report of the joint ad hoc group for digital representations of light/sound fields for immersive media applications[R]. 2016. |
[40] | TULVAN C , MEKURIA R , LI Z ,et al. Use cases for point cloud compression[R]. 2016. |
[41] | MEKURIA R , LI Z , TULVAN C . Call for proposals for point cloud compression[R]. 2017. |
[42] | PALOMO C M . Interactive image-based rendering for virtual view synthesis from depth image[D]. Rio de Janeiro:Pontífícia Universidade Católica do Rio de Janeiro, 2009. |
[1] | 郭泽华, 朱昊文, 徐同文. 面向分布式机器学习的网络模态创新[J]. 电信科学, 2023, 39(6): 44-51. |
[2] | 刘雅琼, 吕哲, 赵亚飞, 寿国础. AI技术在卫星通信/互联网领域的应用综述[J]. 电信科学, 2023, 39(2): 10-24. |
[3] | 章坚武, 安彦军, 邓黄燕. DNS攻击检测与安全防护研究综述[J]. 电信科学, 2022, 38(9): 1-17. |
[4] | 祁伟, 殷海兵, 王鸿奎, 黄晓峰, 牛伟宏. 基于统计建模的VVC快速码率估计算法[J]. 电信科学, 2022, 38(12): 35-45. |
[5] | 周志超, 冯毅, 夏小涵, 冯瑜瑶, 蔡超, 邱佳慧, 杨立辉, 乌云霄. 基于移动蜂窝网的机器学习室外指纹定位方案[J]. 电信科学, 2021, 37(8): 85-95. |
[6] | 孟翔, 殷海兵, 黄晓峰. 基于统计建模的HEVC快速率失真估计算法[J]. 电信科学, 2021, 37(1): 58-68. |
[7] | 李想,李原,张子飞,杨哲. 基于密度聚类的网络性能故障大数据分析方法[J]. 电信科学, 2020, 36(9): 51-58. |
[8] | 肖子玉. 面向2030的未来网络关键技术综述——Beyond 5G[J]. 电信科学, 2020, 36(9): 114-121. |
[9] | 邱亚星,王希栋,边森,岳磊. 基于聚类分析和深度学习的多频多模网络负载均衡优化[J]. 电信科学, 2020, 36(7): 156-162. |
[10] | 欧阳晔,杨爱东,孟凡语. 一种博弈论辅助的机器学习算法检测用户流失行为[J]. 电信科学, 2020, 36(6): 79-89. |
[11] | 林锋,徐柳婧,陈晓华,戚伟强,陈可,朱添田. 一种基于多视角特征融合的Webshell检测方法[J]. 电信科学, 2020, 36(6): 125-132. |
[12] | 盛剑涛,陈茂飞,刘东鑫,汪来富,史国水,金华敏. 基于组合分类器的恶意域名检测技术[J]. 电信科学, 2020, 36(5): 47-55. |
[13] | 庞涛,丘海华,潘碧莹. 手机终端人工智能关键技术研究[J]. 电信科学, 2020, 36(5): 145-151. |
[14] | 钟其柱. 基于机器学习分析VoLTE视频通话质量的研究及应用[J]. 电信科学, 2020, 36(3): 156-165. |
[15] | 刘姿杉,程强,吕博. 面向机器学习的隐私保护关键技术研究综述[J]. 电信科学, 2020, 36(11): 18-27. |
阅读次数 | ||||||
全文 |
|
|||||
摘要 |
|
|||||
|