Citation: | GUO Hongwei, ZHU Ce, LI Shuai, et al. Manhattan distance based inter-frame weighted prediction algorithm[J]. Journal of Beijing University of Aeronautics and Astronautics, 2019, 45(12): 2415-2422. doi: 10.13700/j.bh.1001-5965.2019.0371(in Chinese) |
Merge mode saves the number of bits required to encode motion information by sharing the motion vector (MV) in neighboring blocks, which improves the rate-distortion performance of encoders effectively. However, motion compensation prediction (MCP) is not accurate enough in the merge mode currently. Therefore, this paper analyses the characteristics of residual distribution after MCP in the merge mode, and presents a Manhattan distance based weighted prediction method as an additional candidate for the merge mode. First, several predicted blocks are obtained by MCP with motion vectors in neighboring candidates. Second, the additional candidate is obtained by a weighted average method according to Manhattan distances from the neighboring candidate to the pixel points in the predicted blocks obtained. Finally, the best merge mode is selected by rate distortion optimization (RDO) among the additional candidate and the original candidates. The experimental results show that, on the joint exploration test model 7.0 (JEM 7.0), the proposed method achieves rate distortion performance improvement under the different configurations of encoder, where a bitrate saving of 1.34% on average is obtained under the configuration of low delay P frame.
[1] |
马思伟.AVS视频编码标准技术回顾及最新进展[J].计算机研究与发展, 2015, 52(1):27-37. http://d.old.wanfangdata.com.cn/Periodical/jsjyjyfz201501004
MA S W.History and recent developments of AVS video coding standards[J].Journal of Computer Research and Development, 2015, 52(1):27-37(in Chinese). http://d.old.wanfangdata.com.cn/Periodical/jsjyjyfz201501004
|
[2] |
SULLIVAN G J, OHM J R, HAN W J, et al.Overview of the high efficiency video coding(HEVC) standard[J].IEEE Transactions on Circuits and Systems for Video Technology, 2012, 22(12):1649-1668. doi: 10.1109/TCSVT.2012.2221191
|
[3] |
GAO Y, ZHU C, LI S, et al.Temporal dependent rate-distortion optimization for low-delay hierarchical video coding[J].IEEE Transactions on Image Processing, 2017, 26(9):4457-4470. doi: 10.1109/TIP.2017.2713598
|
[4] |
LI S, ZHU C, GAO Y B, et al.Lagrangian multiplier adaptation for rate-distortion optimization with inter-frame dependency[J].IEEE Transactions on Circuits and Systems for Video Technology, 2016, 26(1):117-129. doi: 10.1109/TCSVT.2015.2450131
|
[5] |
LI S, ZHU C, GAO Y, et al.Inter-frame dependent rate-distortion optimization using Lagrangian multiplier adaption[C]//Proceedings of IEEE International Conference on Multimedia and Expo(ICME).Piscataway, NJ: IEEE Press, 2015: 1-6.
|
[6] |
HELLE P, OUDIN S, BROSS B, et al.Block merging for quadtree-based partitioning in HEVC[J].IEEE Transactions on Circuits and Systems for Video Technology, 2012, 22(12):1720-1731. doi: 10.1109/TCSVT.2012.2223051
|
[7] |
PAN Z Q, KWONG S, SUN M T, et al.Early merge mode decision based on motion estimation and hierarchical depth correlation for HEVC[J].IEEE Transactions on Broadcasting, 2014, 60(2):405-412. doi: 10.1109/TBC.2014.2321682
|
[8] |
TARIQ J, KWONG S, YUAN H.Spatial/temporal motion consistency based merge mode early decision for HEVC[J].Journal of Visual Communication and Image Representation, 2017, 44:198-213. doi: 10.1016/j.jvcir.2017.01.029
|
[9] |
蒋洁, 叶德周, 潘勉.一种基于自适应阈值的Merge模式快速选择方法[J].光电子·激光, 2016, 27(9):980-986. http://www.wanfangdata.com.cn/details/detail.do?_type=perio&id=gdzjg201609013
JIANG J, YE D Z, PAN M.A fast candidate selection method for Merge mode based on adaptive threshold[J].Journal of Optoelectronics·Laser, 2016, 27(9):980-986(in Chinese). http://www.wanfangdata.com.cn/details/detail.do?_type=perio&id=gdzjg201609013
|
[10] |
KIM T S, RHEE C E, LEE H J.Merge mode estimation for a hardware-based HEVC encoder[J].IEEE Transactions on Circuits and Systems for Video Technology, 2016, 26(1):195-209. doi: 10.1109/TCSVT.2015.2496820
|
[11] |
SONG Y X, JIA K B.Early merge mode decision for texture coding in 3D-HEVC[J].Journal of Visual Communication and Image Representation, 2015, 33:60-68. doi: 10.1016/j.jvcir.2015.07.001
|
[12] |
HEO Y S, BANG G, PARK G H.Adaptive merge list construction for 3D-HEVC fast encoder[J].Electronics Letters, 2016, 52(8):604-690. doi: 10.1049/el.2015.4323
|
[13] |
宋雨新, 贾克斌, 吴强.3D-HEVC合并模式快速判决方法研究[J].信号处理, 2016, 32(1):46-55. http://www.wanfangdata.com.cn/details/detail.do?_type=perio&id=xhcl201601007
SONG Y X, JIA K B, WU Q.Research on fast merge mode decision method for 3D-HEVC[J].Journal of Signal Processing, 2016, 32(1):46-55(in Chinese). http://www.wanfangdata.com.cn/details/detail.do?_type=perio&id=xhcl201601007
|
[14] |
WU J F, GUO B L, HOU J, et al.Fast CU encoding schemes based on merge mode and motion estimation for HEVC inter prediction[J].KSII Transactions on Internet and Information Systems, 2016, 10(3):1195-1211. http://cn.bing.com/academic/profile?id=cc02e852a11680f79c55a31e336390c6&encoded=0&v=paper_preview&mkt=zh-cn
|
[15] |
CHENG Z X, SUN H M, ZHOU D J, et al.Accelerating HEVC inter prediction with improved merge mode handling[J].IEICE Transactions on Fundamentals of Electronics Communications and Computer Sciences, 2017, 100(2):546-554. http://www.wanfangdata.com.cn/details/detail.do?_type=perio&id=J-STAGE_660436
|
[16] |
VANNE J, VIITANEN M, HAMALAINEN T D.Efficient mode decision schemes for HEVC inter prediction[J].IEEE Transactions on Circuits and Systems for Video Technology, 2014, 24(9):1579-1593. doi: 10.1109/TCSVT.2014.2308453
|
[17] |
ZHANG N, FAN X P, ZHAO D B, et al.Merge mode for deformable block motion information derivation[J].IEEE Transactions on Circuits and Systems for Video Technology, 2017, 27(11):2437-2449. doi: 10.1109/TCSVT.2016.2589818
|
[18] |
CHEN J, ALSHINA E, SULLIVAN G J, et al.Algorithm description of joint exploration test model 7(JEM 7)[C]//Joint Video Exploration Team(JVET) of ITU-T SG16WP3 and ISO/IEC JTC 1/SC 29/WG 117th Meeting, 2017: 10-15.
|
[19] |
SUEHRING K, LI X.JVET common test conditions and software reference configurations[C]//Joint Video Exploration Team (JVET) of ITU-T SG16WP3 and ISO/IEC JTC 1/SC 29/WG112nd Meeting, 2016: 1-4.
|