Fast motion and disparity estimation for HEVC based 3D video coding
Tóm tắt
The emerging international standard for high efficiency video coding (HEVC) based 3D video coding (3D-HEVC) is an extension of HEVC. In the test model of 3D-HEVC, variable size motion estimation (ME) and disparity estimation (DE) are both employed to select the best coding mode for each treeblock in the encoding process. This technique achieves the highest possible coding efficiency, but it brings extremely high computational complexity which limits 3D-HEVC from practical applications. In this paper, a fast ME/DE algorithm based on inter-view and spatial correlations is proposed to reduce 3D-HEVC computational complexity. Since the multi-view videos represent the same scene with similar characteristic, there is a high correlation among the coding information from inter-view prediction. Besides, the homogeneous regions in texture video have a strong spatial correlation, and thus spatially neighboring treeblocks have similar coding information. Therefore, we can determine ME search range and skip some specific ME and DE rarely used in the previously coded view frames and spatially neighboring coding unit. Experimental results demonstrate that the proposed algorithm can significantly reduce computational complexity of 3D-HEVC encoding while maintaining almost the same rate-distortion performance.
Tài liệu tham khảo
Bjontegaard, G. (2001, Apr.). Calculation of average PSNR differences between RD-curves. ITU-T SG16/Q.6 Doc. VCEG-M33. Austin, TX.
Deng, Z., Chan, Y., Jia, K., Fu, C., & Siu, W. (2012). Fast motion and disparity estimation with adaptive search range adjustment in stereoscopic video coding. IEEE Transactions on Broadcasting, 58(1), 24–33.
Kim, Y. T., Kim, J. Y., & Sohn, Kh. (2007). Fast disparity and motion estimation for multi-view video coding. IEEE Transactions on Consumer Electronics, 53(2), 712–719.
Ma, S., Wang, S., & Gao, W. (2014). Low complexity adaptive view synthesis optimization in HEVC based 3D video coding. IEEE Transactions on Multimedia, 16(1), 266–271.
Mora, E., Jung, J., Cagnazzo, M., & Pesquet-Popescu, B. (Sep. 2013). Modification of the merge candidate list for dependent views in 3D-HEVC. In Proceeding on IEEE international conference on image processing, pp. 1709–1713.
Müller, K., Schwarz, H., Marpe, D., Bartnik, C., Bosse, S., Brust, H., et al. (2013). 3D High efficiency video coding for multi-view video and depth data. IEEE Transactions on Image Processing, 22(9), 3366–3378.
Rusanovskyy, D., Mueller, K., & Vetro, A. (2013, Aug.). Common test conditions of 3DV core experiments. Joint collaborative team on 3D video coding extensions (JCT-3V) document JCT3V-E1100, 5th meeting: Vienna, AT.
Shen, L., Liu, Z., Liu, S., Zhang, Z., & An, P. (2009). Selective disparity estimation and variable size motion estimation based on motion homogeneity for multi-view coding. IEEE Transactions on Broadcasting, 55(4), 761–766.
Shen, L., Liu, Z., Yan, T., Zhang, Z., & An, P. (2010). View-adaptive motion estimation and disparity estimation for low complexity multiview video coding. IEEE Transactions on Circuits and Systems for Video Technology, 20(6), 925–930.
Shen, L., Liu, Z., An, P., Ma, R., & Zhang, Z. (2011). Low-complexity mode decision for MVC. IEEE Transactions on Circuits and Systems for Video Technology, 6(21), 837–843.
Sullivan, G. J., Ohm, J.-R., Han, W.-J., & Wiegand, T. (2012). Overview of the high efficiency video coding (HEVC) standard. IEEE Transactions on Circuits and Systems for Video Technology, 22(12), 1649–1668.
Tanimoto, M., Fujii, T., & Suzuki, K. (2008, Feb.). View synthesis algorithmin view synthesis reference software 2.0 (VSRS 2.0). Lausanne, Switzerland, ISO/IEC JTC1/SC29/WG11 M16090.
Tohidypour, H. R., Pourazad, M. T., Nasiopoulos, P., & Leung, V. (2013, Jul. ). A content adaptive complexity reduction scheme for HEVC-Based 3D video coding. In Proceedings on 18th international conference on digital signal processing (DSP 2013), pp. 1–5.
Vetro, A., Wiegand, T., & Sullivan, G. J. (2011). Overview of the stereo and multiview video coding extensions of the H.264/MPEG-4 AVC standard. Proceedings of the IEEE, 99(4), 626–642.
Zhang, L., Chen, Y., Li, X., & Xue, S. (2014, Jun.). Low-complexity advanced residual prediction design in 3D-HEVC. In Proceedings on 2014 IEEE international symposium on circuits and systems (ISCAS), pp. 13–16.
Zhang, L., Tech, G., Wegner, K., & Yea, S. (2013). 3D-HEVC Test Model 5. Joint collaborative team on 3D video coding extensions (JCT-3V) document JCT3V-E1005, 5th meeting: Vienna, AT, 27 July–2 Aug. 2013.
Zhang, Q., An, P., Zhang, Y., Shen, L., & Zhang, Z. (2011). Low complexity multiview video plus depth coding. IEEE Transactions on Consumer Electronics, 57(4), 1857–1865.