Bù Đắp Chuyển Động Tầng Bậc cho Mã Hóa Video Hiệu Quả Cao

Journal of Signal Processing Systems - Tập 89 - Trang 363-377 - 2016
Hyo-Song Kim1, Rae-Hong Park1
1Department of Electronic Engineering, Sogang University, Seoul, South Korea

Tóm tắt

Ước lượng chuyển động, kết hợp với bù đắp chuyển động trong giai đoạn giải mã, là một phương pháp cơ bản để đạt được hiệu suất nén cao trong mã hóa video. Phương pháp này đạt được hiệu suất nén cao bằng cách giảm thiểu độ dư thừa tạm thời giữa các hình ảnh liền kề. Các hình ảnh dư tạm thời liền kề cũng chứa một số độ dư thừa mặc dù kỹ thuật ước lượng chuyển động đã loại bỏ một số thông tin dư thừa. Dựa trên đặc điểm này, bài báo này đề xuất một thuật toán bù đắp chuyển động dư đa cấp (MRMC), lặp lại ME và MC trên các tín hiệu dư nhằm kết hợp khả năng giảm thiểu độ dư thừa trong thông tin dư. Bài báo này chủ yếu giải quyết sự đánh đổi giữa số lượng bit cho các tín hiệu dư và các vector chuyển động tương ứng (MVs) dựa trên tối ưu hóa tỷ lệ - méo. Ngoài ra, một phương pháp nhanh được trình bày để sử dụng thực tiễn phương pháp MRMC được đề xuất. Phương pháp được đề xuất đã được triển khai trên mô hình thử nghiệm phần mềm Mã hóa Video Hiệu quả Cao phiên bản 16.0. Kết quả thực nghiệm cho thấy có thể đạt được mức tiết kiệm tỷ lệ Bjøntegaard-delta trung bình là 9.16% với chi phí tăng 17.0% về thời gian mã hóa và 2.3% về thời gian giải mã.

Từ khóa

#bù đắp chuyển động #mã hóa video hiệu quả cao #ước lượng chuyển động #độ dư thừa #tối ưu hóa tỷ lệ-méo

Tài liệu tham khảo

Sullivan, G. J., Ohm, J.-R., Han, W.-J., & Wiegand, T. (2012). Overview of the high efficiency video coding (HEVC) standard. IEEE Transactions on Circuits and Systems for Video Technology, 22(12), 1649–1668. Video Codec for Audiovisual Services at px64 kbit/s. ITU-T Recommendation H.261 (1990). Video Coding for Low Bit Rate Communication. ITU-T Recommendation H.263 (1995). Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to about 1.5 Mbit/s—Part 2: Video. ISO/IEC 11172–2 (MPEG-1) (1993). Coding of Audio-Visual Objects—Part 2: Visual. ISO/IEC 14496–2 (MPEG-4 Visual version 1). (1999). Generic Coding of Moving Pictures and Associated Audio Information—Part 2: Video. ITU-T Recommendation H.262 and ISO/IEC 13818–2 (MPEG 2 Video) (1994). Advanced Video Coding for Generic Audio-Visual Services. ITU-T Recommendation H.264 and ISO/IEC 14496–10 (AVC) (2003). Wiegand, T., Sullivan, G. J., Bjøntegaard, G., & Luthra, A. (2003). Overview of the H.264/AVC video coding standard. IEEE Transactions on Circuits and Systems for Video Technology, 13(7), 560–576. Ciuperca, G., Girardin, V., & Lhote, L. (2011). Computation and estimation of generalized entropy rates for denumerable Markov chains. IEEE Transactions on Information Theory, 57(7), 4026–4034. Sullivan, G. J. & Baker, R. L. (1991). Rate-distortion optimized motion compensation for video compression using fixed or variable size blocks. International Conference on Global Telecommunications, (pp. 85–90). Girod, B. (1993). Motion-compensating prediction with fractional-pel accuracy. IEEE Transactions on Consumer Electronics, 41(4), 604–612. Wiegand, T., Zhang, X., & Girod, B. (1999). Long-term memory motion compensated prediction. IEEE Transactions on Circuits and Systems for Video Technology, 9(1), 70–84. Wiegand, T., Schwarz, H., Joch, A., Kossentini, F., & Sullivan, G. J. (2003). Rate-constrained coder control and comparison of video coding standards. IEEE Transactions on Circuits and Systems for Video Technology, 13(7), 688–703. Schwarz, H., Marpe, D., & Wiegand, T. (2007). Overview of the scalable video coding extension of the H.264/AVC standard. IEEE Transactions on Circuits and Systems for Video Technology, 17(9), 1103–1120. Kim, H., Cosman, P. C., & Milstein, L. B. (2013). Motion- compensated scalable video transmission over MIMO wireless channels. IEEE Transactions on Circuits and Systems for Video Technology, 23(1), 116–127. Schwarz, H., Marpe, D., & Wiegand, T. (2005). Hierarchical B Pictures. Joint Video Team (JVT) Document JVT-P014. Schwarz, H., Hinz, T., Marpe, D., & Wiegand, T. (2005). Constrained interlayer prediction for single-loop decoding in spatial scalability. IEEE International Conference on Image Processing (pp. 870–873). Bross, B., Han, W.-J., Ohm, J.-R., Sullivan, G. J., Wang, Y.-K., & Wiegand, T. (2013). High Efficiency Video Coding (HEVC) Text Specification Draft 10. Joint Collaborative Team on Video Coding (JCT-VC) Document JCTVC-L1003. Ohm, J., Sullivan, G. J., Schwarz, H., Tan, T. K., & Wiegand, T. (2012). Comparison of the coding efficiency of video coding standards—including high efficiency video coding (HEVC). IEEE Transactions on Circuits and Systems for Video Technology, 22(12), 1669–1684. McCann, K., Han, W.-J., & Kim, I.-K. (2010). Samsung’s Response to the Call for Proposals on Video Compression Technology. Joint Collaborative Team on Video Coding (JCT-VC) Document JCTVC-A124. McCann K. (2010). Tool Experiment 12: Evaluation of TMuC Tools. Joint Collaborative Team on Video Coding (JCT-VC) Document JCTVC-B312. K. McCann, W.-J. Han, & I.-K. Kim (2010). Samsung’s Response to the Call for Proposals on Video Compression Technology. Joint Collaborative Team on Video Coding (JCT-VC) Document JCTVC-A124. Kuo, T.-Y., & Lu, H.-J. (2008). Efficient reference frame selector for H.264. IEEE Transactions on Circuits and Systems for Video Technology, 18(3), 400–405. K. Choi, S.-H. Park, & E. S. Jang (2011). Coding tree pruning based CU early termination. Joint Collaborative Team on Video Coding (JCT-VC) Document JCTVC-F092. J. Yang, J. Kim, K. Won, H. Lee, & B. Jeon (2011). Early SKIP Detection for HEVC. Joint Collaborative Team on Video Coding (JCT-VC) Document JCTVC-G543. R. H. Gweon, Y.-L. Lee, & J. Lim (2011). Early Termination of CU Encoding to Reduce HEVC Complexity. Joint Collaborative Team on Video Coding (JCT-VC) Document JCTVC-F045. Kamp, S., & Wien, M. (2012). Decoder-side motion vector derivation for block-based video coding. IEEE Transactions on Circuits and Systems for Video Technology, 22(12), 1732–1745. Peng, W.-H., & Chen, C.-C. (2013). An interframe prediction technique combining template matching prediction and block-motion compensation for high-efficiency video coding. IEEE Transactions on Circuits and Systems for Video Technology, 23(8), 1432–1446. Kim, H.-S., Lee, J.-H., Kim, C.-K., & Kim, B.-G. (2012). Zoom motion estimation using block-based fast local area scaling. IEEE Transactions on Circuits and Systems for Video Technology, 22(9), 1280–1291. Kordasiewicz, R. C., Gallant, M. D., & Shirani, S. (2007). Encoding of affine motion vectors. IEEE Transactions on Multimedia, 9(7), 1346–1356. Kim, D.-Y., Lim, H.-G., & Park, H.-W. (2013). Iterative true motion estimation for motion-compensated frame interpolation. IEEE Transactions on Circuits and Systems for Video Technology, 23(3), 445–454. Tsai, T.-H., & Lin, H.-Y. (2013). Hybrid frame rate upconversion method based on motion vector mapping. IEEE Transactions on Circuits and Systems for Video Technology, 23(11), 1901–1910. Kang, J.-W., Lou, C.-C., Kim, S.-H., & Kuo, C.-C. J. (2013). Efficient HD video coding with joint first-order-residual (FOR) and second-order-residual (SOR) coding technique. Journal of Visual Communication and Image Representation, 24(1), 1–11. Kamisli, F. & Lim, J. S. (2009). Transforms for the motion compensation residual. IEEE International Conference on Acoustics Speech and Signal Processing (pp. 789–792). Kang, J.-W., Gabbouj, M., & Kuo, C.-C. J. (2013). Sparse/DCT (S/DCT) two-layered representation of prediction residuals for video coding. IEEE Transactions on Image Processing, 22(7), 2711–2722. Gu, Z., Lin, W., Lee, B.-S., & Lau, C. T. (2012). Rotated orthogonal transform (ROT) for motion-compensation residual coding. IEEE Transactions on Image Processing, 21(12), 4770–4781. McCann, K., Rosewarne, C., Bross, B., Naccari, M., Sharman, K., & Sullivan, G. J. (2014). High Efficiency Video Coding (HEVC) Test Model 16 (HM 16) Improved Encoder Description. Joint Collaborative Team on Video Coding (JCT-VC) Document JCTVC-S1002. Bossen, F., Bross, B., Suhring, K., & Flynn, D. (2012). HEVC complexity and implementation analysis. IEEE Transactions on Circuits and Systems for Video Technology, 22(12), 1685–1696. Chen, M.-J., Li, G.-L., Chiang, Y.-Y., & Hsu, C.-T. (2006). Fast multiframe motion estimation algorithms by motion vector composition for the MPEG-4/AVC/H.264 standard. IEEE Transactions on Multimedia, 8(3), 478–487. Po, L.-M., Ting, C.-W., Wong, K.-M., & Ng, K.-H. (2007). Novel point-oriented inner searches for fast block motion estimation. IEEE Transactions on Multimedia, 9(1), 9–15. Jin, S.-J., Park, S.-J., & Jeong, J.-C. (2007). Adaptive fast full search algorithm using partitioned region and optimized search order. IEEE Transactions on Consumer Electronics, 53(4), 1703–1711. Kannangara, C. S., Richardson, I. E., Bystrom, M., & Zhao, Y. (2009). Complexity control of H.264/AVC based on mode-conditional cost probability distributions. IEEE Transactions on Multimedia, 11(3), 433–442. Shen, L., Liu, Z., Zhang, X., Zhao, W., & Zhang, Z. (2013). An effective CU size decision method for HEVC encoders. IEEE Transactions on Multimedia, 15(2), 465–470. Xiong, J., Li, H., Wu, Q., & Meng, F. (2014). A fast HEVC inter CU selection method based on pyramid motion divergence. IEEE Transactions on Multimedia, 16(2), 559–564. Tao, B., & Orchard, M. T. (2001). A parametric solution for optimal overlapped block motion compensation. IEEE Transactions on Image Processing, 10(3), 341–350. Tseng, Y.-C., Wu, C.-H., Chen, Y.-W., Wang, T.-W., & Peng W.-H. (2010). On the analysis and design of motion sampling structure for advanced motion-compensated prediction. IEEE International Conference on Image Processing (pp. 949–952). Chen, Y.-W., & Peng, W.-H. (2012). Parametric OBMC for pixel-adaptive temporal prediction on irregular motion sampling grids. IEEE Transactions on Circuits and Systems for Video Technology, 22(1), 113–127. Bossen, F. (2013). Common Test Conditions and Software Reference Configurations. Joint Collaborative Team on Video Coding (JCT-VC) Document JCTVC-L1100. Cho, S., & Kim, M. (2013). Fast CU splitting and pruning for suboptimal CU partitioning in HEVC intra coding. IEEE Transactions on Circuits and Systems for Video Technology, 23(9), 1555–1564. Bjøntegaard G. (2008). Improvements of the BD-PSNR model. Video Coding Experts Group (VCEG) Document VCEG-AI11.