Distributed multiview video coding with 3D-DCT transform domain Wyner-Ziv codec
Abstract
The need for efficient multiview video coding schemes is expected to increase strongly in the near future. The distributed multiview video coding (DMVC) approach is promising because it can achieve good compression efficiency while keeping the complexity low. The main contribution of this paper is to investigate how the classic DMVC framework based on transform domain Wyner-Ziv (WZ) video coding (TDWZ) can be improved by introducing the 3D-DCT. The main advantage of this combination is the limited computational complexity of the overall framework, which does not penalise compression performance because the 3D-DCT exploits both spatial and temporal correlation. The framework is designed flexibly so that it can handle both traditional and residual-frame-based WZ coding. The simulation results confirm the validity of the proposed framework in terms of video quality, with gains of up to 4.4 dB PSNR compared to a pixel domain WZ technique and up to 0.6 dB PSNR compared to a 2D-DCT-based one.
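To illustrate why a 3D-DCT can exploit spatial and temporal correlation together, the following is a minimal sketch of a separable 3D-DCT applied to a small cube of frames. It is not the codec described in the paper: the 4x4x4 cube size, the orthonormal DCT-II scaling, and the names `dct_1d`, `dct_3d`, and `cube` are assumptions made for illustration only.

```python
import math


def dct_1d(x):
    """Orthonormal DCT-II of a 1-D sequence (illustrative helper)."""
    n = len(x)
    out = []
    for k in range(n):
        s = sum(x[i] * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                for i in range(n))
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        out.append(scale * s)
    return out


def dct_3d(cube):
    """Separable 3D-DCT of cube[t][y][x]: 1-D DCT along x, then y, then t."""
    T, H, W = len(cube), len(cube[0]), len(cube[0][0])

    # Pass 1: transform every row (x direction).
    a = [[dct_1d(cube[t][y]) for y in range(H)] for t in range(T)]

    # Pass 2: transform every column (y direction).
    b = [[[0.0] * W for _ in range(H)] for _ in range(T)]
    for t in range(T):
        for x in range(W):
            col = dct_1d([a[t][y][x] for y in range(H)])
            for y in range(H):
                b[t][y][x] = col[y]

    # Pass 3: transform along time (t direction).
    c = [[[0.0] * W for _ in range(H)] for _ in range(T)]
    for y in range(H):
        for x in range(W):
            tube = dct_1d([b[t][y][x] for t in range(T)])
            for t in range(T):
                c[t][y][x] = tube[t]
    return c


# Hypothetical input: the same 4x4 spatial ramp repeated over four frames,
# i.e. a perfectly static scene with full temporal correlation.
cube = [[[float(x + y) for x in range(4)] for y in range(4)] for t in range(4)]
coeffs = dct_3d(cube)
```

For this static input, every coefficient with a temporal index greater than zero vanishes up to rounding error, so all of the signal energy sits in the first temporal plane. Real sequences are not perfectly static, but strong temporal correlation similarly concentrates energy in a small number of coefficients.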