Copyright © 2008 The Institute of Electronics, Information and Communication Engineers
Regular Section -- Papers -- Image Processing and Video Processing |
Video Encoding Scheme Employing Intra and Inter Prediction Based on Averaged Template Matching Predictors
1 The authors are with NTT DoCoMo, Inc., Yokosuka-shi, 239–8536 Japan. E-mail: suzukiyos{at}nttdocomo.co.jp, 2 The author is with M-Sphere Consulting Pte. Ltd., Singapore 808379.
In video compression, the information transmitted from the encoder to the decoder can be classified into two categories: side information, which carries action instructions to be performed, and data such as the residual error of the texture. As video compression technology has matured, better compression has been achieved by increasing the ratio of side information to data, while reducing the overall bit rate. However, there is a limit to this method because the side information becomes a significant fraction of the overall bit rate. In recent video compression technologies, the decoder tends to share the burden of the decision making in order to achieve a higher compression ratio. To further improve the coding efficiency, we tried to provide the decoder with a more active role in reducing the amount of data. According to this approach, by using reconstructed pixels that surround a target block to produce a better sample predictor of the target block, the amount of side information and the residual error of the texture are reduced. Furthermore, multiple candidates of the sample predictor are utilized to create a better sample predictor without increasing the amount of side information. In this paper, we employ a template matching method that makes the decoder more active. The template matching method is applied to the conventional video codec to improve the prediction performance of intra, inter, and bi-directional pictures in video. The results show that improvements in coding efficiency up to 5.8% are achieved.
Key Words: video coding, image coding, image generation and motion compensation
Manuscript received August 8, 2007. Manuscript revised November 14, 2007.
Reference
[1] ITU-T Rec.H261, "Video codec for audio visual services at px64kbit/s," March 1993. [2] "Text of ISO/IEC 11172-2: 1993, Coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbit/s," MPEG-1 Part 2: Video, Aug. 1993. [3] "ITU-T Rec. H.262 (2000) | ISO/IEC 13818-2:2000 (2nd edition), Generic coding of moving pictures and associated audio information," MPEG-2 Part 2:, Dec. 2000. [4] "Text of ISO/IEC 14496-2: 2004 (Third edition), Coding of audio-visual objects," MPEG-4 Part 2: Visual, June 2004. [5] ISO/IEC JTC1 and ITU-T, "H.264/AVC — Advanced video coding for generic audiovisual services," ITU-T Recommendation H.264 and ISO/IEC 14496-10 (MPEG-4 part 10), version 3, 2005. [6] H. Li, A. Lundmark, and R. Forchheimer, "Image sequence coding at very low bitrates: A review," IEEE Trans. Image Process., vol.3, no.5, pp.589–609, Sept. 1994. [7] A.N. Netravali and J.D. Robbins, "Motion-compensated television coding: Part I," Bell Syst. Tech. J., vol.BSTJ-58, no.3, pp.399–412, March 1979. [8] S.N. Efstratiadis and A.K. Katsaggelos, "A model-based pel-recursive motion estimation algorithm," Proc. ICIP 1990, Oct. 1990. [9] B. Girod, A. Aaron, S. Rane, and D. Rebollo-Monedero, "Distributed video coding," Proc. IEEE, Special Issue on Video Coding and Delivery, vol.93, no.1, pp.71–83, Jan. 2005. [10] A.D. Wyner and J. Ziv, "The rate-distortion function for source coding with side information at the decoder," IEEE Trans. Inf. Theory, vol.IT-22, no.1, pp.1–10, Jan. 1976. [11] S. Pradhan, J. Chen, and G.W. Wornell, "The duality between information embedding and source coding with side information and some applications," IEEE Trans. Inf. Theory, vol.49, no.5, pp.1181–1203, May 2003. [12] R. Zamir, "The rate loss in the Wyner-Ziv problem," IEEE Trans. Inf. Theory, vol.42, no.6, pp.2073–2084, Nov. 1996. [13] K. Sugimoto, M. Kobayashi, Y. Suzuki, S. Kato, and C.S. Boon, "Inter frame coding with template matching spatio-temporal prediction," Proc. ICIP 2004, pp.465–468, Singapore, Oct. 2004. [14] M. Kobayashi, Y. Suzuki, C.S. Boon, and T. Horikoshi, "Reduction of information with motion prediction using template matching," Proc. 20th Picture Coding Symposium of Japan, pp.17–18, Nov. 2005. [15] Y. Suzuki, C.S. Boon, and S. Kato, "Block-based reduced resolution inter frame coding with template matching prediction," Proc. ICIP 2006, Atlanta, GA, USA, Oct. 2006. [16] T.K. Tan, C.S. Boon, and Y. Suzuki, "Intra prediction by template matching," Proc. ICIP 2006, Atlanta, GA, USA, Oct. 2006. [17] L. Wei and M. Levoy, "Fast texture synthesis using tree-structures vector quantization," Proc. SIG-GRAPH 2000, pp.479–488, July 2000. [18] M. Ashikhmin, "Synthesizing natural textures," 2001 ACM Symposium on Interactive 3D Graphics, pp.217–226, March 2001. [19] M. Flierl, T. Wiegand, and B. Givod, "Rate-constrained multihypothesis prediction for motion-compensated video compression," IEEE Trans. Circuits Syst. Video Technol., vol.12, no.11, pp.957–969, Nov. 2002. [20] Y. Suzuki and C.S. Boon, "Reduction of coding bits for residuals by smoothing of prediction signal in block-based inter frame coding," Proc. 21st Picture Coding Symposium of Japan, Nov. 2006. [21] Y. Suzuki, C.S. Boon, and T.K. Tan, "Inter frame coding with template matching averaging," Proc. ICIP 2007, San Antonio, TX, USA, Sept. 2007. [22] T.K. Tan, C.S. Boon, and Y. Suzuki, "Intra prediction by averaged template matching predictors," Proc. CCNC 2007, Las Vegas, NV, USA, Jan. 2007. [23] G. Bjontegaard, "Coding improvement by using 4x4 blocks for motion vectors and transform," ITU-T/Study Group 16/Video Coding Experts Group (Question 15), Eibsee, Bavaria, Germany, document Q15-C-23, Dec. 1997. [24] G. Bjontegaard, "Response to call for proposals for H.26L," ITU-T/Study Group 16/Video Coding Experts Group (Question 15), Seoul, Korea, document Q15-F-11, Nov. 1998. [25] Nokia Research Center, "MVC video codec — proposal for H.26L," ITU-T/Study Group 16/Video Coding Experts Group (Question 15), Seoul, Korea, document Q15-F-24, Nov. 1998. [26] G. Conklin, "More results on new intra prediction modes," Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6), Geneva, CH, document JVT-B080, Jan.-Feb.2002. [27] G.J. Sullivan and T. Wiegand, "Rate-distortion optimization for video compression," IEEE Signal Process. Mag., vol.15, no.6, pp.74–90, Nov. 1998. [28] T. Wiegand, H. Schwarz, A. Joch, F. Kossentini, and G.J. Sullivan, "Rate-constrained coder control and comparison of video coding standards," IEEE Trans. Circuits Syst. Video Technol., vol.13, no.7, pp.688–703, July 2003. [29] G. Bjontegaard, "Calculation of average PSNR differences between RD curves," ITU-T SC16/Q6, 13th VCEG Meeting, Austin, Texas, USA, document VCEG-M33, April 2001.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This Article ![]()
![]()
Abstract
![]()
Full Text (PDF)
![]()
Alert me when this article is cited
![]()
Alert me if a correction is posted
![]()
Services ![]()
![]()
Email this article to a friend
![]()
Similar articles in this journal
![]()
Alert me to new issues of the journal
![]()
Add to My Personal Archive
![]()
Download to citation manager
![]()
Request Permissions
![]()
Google Scholar ![]()
![]()
Articles by SUZUKI, Y.
![]()
Articles by TAN, T. K.
![]()
Social Bookmarking ![]()
![]()
What's this?