Long-term global motion estimation and its application for sprite coding, content description, and segmentation

作者: A. Smolic , T. Sikora , J.-R. Ohm

DOI: 10.1109/76.809158

关键词: Structure from motionMathematicsComputer visionAffine transformationMotion compensationBlock-matching algorithmMotion fieldOptical flowMotion estimationArtificial intelligenceQuarter-pixel motion

摘要: We present a new technique for long-term global motion estimation of image objects. The estimated parameters describe the continuous and time-consistent over whole sequence relatively to fixed reference coordinate system. proposed method is suitable affine as well higher order models like parabolic model-combining advantages feature matching optical flow techniques. A hierarchical strategy applied estimation, first translation, motion, finally parameters, which robust computationally efficient. closed-loop prediction scheme avoid problem error accumulation in estimation. presented results indicate that very accurate approach can be used applications such MPEG-4 sprite coding or MPEG-7 description. also show efficiency significantly increased if model applied, we on-line applications. further demonstrate estimator serves powerful tool segmentation video sequences.

参考文章(11)
A. Azarbayejani, T. Starner, B. Horowitz, A. Pentland, Visually controlled graphics IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 15, pp. 602- 605 ,(1993) , 10.1109/34.216730
A.A. Alatan, L. Onural, M. Wollborn, R. Mech, E. Tuncel, T. Sikora, Image sequence analysis for emerging interactive multimedia services-the European COST 211 framework IEEE Transactions on Circuits and Systems for Video Technology. ,vol. 8, pp. 802- 813 ,(1998) , 10.1109/76.735378
P. Kauff, B. Makai, S. Rauthenberg, U. Golz, J.L.P. De Lameillieure, T. Sikora, Functional coding of video using a shape-adaptive DCT algorithm and an object-based motion prediction toolbox IEEE Transactions on Circuits and Systems for Video Technology. ,vol. 7, pp. 181- 196 ,(1997) , 10.1109/76.554429
Haibo Li, R. Forchheimer, Two-view facial movement estimation IEEE Transactions on Circuits and Systems for Video Technology. ,vol. 4, pp. 276- 287 ,(1994) , 10.1109/76.305872
H. Li, P. Roivainen, R. Forchheimer, 3-D motion estimation in model-based facial image coding IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 15, pp. 545- 555 ,(1993) , 10.1109/34.216724
Chang Seek Choi, K. Aizawa, H. Harashima, T. Takebe, Analysis and synthesis of facial image sequences in model-based image coding IEEE Transactions on Circuits and Systems for Video Technology. ,vol. 4, pp. 257- 275 ,(1994) , 10.1109/76.305871
T. Sikora, The MPEG-4 video standard verification model IEEE Transactions on Circuits and Systems for Video Technology. ,vol. 7, pp. 19- 31 ,(1997) , 10.1109/76.554415
A. Smolic, B. Makai, T. Sikora, Real-time estimation of long-term 3-D motion parameters for SNHC face animation and model-based coding applications IEEE Transactions on Circuits and Systems for Video Technology. ,vol. 9, pp. 255- 263 ,(1999) , 10.1109/76.752093
G. Bozdagi, A.M. Tekalp, L. Onural, 3-D motion estimation and wireframe adaptation including photometric effects for model-based coding of facial image sequences IEEE Transactions on Circuits and Systems for Video Technology. ,vol. 4, pp. 246- 256 ,(1994) , 10.1109/76.305870
Saul A. Teukolsky, Brian P. Flannery, William T. Vetterling, William H. Press, Numerical recipes in C Cambridge University Press. ,(1994)