A subspace approach to layer extraction, patch-based SFM, and video compression

Authors: Qifa Ke, Takeo Kanade

DOI: 10.1184/R1/6591431.V1

Keywords:

Abstract: Representing videos with layers has important applications such as video compression, motion analysis, 3D modeling and rendering. This thesis proposes a subspace approach to extracting layers from video by taking advantage of the fact that the homographies induced by planar patches in the scene form a low dimensional linear subspace. In the subspace, layers in the input images are mapped onto well-defined clusters, which can be reliably identified by a standard clustering algorithm (e.g., mean-shift). Global optimality is achieved since both spatial and temporal redundancy are simultaneously taken into account, and noise can be effectively reduced by enforcing the subspace constraint. The existence of the subspace also enables outlier detection, making the subspace computation robust. Based on the subspace constraint, we propose a patch-based scheme for affine structure from motion (SFM), which recovers the plane equation of each planar patch in the scene, as well as the camera epipolar geometry. We propose two approaches to patch-based SFM: (1) a factorization approach; and (2) a layer based approach. Patch-based SFM provides a compact video representation that can be used to construct a high quality texture map for each layer. We plan to apply our approach to generating Video Object Planes (VOPs) as defined by the MPEG-4 standard. VOP generation is a critical but unspecified step in the MPEG-4 standard. Our motion model consists of a global motion and localized deformations, and has a closed-form solution. Our goals are: (1) combining cues at different levels to model VOPs; and (2) extracting VOPs that undergo more complicated (non-planar or non-rigid) motion.
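The subspace property the abstract relies on can be checked on toy data. The sketch below is an illustration only, not the thesis's algorithm: it assumes the standard two-view form H = A + a vᵀ for the homography induced by a plane, where A and a depend only on the two cameras and v encodes the plane (the sign and scale conventions are an assumption here). All names (`plane_homography`, `rank`, `W`) are hypothetical. Flattening each homography into a 9-vector and stacking one row per plane gives a matrix whose rank is at most 4, however many planes are used.

```python
from fractions import Fraction
import random


def plane_homography(A, a, v):
    """Return H = A + a v^T; A (3x3) and a (3-vector) are shared by all planes."""
    return [[A[r][c] + a[r] * v[c] for c in range(3)] for r in range(3)]


def rank(rows):
    """Exact matrix rank via Gaussian elimination over the rationals."""
    m = [[Fraction(x) for x in row] for row in rows]
    rk = 0
    for col in range(len(m[0])):
        # Find a row at or below the current pivot row with a nonzero entry.
        pivot = next((r for r in range(rk, len(m)) if m[r][col] != 0), None)
        if pivot is None:
            continue
        m[rk], m[pivot] = m[pivot], m[rk]
        for r in range(rk + 1, len(m)):
            factor = m[r][col] / m[rk][col]
            m[r] = [x - factor * y for x, y in zip(m[r], m[rk])]
        rk += 1
    return rk


rng = random.Random(0)
# Synthetic camera-dependent terms (integers keep the arithmetic exact).
A = [[rng.randint(-5, 5) for _ in range(3)] for _ in range(3)]
a = [rng.randint(1, 5) for _ in range(3)]
# Twenty synthetic planes, each described by a 3-vector v.
planes = [[rng.randint(-5, 5) for _ in range(3)] for _ in range(20)]

# One row per plane: the 9 entries of its homography, flattened.
W = [[h for row in plane_homography(A, a, v) for h in row] for v in planes]
subspace_dim = rank(W)
print("planes:", len(W), "subspace dimension:", subspace_dim)
```

Every row of `W` is the flattened A plus a linear combination of three fixed 9-vectors built from a, which is why the rank cannot exceed 4. On real data the homographies are noisy and only defined up to scale, so a practical method would need normalization and a robust low-rank fit rather than an exact rank test.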

References (11)
Joseph L. Mundy, Andrew Zisserman. Geometric invariance in computer vision. MIT Press (1992).
James R. Bergen, P. Anandan, Keith J. Hanna, Rajesh Hingorani. Hierarchical model-based motion estimation. Computer Vision - ECCV'92, pp. 237-252 (1992). DOI: 10.1007/3-540-55426-2_27
R. Gnanadesikan, J. R. Kettenring. Robust estimates, residuals, and outlier detection with multiresponse data. Biometrics, vol. 28, pp. 81- (1972). DOI: 10.2307/2528963
Chris Harris. Structure-from-motion under orthographic projection. Image and Vision Computing, vol. 9, pp. 329-332 (1991). DOI: 10.1016/0262-8856(91)90037-P
George H. Dunteman. Principal Components Analysis (1989).
Martin A. Fischler, Robert C. Bolles. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM, vol. 24, pp. 381-395 (1981). DOI: 10.1145/358669.358692
A. Jepson, M.J. Black. Mixture models for optical flow computation. Computer Vision and Pattern Recognition, pp. 760-761 (1993). DOI: 10.1109/CVPR.1993.341161
J.Y.A. Wang, E.H. Adelson. Layered representation for motion analysis. Computer Vision and Pattern Recognition, pp. 361-366 (1993). DOI: 10.1109/CVPR.1993.341105
Carlo Tomasi, Takeo Kanade. Shape and motion from image streams under orthography: a factorization method. International Journal of Computer Vision, vol. 9, pp. 137-154 (1992). DOI: 10.1007/BF00129684
J.Y.A. Wang, E.H. Adelson. Representing moving images with layers. IEEE Transactions on Image Processing, vol. 3, pp. 625-638 (1994). DOI: 10.1109/83.334981