Actions as Space-Time Shapes

作者: L. Gorelick , M. Blank , E. Shechtman , M. Irani , R. Basri

DOI: 10.1109/TPAMI.2007.70711

关键词: Computer visionSilhouetteFeature extractionSpace timeTorsoArtificial intelligenceMathematicsCognitive neuroscience of visual object recognitionPoisson's equationCluster analysisShape analysis (digital geometry)

摘要: Human action in video sequences can be seen as silhouettes of a moving torso and protruding limbs undergoing articulated motion. We regard human actions three-dimensional shapes induced by the space-time volume. adopt recent approach [14] for analyzing 2D generalize it to deal with volumetric shapes. Our method utilizes properties solution Poisson equation extract features such local saliency, dynamics, shape structure, orientation. show that these are useful recognition, detection, clustering. The is fast, does not require alignment, applicable (but limited to) many scenarios where background known. Moreover, we demonstrate robustness our partial occlusions, nonrigid deformations, significant changes scale viewpoint, high irregularities performance an action, low-quality video.

参考文章(32)
Niyogi, Adelson, Analyzing and recognizing walking figures in XYT computer vision and pattern recognition. pp. 469- 474 ,(1994) , 10.1109/CVPR.1994.323868
Yan Ke, R. Sukthankar, M. Hebert, Efficient visual event detection using volumetric features international conference on computer vision. ,vol. 1, pp. 166- 173 ,(2005) , 10.1109/ICCV.2005.85
A.F. Bobick, J.W. Davis, The recognition of human movement using temporal templates IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 23, pp. 257- 267 ,(2001) , 10.1109/34.910878
Andrew Ng, Michael Jordan, Yair Weiss, None, On Spectral Clustering: Analysis and an algorithm neural information processing systems. ,vol. 14, pp. 849- 856 ,(2001)
Olivier Chomat, JéerΩe Martin, James L. Crowley, A Probabilistic Sensor for the Perception and the Recognition of Activities Computer Vision - ECCV 2000. pp. 487- 503 ,(2000) , 10.1007/3-540-45054-8_32
Efros, Berg, Mori, Malik, Recognizing action at a distance international conference on computer vision. pp. 726- 733 ,(2003) , 10.1109/ICCV.2003.1238420
H. Blum, A transformation for extracting new descriptors of shape Models for the preception of speech and visual form. ,(1967)
Stefan Carlsson, Josephine Sullivan, Action Recognition by Shape Matching to Key Frames ,(2002)
Ramprasad Polana, Randal C. Nelson, Detection and Recognition of Periodic, Nonrigid Motion International Journal of Computer Vision. ,vol. 23, pp. 261- 282 ,(1997) , 10.1023/A:1007975200487
Thomas B. Sebastian, Philip N. Klein, Benjamin B. Kimia, Shock-Based Indexing into Large Shape Databases european conference on computer vision. pp. 731- 746 ,(2002) , 10.1007/3-540-47977-5_48