Detection of activities and events without explicit categorization

Authors: Masakazu Matsugu, Masao Yamanaka, Masashi Sugiyama

DOI: 10.1109/ICCVW.2011.6130432

Abstract: We address the problem of unsupervised detection of events (e.g., changes or meaningful states of human activities) without any similarity test against specific models (e.g., probability density estimation or category learning). Rather than estimating probability densities, which are very difficult to calculate in general settings, we formulate event detection as binary classification with density ratio estimation [9] in a hierarchical probabilistic framework. The proposed method takes pairs of video stream data (i.e., past and current) as input at differing time-scales, generates density ratio estimates by way of online learning, and judges whether there is a ‘meaningful difference’ between them based on multiple density ratio estimations. Through experimental studies on real-world scene domains, using challenging datasets from sports scenes (e.g., a tennis match) with complex backgrounds, we demonstrate the potential advantage of our approach over the state of the art in terms of precision and efficiency.
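As a rough illustration of the idea in the abstract (comparing a past window with a current window through a density ratio, at several time-scales), the following is a minimal sketch, not the authors' implementation. It assumes a least-squares density ratio estimator with Gaussian kernels (uLSIF-style) and scores the difference with the estimated Pearson divergence; the abstract does not specify the estimator, so this choice is a stand-in. The function names (`pearson_divergence`, `multiscale_score`), the kernel width `sigma`, the regularizer `lam`, the number of centers, and the window sizes in `scales` are all hypothetical.

```python
import math
import random


def gauss_kernel(x, c, sigma):
    """Gaussian kernel between two feature vectors (sequences of floats)."""
    sq = sum((a - b) ** 2 for a, b in zip(x, c))
    return math.exp(-sq / (2.0 * sigma ** 2))


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[piv] = M[piv], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def pearson_divergence(past, current, sigma=1.0, lam=0.1, n_centers=20, seed=0):
    """Estimate the Pearson divergence from p_past to p_current.

    The ratio r(x) = p_current(x) / p_past(x) is modeled as a weighted sum
    of Gaussian kernels and fitted by regularized least squares, so neither
    density is estimated on its own.
    """
    rng = random.Random(seed)
    centers = rng.sample(current, min(n_centers, len(current)))
    b = len(centers)
    k_de = [[gauss_kernel(x, c, sigma) for c in centers] for x in past]
    k_nu = [[gauss_kernel(x, c, sigma) for c in centers] for x in current]
    # H: second moment of the kernel features under the past window.
    H = [[sum(row[i] * row[j] for row in k_de) / len(past) for j in range(b)]
         for i in range(b)]
    # h: mean of the kernel features under the current window.
    h = [sum(row[i] for row in k_nu) / len(current) for i in range(b)]
    A = [[H[i][j] + (lam if i == j else 0.0) for j in range(b)] for i in range(b)]
    alpha = solve(A, h)
    h_alpha = sum(hi * ai for hi, ai in zip(h, alpha))
    a_h_a = sum(alpha[i] * H[i][j] * alpha[j] for i in range(b) for j in range(b))
    return h_alpha - 0.5 * a_h_a - 0.5


def multiscale_score(stream, t, scales=(20, 40), **kwargs):
    """Largest divergence between [t-w, t) and [t, t+w) over window sizes w."""
    scores = []
    for w in scales:
        if t - w < 0 or t + w > len(stream):
            continue  # this window size does not fit around t
        scores.append(pearson_divergence(stream[t - w:t], stream[t:t + w], **kwargs))
    return max(scores) if scores else 0.0
```

Here `stream` is a list of per-frame feature vectors, and a change would be flagged at time `t` when `multiscale_score(stream, t)` exceeds a threshold chosen by the user. Taking the maximum over scales is one simple way to combine the estimates; the hierarchical probabilistic combination mentioned in the abstract is not reproduced here.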

References (27)
Gu Xu, Yu-Fei Ma, Hong-Jiang Zhang, Shiqiang Yang. A HMM based semantic analysis framework for sports game event detection. International Conference on Image Processing, vol. 1, pp. 25-28 (2003). DOI: 10.1109/ICIP.2003.1246889
Lamberto Ballan, Marco Bertini, Alberto Del Bimbo, Lorenzo Seidenari, Giuseppe Serra. Effective Codebooks for human action categorization. International Conference on Computer Vision, pp. 506-513 (2009). DOI: 10.1109/ICCVW.2009.5457658
Xi Zhou, Xiaodan Zhuang, Shuicheng Yan, Shih-Fu Chang, Mark Hasegawa-Johnson, Thomas S. Huang. SIFT-Bag kernel for video event analysis. Proceedings of the 16th ACM International Conference on Multimedia (MM '08), pp. 229-238 (2008). DOI: 10.1145/1459359.1459391
Ivan Laptev. On Space-Time Interest Points. International Journal of Computer Vision, vol. 64, pp. 107-123 (2005). DOI: 10.1007/S11263-005-1838-7
Wei Jiang, Courtenay Cotton, Shih-Fu Chang, Dan Ellis, Alexander Loui. Short-term audio-visual atoms for generic video concept classification. ACM Multimedia, pp. 5-14 (2009). DOI: 10.1145/1631272.1631277
Lamberto Ballan, Marco Bertini, Alberto Del Bimbo, Giuseppe Serra. Video event classification using string kernels. Multimedia Tools and Applications, vol. 48, pp. 69-87 (2010). DOI: 10.1007/S11042-009-0351-3
Changhu Wang, Lei Zhang, Hong-Jiang Zhang. Learning to reduce the semantic gap in web image retrieval and annotation. Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '08), pp. 355-362 (2008). DOI: 10.1145/1390334.1390396
Bangpeng Yao, Li Fei-Fei. Grouplet: A structured image representation for recognizing human and object interactions. Computer Vision and Pattern Recognition, pp. 9-16 (2010). DOI: 10.1109/CVPR.2010.5540234
Lamberto Ballan, Marco Bertini, Alberto Del Bimbo, Lorenzo Seidenari, Giuseppe Serra. Event detection and recognition for semantic annotation of video. Multimedia Tools and Applications, vol. 51, pp. 279-302 (2011). DOI: 10.1007/S11042-010-0643-7
Tetsu Matsukawa, Takio Kurita. Action Recognition Using Three-Way Cross-Correlations Feature of Local Motion Attributes. International Conference on Pattern Recognition, pp. 1731-1734 (2010). DOI: 10.1109/ICPR.2010.428