Learning, detection and representation of multi-agent events in videos

作者: Asaad Hakeem , Mubarak Shah

DOI: 10.1016/J.ARTINT.2007.04.002

关键词:

摘要: In this paper, we model multi-agent events in terms of a temporally varying sequence sub-events, and propose novel approach for learning, detecting representing videos. The proposed has three main steps. First, order to learn the event structure from training videos, automatically encode sub-event dependency graph, which is learnt that depicts conditional between sub-events. Second, pose problem detection videos as clustering maximally correlated sub-events using normalized cuts. principal assumption made work are composed highly chain have high weights (association) within cluster relatively low (disassociation) clusters. does not require prior knowledge number agents involved an make any assumptions about length event. Third, recognize fact abstract should extend representations related human understanding events. Therefore, extension CASE representation natural languages allows plausible means interface users computer. We show results detection, meeting, surveillance, railroad monitoring domains.

参考文章(55)
Gu Xu, Yu-Fei Ma, Hong-Jiang Zhang, Shiqiang Yang, A HMM based semantic analysis framework for sports game event detection international conference on image processing. ,vol. 1, pp. 25- 28 ,(2003) , 10.1109/ICIP.2003.1246889
Yaser Sheikh, Mubarak Shah, Asaad Hakeem, CASE E : a hierarchical event representation for the analysis of videos national conference on artificial intelligence. pp. 263- 268 ,(2004)
Mubarak Shah, Asaad Hakeem, Multiple agent event detection and representation in videos national conference on artificial intelligence. pp. 89- 94 ,(2005)
Stephen S. Intille, Aaron F. Bobick, A framework for recognizing multi-agent action from visual evidence national conference on artificial intelligence. pp. 518- 525 ,(1999)
N. Babaguchi, R. Jain, Event detection from continuous media international conference on pattern recognition. ,vol. 2, pp. 1209- 1212 ,(1998) , 10.1109/ICPR.1998.711915
Omar Javed, Mubarak Shah, Tracking and Object Classification for Automated Surveillance european conference on computer vision. pp. 343- 357 ,(2002) , 10.1007/3-540-47979-1_23
Emmon Bach, Robert T. Harms, Universals in Linguistic Theory ,(1970)
Nir Friedman, Stuart Russell, Kevin Murphy, Learning the structure of dynamic probabilistic networks uncertainty in artificial intelligence. pp. 139- 147 ,(1998)
D. Koller, N. Heinze, H.H. Nagel, Algorithmic characterization of vehicle trajectories from image sequences by motion verbs computer vision and pattern recognition. pp. 90- 95 ,(1991) , 10.1109/CVPR.1991.139667
Fengjun Lv, Ramakant Nevatia, Recognition and Segmentation of 3-D Human Action Using HMM and Multi-class AdaBoost Computer Vision – ECCV 2006. pp. 359- 372 ,(2006) , 10.1007/11744085_28