Multimedia event detection with multimodal feature fusion and temporal concept localization

作者： Sangmin Oh , Scott McCloskey , Ilseo Kim , Arash Vahdat , Kevin J. Cannons

关键词:

摘要: … of multimedia event detection (MED), where the goal is detecting or classifying video clips by the main event occurring in the clip. In particular, we focus on high-level events such as ‘…

参考文章(57)

Jingchen Liu, Scott McCloskey, Yanxi Liu, Local Expert Forest of Score Fusion for Video Event Classification Computer Vision – ECCV 2012. pp. 397- 410 ,(2012) , 10.1007/978-3-642-33715-4_29

Ilseo Kim, Sangmin Oh, Byungki Byun, A. G. Amitha Perera, Chin-Hui Lee, Explicit performance metric optimization for fusion-based video retrieval international conference on computer vision. pp. 395- 405 ,(2012) , 10.1007/978-3-642-33885-4_40

Aude Oliva, Antonio Torralba, Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope International Journal of Computer Vision. ,vol. 42, pp. 145- 175 ,(2001) , 10.1023/A:1011139631724

Mirella Lapata, Yansong Feng, Topic Models for Image Annotation and Text Illustration north american chapter of the association for computational linguistics. pp. 831- 839 ,(2010)

Ameesh Makadia, Vladimir Pavlovic, Sanjiv Kumar, A New Baseline for Image Annotation Lecture Notes in Computer Science. pp. 316- 329 ,(2008) , 10.1007/978-3-540-88690-7_24

Chin-Hui Lee, F.K. Soong, Bing-Hwang Juang, A segment model based approach to speech recognition international conference on acoustics speech and signal processing. pp. 501- 541 ,(1988) , 10.1109/ICASSP.1988.196629

Jia Deng, Alexander C. Berg, Kai Li, Li Fei-Fei, What does classifying more than 10,000 image categories tell us? european conference on computer vision. pp. 71- 84 ,(2010) , 10.1007/978-3-642-15555-0_6

Lu Jiang, Alexander G. Hauptmann, Guang Xiang, Leveraging high-level and low-level features for multimedia event detection Proceedings of the 20th ACM international conference on Multimedia - MM '12. pp. 449- 458 ,(2012) , 10.1145/2393347.2393412

Yu Tsao, Hanwu Sun, Haizhou Li, Chin-Hui Lee, An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition international conference on acoustics, speech, and signal processing. pp. 4422- 4425 ,(2010) , 10.1109/ICASSP.2010.5495617

10.

Quoc V. Le, Will Y. Zou, Serena Y. Yeung, Andrew Y. Ng, Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis CVPR 2011. pp. 3361- 3368 ,(2011) , 10.1109/CVPR.2011.5995496

Multimedia event detection with multimodal feature fusion and temporal concept localization

来源期刊

我的账户

Multimedia event detection with multimodal feature fusion and temporal concept localization

来源期刊

相似文章 10

我的账户