Concept detection and keyframe extraction using a visual thesaurus

作者: Evaggelos Spyrou , Giorgos Tolias , Phivos Mylonas , Yannis Avrithis

DOI: 10.1007/S11042-008-0237-9

关键词: Artificial intelligenceRepresentation (mathematics)Cluster analysisThesaurus (information retrieval)Computer visionBasis (linear algebra)Selection (linguistics)Latent semantic analysisFrame (networking)SequenceComputer science

摘要: This paper presents a video analysis approach based on concept detection and keyframe extraction employing visual thesaurus representation. Color texture descriptors are extracted from coarse regions of each frame is constructed after clustering regions. The clusters, called region types, used as basis for representing local material information through the construction model vector frame, which reflects composition image in terms types. Model representation selection either shot or across an entire sequence. process ensures that all types represented. A number high-level detectors then trained using global annotation Latent Semantic Analysis applied. To enhance performance per shot, employed selected keyframes framework proposed working very large data sets.

参考文章(42)
Hari Sundaram, Shih-Fu Chang, Video Analysis and Summarization at Structural and Semantic Levels Springer Berlin Heidelberg. pp. 75- 94 ,(2003) , 10.1007/978-3-662-05300-3_4
Bertrand Le Saux, Giuseppe Amato, IMAGE CLASSIFIERS FOR SCENE ANALYSIS ICCVG. pp. 39- 44 ,(2006) , 10.1007/1-4020-4179-9_7
Ronald R. Yager, Henri Prade, Didier Dubois, Fuzzy information engineering: a guided tour of applications John Wiley & Sons, Inc.. ,(1997)
Svetlana Lazebnik, Cordelia Schmid, Jean Ponce, A Discriminative Framework for Texture and Object Recognition Using Local Image Features Toward Category-Level Object Recognition. ,vol. 4170, pp. 423- 442 ,(2006) , 10.1007/11957959_22
Javier Molina, Evaggelos Spyrou, Natasa Sofou, José M. Martínez, On the Selection of MPEG-7 Visual Descriptors and Their Level of Detail for Nature Disaster Video Sequences Classification Semantic Multimedia. pp. 70- 73 ,(2007) , 10.1007/978-3-540-77051-0_6
Yannis Avrithis, Hervé Le Borgne, Evaggelos Spyrou, Noel O'Connor, Eddie Cooke, Theofilos Mailis, Fusing MPEG-7 visual descriptors for image classification international conference on artificial neural networks. pp. 847- 852 ,(2005) , 10.1007/11550907_134
G. Csurka, Visual categorization with bags of keypoints european conference on computer vision. ,vol. 1, pp. 22- ,(2004)