Videos Semantic Indexing using Image Classification.

作者： Fengjun Lv , Shenghuo Zhu , Mert Dikmen , Yuanqing Lin , Thomas S. Huang

DOI:

关键词: Computer science 、 TRECVID 、 Software 、 Support vector machine 、 Search engine indexing 、 Artificial intelligence 、 Pattern recognition 、 NIST 、 Learning to rank 、 Contextual image classification 、 Coding (social sciences)

摘要: This notebook paper summarizes Team NEC-UIUC’s approaches for TRECVid 2010 Evaluation of Semantic Indexing. Our submissions mainly take advantage advanced image classification methods using linear coordinate coding (LCC) local features powered by the distributed computing software Hadoop. For every video shot, we evenly sample key frames and extract dense including DHOG LBP, which are encoded coding. Then, concept large-scale SVM classifiers trained based on spatial pyramid LCC features. Finally, employ multiple instance learning to rank shots according scores individual frames. systems achieve mean extended inferred average precision (mean xinfAP) 7.40% 30 concepts evaluated NIST 28.63% 1/5 development data as validation set total 130 concepts.

illinois.edu 本地加速

uni-trier.de 本地加速

nist.gov PDF 下载加速

参考文章(8)

Hsiang-Fu Yu, Cho-Jui Hsieh, Kai-Wei Chang, Chih-Jen Lin, Large linear classification when data cannot fit in memory knowledge discovery and data mining. pp. 833- 842 ,(2010) , 10.1145/1835804.1835910

B. T. Polyak, A. B. Juditsky, Acceleration of stochastic approximation by averaging Siam Journal on Control and Optimization. ,vol. 30, pp. 838- 855 ,(1992) , 10.1137/0330046

Rong-En Fan, Kai-Wei Chang, Cho-Jui Hsieh, Chih-Jen Lin, Xiang-Rui Wang, LIBLINEAR: A Library for Large Linear Classification Journal of Machine Learning Research. ,vol. 9, pp. 1871- 1874 ,(2008)

Kai Yu, Yihong Gong, Tong Zhang, Nonlinear Learning using Local Coordinate Coding neural information processing systems. ,vol. 22, pp. 2223- 2231 ,(2009)

David G. Lowe, Distinctive Image Features from Scale-Invariant Keypoints International Journal of Computer Vision. ,vol. 60, pp. 91- 110 ,(2004) , 10.1023/B:VISI.0000029664.99615.94

S. Lazebnik, C. Schmid, J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories computer vision and pattern recognition. ,vol. 2, pp. 2169- 2178 ,(2006) , 10.1109/CVPR.2006.68

T. Ojala, M. Pietikainen, T. Maenpaa, Multiresolution gray-scale and rotation invariant texture classification with local binary patterns IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 24, pp. 971- 987 ,(2002) , 10.1109/TPAMI.2002.1017623

Cha Zhang, John C. Platt, Paul A. Viola, Multiple Instance Boosting for Object Detection neural information processing systems. ,vol. 18, pp. 1417- 1424 ,(2005)

Videos Semantic Indexing using Image Classification.

来源期刊

我的账户

Videos Semantic Indexing using Image Classification.

来源期刊

相似文章 3

Architecture and protocol of a semantic system designed for video tagging with sensor data in mobile devices.

A compact shot representation for video semantic indexing

A frame-based decision pooling method for video classification

我的账户