Videos Semantic Indexing using Image Classification.

作者: Fengjun Lv , Shenghuo Zhu , Mert Dikmen , Yuanqing Lin , Thomas S. Huang

DOI:

关键词: Computer scienceTRECVIDSoftwareSupport vector machineSearch engine indexingArtificial intelligencePattern recognitionNISTLearning to rankContextual image classificationCoding (social sciences)

摘要: This notebook paper summarizes Team NEC-UIUC’s approaches for TRECVid 2010 Evaluation of Semantic Indexing. Our submissions mainly take advantage advanced image classification methods using linear coordinate coding (LCC) local features powered by the distributed computing software Hadoop. For every video shot, we evenly sample key frames and extract dense including DHOG LBP, which are encoded coding. Then, concept large-scale SVM classifiers trained based on spatial pyramid LCC features. Finally, employ multiple instance learning to rank shots according scores individual frames. systems achieve mean extended inferred average precision (mean xinfAP) 7.40% 30 concepts evaluated NIST 28.63% 1/5 development data as validation set total 130 concepts.

参考文章(8)
Hsiang-Fu Yu, Cho-Jui Hsieh, Kai-Wei Chang, Chih-Jen Lin, Large linear classification when data cannot fit in memory knowledge discovery and data mining. pp. 833- 842 ,(2010) , 10.1145/1835804.1835910
B. T. Polyak, A. B. Juditsky, Acceleration of stochastic approximation by averaging Siam Journal on Control and Optimization. ,vol. 30, pp. 838- 855 ,(1992) , 10.1137/0330046
Rong-En Fan, Kai-Wei Chang, Cho-Jui Hsieh, Chih-Jen Lin, Xiang-Rui Wang, LIBLINEAR: A Library for Large Linear Classification Journal of Machine Learning Research. ,vol. 9, pp. 1871- 1874 ,(2008)
Kai Yu, Yihong Gong, Tong Zhang, Nonlinear Learning using Local Coordinate Coding neural information processing systems. ,vol. 22, pp. 2223- 2231 ,(2009)
David G. Lowe, Distinctive Image Features from Scale-Invariant Keypoints International Journal of Computer Vision. ,vol. 60, pp. 91- 110 ,(2004) , 10.1023/B:VISI.0000029664.99615.94
S. Lazebnik, C. Schmid, J. Ponce, Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories computer vision and pattern recognition. ,vol. 2, pp. 2169- 2178 ,(2006) , 10.1109/CVPR.2006.68
T. Ojala, M. Pietikainen, T. Maenpaa, Multiresolution gray-scale and rotation invariant texture classification with local binary patterns IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 24, pp. 971- 987 ,(2002) , 10.1109/TPAMI.2002.1017623
Cha Zhang, John C. Platt, Paul A. Viola, Multiple Instance Boosting for Object Detection neural information processing systems. ,vol. 18, pp. 1417- 1424 ,(2005)