Probabilistic optimized ranking for multimedia semantic concept detection via RVM

作者: Yan-Tao Zheng , Shi-Yong Neo , Tat-Seng Chua , Qi Tian

DOI: 10.1145/1386352.1386378

关键词:

摘要: We present a probabilistic ranking-driven classifier for the detection of video semantic concept, such as airplane, building, etc. Most existing concept systems utilize Support Vector Machines (SVM) to perform and ranking retrieved shots. However, margin maximization principle SVM does not optimization but merely classification error minimization. To tackle this problem, we exploit sparse Bayesian kernel model, namely relevance vector machine (RVM), detection. Based on automatic determination principle, RVM outputs posterior prediction concepts. This inference output is optimal target shots, according Probabilistic Ranking Principle. The probability individual uni-modal features also facilitates fusion multi-modal evidences minimize Bayes risk. demonstrate both theoretically empirically that outperforms testings TRECVID 07 dataset show produces statically significant improvements in MAP scores over SVM-based methods.

参考文章(32)
Xiao-Yong Wei, Hung-Khoon Tan, Wanlei Zhao, Yu-Gang Jiang, Chong-Wah Ngo, Xiao Wu, Feng Wang, Experimenting VIREO-374: Bag-of-Visual-Words and Visual-Based Ontology for Semantic Video Indexing and search. TRECVID. ,(2007)
Tao Mei, Linjun Yang, Xun Yuan, Jinhui Tang, Wei Lai, Xian-Sheng Hua, Zhiwei Gu, Guo-Jun Qi, Meng Wang, Zheng Lu, Jingjing Liu, Yuan Liu, Zheng-Jun Zha, MSRA-USTC-SJTU AT TRECVID 2007: HIGH-LEVEL FEATURE EXTRACTION AND SEARCH TRECVID. ,(2007)
Ling Zhang, Bo Zhang, Relationship between support vector set and kernel functions in SVM Journal of Computer Science and Technology. ,vol. 17, pp. 549- 555 ,(2002) , 10.1007/BF02948823
Geoffrey Hinton, Radford M. Neal, Bayesian learning for neural networks ,(1995)
Michael G. Christel, Rong Yan, Alexander G. Hauptmann, Jie Yang, Wei-Hao Lin, D. Das, Ming-yu Chen, Xiao Wu, Gerhard Backfried, Multi-Lingual Broadcast News Retrieval TRECVID. ,(2007)
C. Dorai, S. Venkatesh, Bridging the semantic gap with computational media aesthetics IEEE MultiMedia. ,vol. 10, pp. 15- 17 ,(2003) , 10.1109/MMUL.2003.1195157
Cees G. M. Snoek, Marcel Worring, Arnold W. M. Smeulders, Early versus late fusion in semantic video analysis Proceedings of the 13th annual ACM international conference on Multimedia - MULTIMEDIA '05. pp. 399- 402 ,(2005) , 10.1145/1101149.1101236
Gordon V. Cormack, Thomas R. Lynam, Validity and power of t-test for comparing MAP and GMAP international acm sigir conference on research and development in information retrieval. pp. 753- 754 ,(2007) , 10.1145/1277741.1277892