Discriminative Fusion Approach for Automatic Image Annotation

Authors: De-hong Wang, Sheng Gao, Qi Tian, Wing-kin Sung

DOI: 10.1109/MMSP.2005.248595

Abstract: In this paper, two discriminative fusion schemes are proposed for automatic image annotation (AIA). One is based on ensemble-pattern association and the other on model-based transformation. The fusion approaches are studied and evaluated in a unified AIA framework built on a text representation of image content and MC MFoM learning. The framework is flexible in fusing diverse visual features and multiple modalities, and MC MFoM learning can automatically weight the features most important for classification. We evaluate the approaches on the Corel and TRECVID 2003 datasets. The experimental results clearly show that the fusion approaches give a significant improvement in terms of mean F1 as well as the number of detected concepts.
