Bag-of-visual words based automatic image annotation

作者: Biniyam Kebede , Fekade Getahun

DOI: 10.1109/AFRCON.2015.7331994

关键词:

摘要: Content-based Image retrieval systems extract and retrieve images using their low-level features, such as color, texture, shape. Nevertheless, these visual contents do not allow a user to formulate semantically meaningful image query. annotation are solution solve the inadequacy of CBIR text based retrieval. There have been several studies on automatic utilizing machine learning techniques images' representation with low level features extracted either global or local methods. However, typically, approaches suffer from correlation between globally assigned annotations used obtain automatically. In this paper, we present an approach enhance effectiveness bag word that is created automatically set manually annotated training images. The experimentation performed 4,000 for training, 1000 testing ImageNet. result has shown 77.5% performance accuracy. work believed be one step towards enhancing existing minimizing semantic gap.

参考文章(21)
Yuhong Guo, Xin Li, Active learning with multi-label SVM classification international joint conference on artificial intelligence. pp. 1479- 1485 ,(2013)
Kraisak Kesorn, Multi modal multi-semantic image retrieval Queen Mary, University of London. ,(2010)
Christian Hentschel, Sebastian Stober, Andreas Nürnberger, Marcin Detyniecki, Automatic Image Annotation Using a Visual Dictionary Based on Reliable Image Segmentation Lecture Notes in Computer Science. ,vol. 4918, pp. 45- 56 ,(2008) , 10.1007/978-3-540-79860-6_4
Julia Vogel, Semantic scene modeling and retrieval Selected readings in vision and graphics. ,vol. 033, ,(2004) , 10.3929/ETHZ-A-004878063
Bernard Merialdo, Itheri Yahiaoui, Benoit Huet, Image similarity for automatic video summarization european signal processing conference. pp. 1- 4 ,(2002)
Ishwar K. Sethi, Ioana L. Coman, Daniela Stan, Mining association rules between low-level image features and high-level concepts Proceedings of SPIE. ,vol. 4384, pp. 279- 290 ,(2001) , 10.1117/12.421083
Lei Zhang, Jun Ma, Image annotation by incorporating word correlations into multi-class SVM Soft Computing. ,vol. 15, pp. 917- 927 ,(2011) , 10.1007/S00500-010-0558-2
Julia EunJu Nam, Mauricio Maurer, Klaus Mueller, A high-dimensional feature clustering approach to support knowledge-assisted visualization Computers & Graphics. ,vol. 33, pp. 607- 615 ,(2009) , 10.1016/J.CAG.2009.06.006
Wenbin Yi, Hong Tang, Experimental analysis on classification of unmanned aerial vehicle images using the probabilistic latent semantic analysis International Symposium on Spatial Analysis, Spatial-temporal Data Modeling, and Data Mining. ,vol. 7492, ,(2009) , 10.1117/12.838283
Jun Yang, Yu-Gang Jiang, Alexander G. Hauptmann, Chong-Wah Ngo, Evaluating bag-of-visual-words representations in scene classification Proceedings of the international workshop on Workshop on multimedia information retrieval - MIR '07. pp. 197- 206 ,(2007) , 10.1145/1290082.1290111