Automatic image annotation based on Gaussian mixture model considering cross-modal correlations

作者: Dongping Tian , Zhongzhi Shi

DOI: 10.1016/J.JVCIR.2017.01.015

关键词:

摘要: A unified framework is developed for image annotation based on GMM and MB.Automatic transformed into a graph partitioning problem.GMM fitted by the RPEM algorithm utilized to estimate initial annotations.Integrating label visual similarities of images associated with labels.Max-bisection implemented rank-two relaxation heuristics. Automatic has been an active topic research in field computer vision pattern recognition decades. In this paper, we present new method automatic Gaussian mixture model (GMM) considering cross-modal correlations. To be specific, first employ rival penalized expectation-maximization (RPEM) posterior probabilities each keyword. Next, similarity constructed weighted linear combination seamlessly integrating information from both low level features high semantic concepts together, which can effectively avoid phenomenon that different same candidate annotations would obtain refinement results. Followed heuristics over built applied further mine correlation so as capture refining results, plays crucial role retrieval. The main contributions work summarized follows: (1) Exploiting trained images. (2) corresponding labels. (3) Refining set generated through solving max-bisection graph. Compared current competitive SGMM-RW, achieve significant improvements 4% 5% precision, 6% 9% recall Corel5k Mirflickr25k, respectively.

参考文章(69)
Yuchai Wan, Xiabi Liu, Kunqi Tong, Xue Wei, Yi Wu, Fei Guan, Kunpeng Pang, GMM-ClusterForest: a novel indexing approach for multi-features based similarity search in high-dimensional spaces international conference on neural information processing. pp. 210- 217 ,(2012) , 10.1007/978-3-642-34481-7_26
Xin Luo, Kenji Kita, Region-Based Image Annotation Using Gaussian Mixture Model Lecture Notes in Electrical Engineering. pp. 503- 510 ,(2013) , 10.1007/978-3-642-34531-9_53
Yanjie Wang, Xiabi Liu, Yunde Jia, Automatic image annotation with cooperation of concept-specific and universal visual vocabularies conference on multimedia modeling. pp. 262- 272 ,(2010) , 10.1007/978-3-642-11301-7_28
Mohammad Valipour, Temperature analysis of reference evapotranspiration models Meteorological Applications. ,vol. 22, pp. 385- 394 ,(2015) , 10.1002/MET.1465
P. Duygulu, K. Barnard, J. F. G. de Freitas, D. A. Forsyth, Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary european conference on computer vision. ,vol. 2353, pp. 97- 112 ,(2002) , 10.1007/3-540-47979-1_7
Ameesh Makadia, Vladimir Pavlovic, Sanjiv Kumar, A New Baseline for Image Annotation Lecture Notes in Computer Science. pp. 316- 329 ,(2008) , 10.1007/978-3-540-88690-7_24
David M Blei, Andrew Y Ng, Michael I Jordan, None, Latent dirichlet allocation Journal of Machine Learning Research. ,vol. 3, pp. 993- 1022 ,(2003) , 10.5555/944919.944937
Yohan Jin, Latifur Khan, B. Prabhakaran, Knowledge Based Image Annotation Refinement Journal of Signal Processing Systems. ,vol. 58, pp. 387- 406 ,(2010) , 10.1007/S11265-009-0391-Y
Guangyu Zhu, Shuicheng Yan, Yi Ma, Image tag refinement towards low-rank, content-tag prior and error sparsity Proceedings of the international conference on Multimedia - MM '10. pp. 461- 470 ,(2010) , 10.1145/1873951.1874028
Rui Zhang, Lei Zhang, Xin-Jing Wang, Ling Guan, Multi-feature pLSA for combining visual features in image annotation Proceedings of the 19th ACM international conference on Multimedia - MM '11. pp. 1513- 1516 ,(2011) , 10.1145/2072298.2072053