Tag localization with spatial correlations and joint group sparsity

作者: Yang Yang , Yi Yang , Zi Huang , Heng Tao Shen , Feiping Nie

DOI: 10.1109/CVPR.2011.5995499

关键词:

摘要: Nowadays numerous social images have been emerging on the Web. How to precisely label these is critical image retrieval. However, traditional image-level tagging methods may become less effective because global matching approaches can hardly cope with diversity and arbitrariness of Web content. This raises an urgent need for fine-grained schemes. In this work, we study how establish mapping between tags regions, i.e. localize so as better depict index content images. We propose spatial group sparse coding (SGSC) by extending robust encoding ability correlations among training regions. present in a two-dimensional space design group-specific kernels produce more interpretable regularizer. Further joint version SGSC model which able simultaneously encode intrinsically related regions within test image. An algorithm developed optimize objective function Joint SGSC. The tag localization task conducted propagating from sparsely selected groups target according reconstruction coefficients. Extensive experiments three public datasets illustrate that our proposed models achieve great performance improvements over state-of-the-art method task.

参考文章(25)
Shuiwang Ji, Jun Liu, Jieping Ye, SLEP: Sparse Learning with Efficient Projections ,(2011)
Hugo Jair Escalante, Carlos A. Hernández, Jesus A. Gonzalez, A. López-López, Manuel Montes, Eduardo F. Morales, L. Enrique Sucar, Luis Villaseñor, Michael Grubinger, The segmented and annotated IAPR TC-12 benchmark Computer Vision and Image Understanding. ,vol. 114, pp. 419- 428 ,(2010) , 10.1016/J.CVIU.2009.03.008
Jinhui Yuan, Jianmin Li, Bo Zhang, Exploiting spatial context constraints for automatic image region annotation Proceedings of the 15th international conference on Multimedia - MULTIMEDIA '07. pp. 595- 604 ,(2007) , 10.1145/1291233.1291379
Shenghua Gao, Ivor Wai-Hung Tsang, Liang-Tien Chia, Peilin Zhao, Local features are not lonely – Laplacian sparse coding for image classification computer vision and pattern recognition. pp. 3555- 3561 ,(2010) , 10.1109/CVPR.2010.5539943
Shaoting Zhang, Junzhou Huang, Yuchi Huang, Yang Yu, Hongsheng Li, Dimitris N. Metaxas, Automatic image annotation using group sparsity computer vision and pattern recognition. pp. 3312- 3319 ,(2010) , 10.1109/CVPR.2010.5540036
Timo Ojala, Matti Pietikäinen, David Harwood, A comparative study of texture measures with classification based on featured distributions Pattern Recognition. ,vol. 29, pp. 51- 59 ,(1996) , 10.1016/0031-3203(95)00067-4
Charles A. Micchelli, Interpolation of Scattered Data: Distance Matrices and Conditionally Positive Definite Functions Approximation Theory and Spline Functions. ,vol. 2, pp. 143- 145 ,(1984) , 10.1007/978-94-009-6466-2_7
Jamie Shotton, John Winn, Carsten Rother, Antonio Criminisi, TextonBoost for Image Understanding: Multi-Class Object Recognition and Segmentation by Jointly Modeling Texture, Layout, and Context International Journal of Computer Vision. ,vol. 81, pp. 2- 23 ,(2009) , 10.1007/S11263-007-0109-1
Jianchao Yang, Kai Yu, Yihong Gong, Thomas Huang, Linear spatial pyramid matching using sparse coding for image classification computer vision and pattern recognition. pp. 1794- 1801 ,(2009) , 10.1109/CVPR.2009.5206757
Stefan Siersdorfer, Jose San Pedro, Mark Sanderson, Automatic video tagging using content redundancy international acm sigir conference on research and development in information retrieval. pp. 395- 402 ,(2009) , 10.1145/1571941.1572010