Semi-Supervised Multitask Learning for Scene Recognition

Authors: Xiaoqiang Lu, Xuelong Li, Lichao Mou

DOI: 10.1109/TCYB.2014.2362959

Abstract: Scene recognition has been widely studied to understand visual information from the level of objects and their relationships. Many methods have been proposed for scene recognition. They, however, have difficulty improving accuracy, mainly due to two limitations: 1) a lack of analysis of the intrinsic relationships across different scales, say, the initial input and its down-sampled versions, and 2) the existence of redundant features. This paper develops a semi-supervised learning mechanism to reduce the above two limitations. To address the first limitation, we propose a multitask model to integrate scene images of different resolutions. For the second limitation, we build a model of sparse feature selection-based manifold regularization (SFSMR) to select the optimal features and preserve the underlying manifold structure of the data. SFSMR combines the advantages of sparse feature selection and manifold regularization. Finally, we link the multitask model with SFSMR to form the proposed semi-supervised learning method. Experimental results report improvements in scene recognition accuracy.
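As a rough illustration of the "initial input and its down-sampled versions" mentioned in the abstract, the following minimal sketch builds a small set of image scales, each of which could serve as the input to one task in a multitask model. The abstract does not say how the down-sampling is performed or how many scales are used, so the 2x2 average pooling, the default of three scales, and the function names `downsample_2x` and `build_scales` are all assumptions made for illustration only.

```python
import numpy as np


def downsample_2x(image):
    """Halve height and width by averaging non-overlapping 2x2 blocks.

    Assumption: average pooling stands in for whatever down-sampling
    the paper actually uses; the abstract does not specify it.
    """
    h, w = image.shape[:2]
    h2, w2 = h - h % 2, w - w % 2  # drop a trailing odd row/column
    cropped = np.asarray(image[:h2, :w2], dtype=np.float64)
    # Split each spatial axis into (blocks, 2) and average over the 2s.
    blocks = cropped.reshape(h2 // 2, 2, w2 // 2, 2, *image.shape[2:])
    return blocks.mean(axis=(1, 3))


def build_scales(image, num_scales=3):
    """Return [original, 1/2 resolution, 1/4 resolution, ...].

    Hypothetical helper: each entry would be treated as the input
    of one task in a multitask setting.
    """
    scales = [np.asarray(image, dtype=np.float64)]
    for _ in range(num_scales - 1):
        scales.append(downsample_2x(scales[-1]))
    return scales


if __name__ == "__main__":
    img = np.arange(64).reshape(8, 8)  # toy grayscale "image"
    for s in build_scales(img):
        print(s.shape)
```

Features extracted from each scale could then be passed to a shared learner; how the paper couples the scales and applies SFSMR is not recoverable from the abstract alone.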
