Alleviating Feature Confusion for Generative Zero-shot Learning.

作者: Zi Huang , Ke Lu , Lei Zhu , Yang Yang , Jingjing Li

DOI:

关键词:

摘要: Lately, generative adversarial networks (GANs) have been successfully applied to zero-shot learning (ZSL) and achieved state-of-the-art performance. By synthesizing virtual unseen visual features, GAN-based methods convert the challenging ZSL task into a supervised problem. However, train generator on seen categories further apply it instances. An inevitable issue of such paradigm is that synthesized features are prone references incapable reflect novelty diversity real In nutshell, confusing. One cannot tell from ones using features. As result, too subtle be classified in generalized (GZSL) which involves both at test stage. this paper, we first introduce feature confusion issue. Then, propose new generating network, named alleviating GAN (AFC-GAN), challenge Specifically, present boundary loss maximizes decision ones. Furthermore, novel metric score (FCS) proposed quantify confusion. Extensive experiments five widely used datasets verify our method able outperform previous state-of-the-arts under GZSL protocols.

参考文章(38)
Tomas Mikolov, Andrea Frome, Greg S. Corrado, Samy Bengio, Mohammad Norouzi, Yoram Singer, Jonathon Shlens, Jeffrey Dean, Zero-Shot Learning by Convex Combination of Semantic Embeddings international conference on learning representations. ,(2014)
Bernardino Romera-Paredes, Philip H. S. Torr, An embarrassingly simple approach to zero-shot learning international conference on machine learning. pp. 2152- 2161 ,(2015) , 10.1007/978-3-319-50077-5_2
Serge Belongie, Peter Welinder, Pietro Perona, Steve Branson, Catherine Wah, The Caltech-UCSD Birds-200-2011 Dataset California Institute of Technology. ,(2011)
Zeynep Akata, Scott Reed, Daniel Walter, Honglak Lee, Bernt Schiele, Evaluation of output embeddings for fine-grained image classification computer vision and pattern recognition. pp. 2927- 2936 ,(2015) , 10.1109/CVPR.2015.7298911
G. Patterson, J. Hays, SUN attribute database: Discovering, annotating, and recognizing scene attributes computer vision and pattern recognition. pp. 2751- 2758 ,(2012) , 10.1109/CVPR.2012.6247998
, Generative Adversarial Nets neural information processing systems. ,vol. 27, pp. 2672- 2680 ,(2014) , 10.3156/JSOFT.29.5_177_2
Tomas Mikolov, Andrea Frome, Marc'Aurelio Ranzato, Greg S Corrado, Samy Bengio, Jeff Dean, Jon Shlens, DeViSE: A Deep Visual-Semantic Embedding Model neural information processing systems. ,vol. 26, pp. 2121- 2129 ,(2013)
Richard Socher, Milind Ganjoo, Andrew Ng, Christopher D Manning, Zero-Shot Learning Through Cross-Modal Transfer neural information processing systems. ,vol. 26, pp. 935- 943 ,(2013)
C.H. Lampert, H. Nickisch, S. Harmeling, Learning to detect unseen object classes by between-class attribute transfer computer vision and pattern recognition. pp. 951- 958 ,(2009) , 10.1109/CVPRW.2009.5206594
Zeynep Akata, Florent Perronnin, Zaid Harchaoui, Cordelia Schmid, Label-Embedding for Image Classification IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 38, pp. 1425- 1438 ,(2016) , 10.1109/TPAMI.2015.2487986