ZSCRGAN: A GAN-based Expectation Maximization Model for Zero-Shot Retrieval of Images from Textual Descriptions

作者: Kripabandhu Ghosh , Saptarshi Ghosh , Anurag Roy , Vinay Kumar Verma

DOI:

关键词: Training setComputer sciencePattern recognitionImage retrievalShot (filmmaking)Representation (mathematics)Expectation–maximization algorithmBenchmark (computing)Hash functionArtificial intelligenceZero (linguistics)

摘要: Most existing algorithms for cross-modal Information Retrieval are based on a supervised train-test setup, where a model learns to align the mode of the query (eg, text) to the mode of …

参考文章(36)
Serge Belongie, Peter Welinder, Florian Schroff, Pietro Perona, Steve Branson, Takeshi Mita, Catherine Wah, Caltech-UCSD Birds 200 California Institute of Technology. ,(2010)
Yoshua Bengio, Hugo Larochelle, Dumitru Erhan, Zero-data learning of new tasks national conference on artificial intelligence. pp. 646- 651 ,(2008)
Tomas Mikolov, Andrea Frome, Greg S. Corrado, Samy Bengio, Mohammad Norouzi, Yoram Singer, Jonathon Shlens, Jeffrey Dean, Zero-Shot Learning by Convex Combination of Semantic Embeddings international conference on learning representations. ,(2014)
Max Welling, Diederik P Kingma, Auto-Encoding Variational Bayes international conference on learning representations. ,(2014)
, Generative Adversarial Nets neural information processing systems. ,vol. 27, pp. 2672- 2680 ,(2014) , 10.3156/JSOFT.29.5_177_2
Nikhil Rasiwasia, Jose Costa Pereira, Emanuele Coviello, Gabriel Doyle, Gert R.G. Lanckriet, Roger Levy, Nuno Vasconcelos, A new approach to cross-modal multimedia retrieval Proceedings of the international conference on Multimedia - MM '10. pp. 251- 260 ,(2010) , 10.1145/1873951.1873987
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, Li Fei-Fei, ImageNet: A large-scale hierarchical image database computer vision and pattern recognition. pp. 248- 255 ,(2009) , 10.1109/CVPR.2009.5206848
Richard Socher, Milind Ganjoo, Andrew Ng, Christopher D Manning, Zero-Shot Learning Through Cross-Modal Transfer neural information processing systems. ,vol. 26, pp. 935- 943 ,(2013)
Simon Osindero, Mehdi Mirza, Conditional Generative Adversarial Nets arXiv: Learning. ,(2014)
Mark Palatucci, Dean Pomerleau, Geoffrey E Hinton, Tom M Mitchell, None, Zero-shot Learning with Semantic Output Codes neural information processing systems. ,vol. 22, pp. 1410- 1418 ,(2009) , 10.1184/R1/6476456.V1