A Bayesian hierarchical model for learning natural scene categories

Authors: Fei-Fei Li, P. Perona

DOI: 10.1109/CVPR.2005.16

Keywords: Caltech 101, Dynamic topic model, Unsupervised learning, Training set, Theme (narrative), LabelMe, Bag-of-words model in computer vision, Machine learning, Computer science, Visual dictionary, Categorization, Natural language processing, Artificial intelligence, Contextual image classification

Abstract: We propose a novel approach to learn and recognize natural scene categories. Unlike previous work, it does not require experts to annotate the training set. We represent the image of a scene by a collection of local regions, denoted as codewords obtained by unsupervised learning. Each region is represented as part of a "theme". In previous work, such themes were learnt from hand-annotations of experts, while our method learns the theme distributions as well as the codewords distribution over the themes without supervision. We report satisfactory categorization performances on a large set of 13 categories of complex scenes.
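
The codeword representation described in the abstract can be illustrated with a minimal sketch. It assumes a codebook is already available (the abstract says only that codewords are obtained by unsupervised learning, so the hard-coded `codebook` and the toy `patches` below are made-up values), and it covers only the quantization step that turns local region descriptors into a bag of codewords, not the hierarchical theme model itself. The function names `nearest_codeword` and `bag_of_codewords` are hypothetical.

```python
from collections import Counter
from typing import List, Sequence

Vector = Sequence[float]


def nearest_codeword(descriptor: Vector, codebook: List[Vector]) -> int:
    """Return the index of the codeword closest to the descriptor
    under squared Euclidean distance."""
    def sq_dist(a: Vector, b: Vector) -> float:
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return min(range(len(codebook)), key=lambda k: sq_dist(descriptor, codebook[k]))


def bag_of_codewords(descriptors: List[Vector], codebook: List[Vector]) -> List[int]:
    """Count how many local regions of one image map to each codeword."""
    counts = Counter(nearest_codeword(d, codebook) for d in descriptors)
    return [counts.get(k, 0) for k in range(len(codebook))]


# Assumption: a tiny 2-D codebook and five patch descriptors, invented for illustration.
codebook = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]
patches = [(0.1, -0.2), (0.9, 1.2), (4.8, 5.1), (1.1, 0.8), (5.2, 4.9)]

histogram = bag_of_codewords(patches, codebook)
print(histogram)  # [1, 2, 2]
```

A count vector of this kind is the sort of input a topic-style model over themes would consume; the descriptor dimensionality and codebook size used in practice would be far larger than in this toy example.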

References (19)
Thomas Leung, Jitendra Malik. Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons. International Journal of Computer Vision, vol. 43, pp. 29-44 (2001). DOI: 10.1023/A:1011126920638
Julia Vogel, Bernt Schiele. A Semantic Typicality Measure for Natural Scene Categorization. Joint Pattern Recognition Symposium, pp. 195-203 (2004). DOI: 10.1007/978-3-540-28649-3_24
Aude Oliva, Antonio Torralba. Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope. International Journal of Computer Vision, vol. 42, pp. 145-175 (2001). DOI: 10.1023/A:1011139631724
Timor Kadir, Michael Brady. Saliency, Scale and Image Description. International Journal of Computer Vision, vol. 45, pp. 83-105 (2001). DOI: 10.1023/A:1012460413855
David M. Blei, Andrew Y. Ng, Michael I. Jordan. Latent Dirichlet Allocation. Journal of Machine Learning Research, vol. 3, pp. 993-1022 (2003). DOI: 10.5555/944919.944937
F. F. Li, R. VanRullen, C. Koch, P. Perona. Rapid natural scene categorization in the near absence of attention. Proceedings of the National Academy of Sciences of the United States of America, vol. 99, pp. 9596-9601 (2002). DOI: 10.1073/PNAS.092277599
Aki Vehtari, David B. Dunson, Andrew Gelman, Hal S. Stern, Donald B. Rubin, John B. Carlin. Bayesian Data Analysis (1995).
Simon Thorpe, Denis Fize, Catherine Marlot. Speed of processing in the human visual system. Nature, vol. 381, pp. 520-522 (1996). DOI: 10.1038/381520A0