Situation Recognition: Visual Semantic Role Labeling for Image Understanding

Authors: Mark Yatskar, Luke Zettlemoyer, Ali Farhadi

DOI: 10.1109/CVPR.2016.597

Abstract: This paper introduces situation recognition, the problem of producing a concise summary of the situation an image depicts, including: (1) the main activity (e.g., clipping), (2) the …

References (50)
Paul Kingsbury, Martha Palmer. From Treebank to PropBank. Language Resources and Evaluation (2002).
Amir Roshan Zamir, Khurram Soomro, Mubarak Shah. UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild. arXiv: Computer Vision and Pattern Recognition (2012).
M. Hodosh, P. Young, J. Hockenmaier. Framing image description as a ranking task: data, models and evaluation metrics. Journal of Artificial Intelligence Research, vol. 47, pp. 853-899 (2013). DOI: 10.1613/JAIR.3994
Eunbyung Park, Tamara L. Berg, Licheng Yu, Alexander C. Berg. Visual Madlibs: Fill in the blank Image Generation and Question Answering. arXiv: Computer Vision and Pattern Recognition (2015).
Dieu-Thu Le, Jasper Uijlings, Raffaella Bernardi. TUHOI: Trento Universal Human Object Interaction Dataset. Proceedings of the Third Workshop on Vision and Language, pp. 17-24 (2014). DOI: 10.3115/V1/W14-5403
Zhiheng Huang, Junhua Mao, Haoyuan Gao, Lei Wang, Wei Xu, Jie Zhou. Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Answering. arXiv: Computer Vision and Pattern Recognition (2015).
Jitendra Malik, Saurabh Gupta. Visual Semantic Role Labeling. arXiv: Computer Vision and Pattern Recognition (2015).
Abhinav Gupta, Larry S. Davis. Beyond Nouns: Exploiting Prepositions and Comparative Adjectives for Learning Visual Classifiers. European Conference on Computer Vision, pp. 16-29 (2008). DOI: 10.1007/978-3-540-88682-2_3
Li Fei-Fei, Yuke Zhu, Christopher Ré, Ce Zhang. Building a Large-scale Multimodal Knowledge Base for Visual Question Answering (2015).
Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, C. Lawrence Zitnick. Microsoft COCO: Common Objects in Context. Computer Vision – ECCV 2014, pp. 740-755 (2014). DOI: 10.1007/978-3-319-10602-1_48