作者: Ram Nevatia , Liang-Chieh Chen , Kan Chen , Haoyuan Gao , Jiang Wang
DOI:
关键词: Natural language processing 、 Question answering 、 Convolutional neural network 、 Feature (computer vision) 、 Deep learning 、 Computer science 、 Natural language 、 Image (mathematics) 、 Benchmark (computing) 、 Semantics 、 Artificial intelligence 、 Machine learning
摘要: We propose a novel attention based deep learning architecture for visual question answering task (VQA). Given an image and an image related natural language question, VQA …