Deep Reinforcement Learning with Stacked Hierarchical Attention for Text-based Games

作者: Meng Fang , Chengqi Zhang , Joey Tianyi Zhou , Ling Chen , Yali Du

DOI:

关键词: Artificial intelligenceProcess (engineering)Reinforcement learningContext (language use)Structure (mathematical logic)Representation (mathematics)Construct (python library)Natural languageComputer scienceKnowledge graphInference

摘要: We study reinforcement learning (RL) for text-based games, which are interactive simulations in the context of natural language. While different methods have been developed to represent environment information and language actions, existing RL agents not empowered with any reasoning capabilities deal textual games. In this work, we aim conduct explicit knowledge graphs decision making, so that actions an agent generated supported by interpretable inference procedure. propose a stacked hierarchical attention mechanism construct representation process exploiting structure graph. extensively evaluate our method on number man-made benchmark experimental results demonstrate performs better than agents.

参考文章(58)
Volodymyr Mnih, Ioannis Antonoglou, Koray Kavukcuoglu, Daan Wierstra, Martin A. Riedmiller, Alex Graves, David Silver, Playing Atari with Deep Reinforcement Learning arXiv: Learning. ,(2013)
Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh, VQA: Visual Question Answering 2015 IEEE International Conference on Computer Vision (ICCV). pp. 2425- 2433 ,(2015) , 10.1109/ICCV.2015.279
Karthik Narasimhan, Tejas Kulkarni, Regina Barzilay, Language Understanding for Text-based Games using Deep Reinforcement Learning empirical methods in natural language processing. pp. 1- 11 ,(2015) , 10.18653/V1/D15-1001
Léon Bottou, From machine learning to machine reasoning Machine Learning. ,vol. 94, pp. 133- 149 ,(2014) , 10.1007/S10994-013-5335-X
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A Rusu, Joel Veness, Marc G Bellemare, Alex Graves, Martin Riedmiller, Andreas K Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, Demis Hassabis, None, Human-level control through deep reinforcement learning Nature. ,vol. 518, pp. 529- 533 ,(2015) , 10.1038/NATURE14236
Gabor Angeli, Melvin Jose Johnson Premkumar, Christopher D. Manning, Leveraging Linguistic Structure For Open Domain Information Extraction Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). ,vol. 1, pp. 344- 354 ,(2015) , 10.3115/V1/P15-1034
Nolan Miller, Gerda Oldham, Steven Pinker, The Language Instinct: How the Mind Creates Language Antioch Review. ,vol. 52, pp. 534- ,(1994) , 10.2307/4613021
Zichao Yang, Diyi Yang, Chris Dyer, Xiaodong He, Alex Smola, Eduard Hovy, Hierarchical Attention Networks for Document Classification north american chapter of the association for computational linguistics. pp. 1480- 1489 ,(2016) , 10.18653/V1/N16-1174
Fuzheng Zhang, Nicholas Jing Yuan, Defu Lian, Xing Xie, Wei-Ying Ma, Collaborative Knowledge Base Embedding for Recommender Systems knowledge discovery and data mining. pp. 353- 362 ,(2016) , 10.1145/2939672.2939673
Long Chen, Hanwang Zhang, Jun Xiao, Liqiang Nie, Jian Shao, Wei Liu, Tat-Seng Chua, SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). pp. 6298- 6306 ,(2017) , 10.1109/CVPR.2017.667