Language Understanding for Text-based Games using Deep Reinforcement Learning

作者: Karthik Narasimhan , Tejas Kulkarni , Regina Barzilay

DOI: 10.18653/V1/D15-1001

关键词:

摘要: In this paper, we consider the task of learning control policies for text-based games. these games, all interactions in virtual world are through text and underlying state is not observed. The resulting language barrier makes such environments challenging automatic game players. We employ a deep reinforcement framework to jointly learn representations action using rewards as feedback. This enables us map descriptions into vector that capture semantics states. evaluate our approach on two worlds, comparing against baselines bag-ofwords bag-of-bigrams representations. Our algorithm outperforms both worlds demonstrating importance expressive 1

参考文章(27)
Cynthia Matuszek, Evan Herbst, Luke Zettlemoyer, Dieter Fox, Learning to Parse Natural Language Commands to a Robot Control System Experimental Robotics. pp. 403- 415 ,(2013) , 10.1007/978-3-319-00065-7_28
István Szita, Reinforcement Learning in Games Reinforcement Learning. pp. 539- 577 ,(2012) , 10.1007/978-3-642-27645-3_17
Damien Ernst, Arthur Louette, Introduction to Reinforcement Learning MIT Press. ,(1998)
Pavel Curtis, Mudding: Social phenomena in text-based virtual realities. Lawrence Erlbaum Associates Publishers. ,(1997)
S.R.K. Branavan, D. Silver, R. Barzilay, Learning to Win by Reading Manuals in a Monte-Carlo Framework meeting of the association for computational linguistics. ,vol. 43, pp. 268- 277 ,(2011) , 10.1613/JAIR.3484
Christopher Amato, Guy Shani, High-level reinforcement learning in strategy games adaptive agents and multi-agents systems. pp. 75- 82 ,(2010) , 10.5555/1838206.1838217
Tomas Mikolov, Greg S. Corrado, Kai Chen, Jeffrey Dean, Efficient Estimation of Word Representations in Vector Space international conference on learning representations. ,(2013)
Richard Sutton, Martin Müller, David Silver, Reinforcement learning of local shape in the game of go international joint conference on artificial intelligence. pp. 1053- 1058 ,(2007)
Jacob Andreas, Dan Klein, Alignment-Based Compositional Semantics for Instruction Following empirical methods in natural language processing. pp. 1165- 1174 ,(2015) , 10.18653/V1/D15-1138
Jacob Eisenstein, James Clarke, Dan Goldwasser, Dan Roth, Reading to Learn: Constructing Features from Semantic Abstracts empirical methods in natural language processing. pp. 958- 967 ,(2009) , 10.3115/1699571.1699637