Decision tree methods for finding reusable MDP homomorphisms

作者： Andrew G. Barto , Alicia Peregrin Wolfe

DOI:

关键词: Class (computer programming) 、 Computer science 、 State space 、 Bellman equation 、 Decision tree 、 Sample (statistics) 、 Artificial intelligence 、 Homomorphism 、 State (functional analysis)

摘要: State abstraction is a useful tool for agents interacting with complex environments. Good state abstractions are compact, reuseable, and easy to learn from sample data. This paper combines extends two existing classes of methods achieve these criteria. The first class search MDP homomorphisms (Ravindran 2004), which produce models reward transition probabilities in an abstract space. second methods, like the UTree algorithm (McCallum 1995), compact value function quickly Models based on can easily be extended such that they usable across tasks similar functions. However, cannot this fashion. We present results showing new, combined fulfills all three criteria: resulting learned data, used

uni-trier.de 本地加速

aaai.org 本地加速

aaai.org PDF 下载加速

参考文章(1)

Richard S. Sutton, Doina Precup, Satinder Singh, Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning Artificial Intelligence. ,vol. 112, pp. 181- 211 ,(1999) , 10.1016/S0004-3702(99)00052-1

Decision tree methods for finding reusable MDP homomorphisms

来源期刊

我的账户

Decision tree methods for finding reusable MDP homomorphisms

来源期刊

相似文章 2

Abstraction and Knowledge Transfer in Reinforcement Learning

Reinforcement Learning Transfer Based on Subgoal Discovery and Subtask Similarity Hao Wang Shunguo Fan Jinhua Song Yang Gao Xingguo Chen

我的账户