SMDP homomorphisms: an algebraic approach to abstraction in semi-Markov decision processes

作者： Balaraman Ravindran , Andrew G. Barto

DOI:

关键词:

摘要: To operate effectively in complex environments learning agents require the ability to selectively ignore irrelevant details and form useful abstractions. In this article we consider question of what constitutes a abstraction stochastic sequential decision problem modeled as semi-Markov Decision Process (SMDPs). We introduce notion SMDP homomorphism argue that it provides tool for rigorous study SMDPs. present an minimization framework factored MDPs based on homomorphisms. also model different classes abstractions arise hierarchical systems. Although use options purposes illustration, ideas are more generally applicable. show conditions employ generalization earlier work by Dietterich applied framework.

参考文章(1)

Richard S. Sutton, Doina Precup, Satinder Singh, Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning Artificial Intelligence. ,vol. 112, pp. 181- 211 ,(1999) , 10.1016/S0004-3702(99)00052-1

SMDP homomorphisms: an algebraic approach to abstraction in semi-Markov decision processes

来源期刊

我的账户

SMDP homomorphisms: an algebraic approach to abstraction in semi-Markov decision processes

来源期刊

相似文章 4

Abstraction and Knowledge Transfer in Reinforcement Learning

Comparing different methods to speed up reinforcement learning in a complex domain

Learning Representation and Control in Markov Decision Processes: New Frontiers

A unifying framework for computational reinforcement learning theory

我的账户