On the undecidability of probabilistic planning and infinite-horizon partially observable Markov decision problems

作者: Anne Condon , Omid Madani , Steve Hanks

DOI:

关键词:

摘要: We investigate the computability of problems in probabilistic planning and partially observable infinite-horizon Markov decision processes. The undecidability string-existence problem for finite automata is adapted to show that following plan existence undecidable: given a problem, determine whether there exists with success probability exceeding desirable threshold. Analogous policy-existence processes under discounted undiscounted total reward models, average-reward state-avoidance models are all shown be undecidable. results apply corresponding approximation as well.

参考文章(20)
Michael L. Littman, Probabilistic propositional planning: representations and complexity national conference on artificial intelligence. pp. 748- 754 ,(1997)
Martin Mundhenk, Judy Goldsmith, Eric Allender, The Complexity of Policy Evaluation for Finite-Horizon Partially-Observable Markov Decision Processes mathematical foundations of computer science. pp. 129- 138 ,(1997) , 10.1007/BFB0029956
Michael Lederman Littman, Algorithms for Sequential Decision Making Brown University. ,(1996)
Rūsiņš Freivalds, Probabilistic Two-Way Machines mathematical foundations of computer science. pp. 33- 45 ,(1981) , 10.1007/3-540-10856-4_72
J. Goldsmith, M. Mundhenk, Complexity issues in Markov decision processes conference on computational complexity. pp. 272- 280 ,(1998) , 10.1109/CCC.1998.694621
M. L. Littman, J. Goldsmith, M. Mundhenk, The computational complexity of probabilistic planning Journal of Artificial Intelligence Research. ,vol. 9, pp. 1- 36 ,(1998) , 10.1613/JAIR.505
Christos H. Papadimitriou, John N. Tsitsiklis, The Complexity of Markov Decision Processes Mathematics of Operations Research. ,vol. 12, pp. 441- 450 ,(1987) , 10.1287/MOOR.12.3.441
Nicholas Kushmerick, Steve Hanks, Daniel S. Weld, An algorithm for probabilistic planning Artificial Intelligence. ,vol. 76, pp. 239- 286 ,(1995) , 10.1016/0004-3702(94)00087-H