Planning with deadlines in stochastic domains

作者: Leslie Pack Kaelbling , Thomas Dean , Ann Nicholson , Jak Kirman

DOI:

关键词:

摘要: We provide a method, based on the theory of Markov decision problems, for efficient planning in stochastic domains. Goals are encoded as reward functions, expressing desirability each world state; planner must find policy (mapping from states to actions) that maximizes future rewards. Standard goals achievement, well maintenance and prioritized combinations goals, can be specified this way. An optimal found using existing methods, but these methods at best polynomial number domain, where is exponential propositions (or state variables). By information about starting state, function, transition probabilities we restrict planner's attention set likely encountered satisfying goal. Furthermore, generate more or less complete plans depending time it has available. describe experiments involving mobile robotics application consider problem schedulilng different phases algorithm given constraints.

参考文章(11)
Eric J. Horvitz, Reasoning under varying and uncertain resource constraints national conference on artificial intelligence. pp. 111- 116 ,(1988)
Mark Boddy, Anytime problem solving using dynamic programming national conference on artificial intelligence. pp. 738- 743 ,(1991)
Thomas Dean, Leslie Pack Kaelbling, Jak Kirman, Ann Nicholson, Deliberation scheduling for time-critical sequential decision making uncertainty in artificial intelligence. pp. 309- 316 ,(1993) , 10.1016/B978-1-4832-1451-1.50042-1
M. J. Schoppers, Universal plans for reactive robots in unpredictable environments international joint conference on artificial intelligence. pp. 1039- 1046 ,(1987)
Richard E Fikes, Peter E Hart, Nils J Nilsson, Learning and executing generalized robot plans Artificial Intelligence. ,vol. 3, pp. 485- 503 ,(1993) , 10.1016/0004-3702(72)90051-3
Thomas Dean, Mark Boddy, An analysis of time-dependent planning national conference on artificial intelligence. pp. 49- 54 ,(1988)
John Bresina, Mark Drummond, Anytime synthetic projection: maximizing the probability of goal satisfaction national conference on artificial intelligence. pp. 138- 144 ,(1990)
K. A. Bush, J. G. Kemeny, J. L. Snell, Finite Markov Chains. American Mathematical Monthly. ,vol. 67, pp. 1039- ,(1960) , 10.2307/2309264