Learning measures of progress for planning domains

作者: Robert Givan , Alan Fern , SungWook Yoon

DOI:

关键词: Artificial intelligenceRange (mathematics)HeuristicsRepresentation (mathematics)HeuristicHyper-heuristicComputer scienceSemi-supervised learningSet (psychology)Syntax (programming languages)Generalization error

摘要: We study an approach to learning heuristics for planning domains from example solutions. There has been little work on the types of used in deterministic and stochastic competitions. Perhaps one reason this is challenge providing a compact heuristic language that facilitates learning. Here we introduce new representation based lists set expressions described using taxonomic syntax. Next, review idea measure progress (parmar 2002), which any guaranteed be improvable at every state. take finding as our goal, describe simple algorithm purpose. evaluate across range planning-competition domains. The results show often greedily following learned highly effective. also can combined with rule-based policies, producing still stronger results.

参考文章(13)
Hector Geffner, Mario Martin, Learning generalized policies in planning using concept languages principles of knowledge representation and reasoning. pp. 667- 677 ,(2000)
Roni Khardon, Learning to take actions national conference on artificial intelligence. pp. 787- 792 ,(1996)
Claude Sammut, Scott Hurst, Dana Kedzier, Donald Michie, Learning to fly international conference on machine learning. pp. 385- 393 ,(1992) , 10.1016/B978-1-55860-247-2.50055-3
Bart Selman, Henry A. Kautz, Yi-Cheng Huang, Learning Declarative Control Rules for Constraint-BAsed Planning international conference on machine learning. pp. 415- 422 ,(2000)
J. Hoffmann, B. Nebel, The FF planning system: fast plan generation through heuristic search Journal of Artificial Intelligence Research. ,vol. 14, pp. 253- 302 ,(2001) , 10.1613/JAIR.855
Johannes Fürnkranz, Peter A. Flach, An analysis of rule evaluation metrics international conference on machine learning. pp. 202- 209 ,(2003)
Robert Givan, Alan Fern, SungWook Yoon, Inductive policy selection for first-order MDPs uncertainty in artificial intelligence. pp. 568- 576 ,(2002)
Roni Khardon, Learning action strategies for planning domains Artificial Intelligence. ,vol. 113, pp. 125- 148 ,(1999) , 10.1016/S0004-3702(99)00060-0
Aarati Parmar, A logical measure of progress for planning national conference on artificial intelligence. pp. 498- 505 ,(2002) , 10.5555/777092.777170
Eduardo F. Morales, Claude Sammut, Learning to fly by combining reinforcement learning with behavioural cloning Twenty-first international conference on Machine learning - ICML '04. pp. 76- ,(2004) , 10.1145/1015330.1015384