Real-time heuristic search with a priority queue

作者: Vadim Bulitko , D. Chris Rayner , Katherine Davison , Jieshan Lu , Kenneth Anderson

DOI:

关键词: Beam stack searchState space searchMathematicsArtificial intelligenceIncremental heuristic searchSearch algorithmMachine learningBest-first searchIterative deepening depth-first searchCombinatorial searchBeam search

摘要: Learning real-time search, which interleaves planning and acting, allows agents to learn from multiple trials respond quickly. Such algorithms require no prior knowledge of the environment can be deployed without pre-processing. We introduce Prioritized-LRTA* (P-LRTA*), a learning search algorithm based on Prioritized Sweeping. P-LRTA* focuses important areas space, where importance state is determined by magnitude updates made neighboring states. Empirical tests path-planning in commercial game maps show substantial speed-up over state-of-the-art algorithms.

参考文章(19)
Nathan Sturtevant, Vadim Bulitko, Maryia Kazakevich, Speeding up learning in real-time search via automatic state abstraction national conference on artificial intelligence. pp. 1349- 1354 ,(2005)
Kumar Iyer, Don M. Dini, Michael van Lent, Paul Carpenter, Building robust planning and execution systems for virtual worlds national conference on artificial intelligence. pp. 29- 35 ,(2006)
Li-Yen Shue, Reza Zamani, An Admissible Heuristic Search Algorithm international syposium on methodologies for intelligent systems. pp. 69- 75 ,(1993) , 10.1007/3-540-56804-2_7
Vadim Bulitko, Learning for Adaptive Real-time Search arXiv: Artificial Intelligence. ,(2004)
Peter Hart, Nils Nilsson, Bertram Raphael, A Formal Basis for the Heuristic Determination of Minimum Cost Paths IEEE Transactions on Systems Science and Cybernetics. ,vol. 4, pp. 100- 107 ,(1968) , 10.1109/TSSC.1968.300136
Andrew G. Barto, Steven J. Bradtke, Satinder P. Singh, Learning to act using real-time dynamic programming Artificial Intelligence. ,vol. 72, pp. 81- 138 ,(1995) , 10.1016/0004-3702(94)00011-O
Richard E. Korf, Real-time heuristic search Artificial Intelligence. ,vol. 42, pp. 189- 211 ,(1990) , 10.1016/0004-3702(90)90054-4
Andrew W. Moore, Christopher G. Atkeson, Prioritized Sweeping: Reinforcement Learning with Less Data and Less Time Machine Learning. ,vol. 13, pp. 103- 130 ,(1993) , 10.1023/A:1022635613229
Sven Koenig, Craig Tovey, Yuri Smirnov, Performance bounds for planning in unknown terrain Artificial Intelligence. ,vol. 147, pp. 253- 279 ,(2003) , 10.1016/S0004-3702(03)00062-6
Vadim Bulitko, Greg Lee, Learning in real-time search: a unifying framework Journal of Artificial Intelligence Research. ,vol. 25, pp. 119- 157 ,(2006) , 10.1613/JAIR.1789