Dynamically constructed (PO)MDPs for adaptive robot planning

作者: Peter Stone , Shiqi Zhang , Piyush Khandelwal

DOI:

关键词: RobotArtificial intelligenceConstruct (python library)Computational complexity theoryDeclarative programmingMarkov decision processDomain (software engineering)Commonsense knowledgeComputer scienceSemantics (computer science)

摘要: To operate in human-robot coexisting environments, intelligent robots need to simultaneously reason with commonsense knowledge and plan under uncertainty. Markov decision processes (MDPs) partially observable MDPs (POMDPs), are good at planning uncertainty toward maximizing long-term rewards; P-LOG, a declarative programming language Answer Set semantics, is strong common-sense reasoning. In this paper, we present novel algorithm called iCORPP dynamically about, construct (PO)MDPs using P-LOG. successfully shields exogenous domain attributes from (PO)MDPs, which limits computational complexity enables adapt the value changes these produce. We conduct number of experimental trials two example problems simulation demonstrate on real robot. Results show significant improvements compared competitive baselines.

参考文章(27)
Jesse Hoey, Pascal Poupart, Craig Boutilier, Alex Mihailidis, POMDP Models for Assistive Technology national conference on artificial intelligence. pp. 294- 314 ,(2012) , 10.4018/978-1-60960-165-2.CH013
Damien Ernst, Arthur Louette, Introduction to Reinforcement Learning MIT Press. ,(1998)
Esra Erdem, Volkan Patoglu, Zeynep G. Saribatur, Integrating hybrid diagnostic reasoning in plan execution monitoring for cognitive factories with multiple robots international conference on robotics and automation. pp. 2007- 2013 ,(2015) , 10.1109/ICRA.2015.7139461
Shiqi Zhang, Mohan Sridharan, Jeremy L. Wyatt, Mixed Logical Inference and Probabilistic Planning for Robots in Unreliable Worlds IEEE Transactions on Robotics. ,vol. 31, pp. 699- 713 ,(2015) , 10.1109/TRO.2015.2422531
Levente Kocsis, Csaba Szepesvári, Bandit Based Monte-Carlo Planning Lecture Notes in Computer Science. pp. 282- 293 ,(2006) , 10.1007/11871842_29
Peter Stone, Shiqi Zhang, CORPP: commonsense reasoning and probabilistic planning, as applied to dialog with a mobile robot national conference on artificial intelligence. pp. 1394- 1400 ,(2015)
Steve Young, Milica Gasic, Blaise Thomson, Jason D. Williams, POMDP-Based Statistical Spoken Dialog Systems: A Review Proceedings of the IEEE. ,vol. 101, pp. 1160- 1179 ,(2013) , 10.1109/JPROC.2012.2225812
Matthew Richardson, Pedro Domingos, Markov logic networks Machine Learning. ,vol. 62, pp. 107- 136 ,(2006) , 10.1007/S10994-006-5833-1