Robot Sequential Decision Making using LSTM-based Learning and Logical-probabilistic Reasoning.

Authors: Mohammad Shokrolah Shirazi, Shiqi Zhang, Saeid Amiri

DOI:

Keywords:

Abstract: Sequential decision-making (SDM) plays a key role in intelligent robotics, and can be realized in very different ways, such as supervised learning, automated reasoning, and probabilistic planning. The three families of methods follow different assumptions and have different (dis)advantages. In this work, we aim at a robot SDM framework that exploits the complementary features of learning, reasoning, and planning. We utilize long short-term memory (LSTM) for passive state estimation with streaming sensor data, and commonsense reasoning and probabilistic planning (CORPP) for active information collection and task accomplishment. In experiments, a mobile robot is tasked with estimating human intentions using their motion trajectories, declarative contextual knowledge, and human-robot interaction (dialog-based and motion-based). Results suggest that our framework performs better than its no-learning and no-reasoning versions in a real-world office environment.
