Learning Hybrid Object Kinematics for Efficient Hierarchical Planning Under Uncertainty

作者: Scott Niekum , Ajinkya Jain

DOI:

关键词: Motion (physics)Hybrid automatonInferenceObject (computer science)Task (computing)Computer scienceDecomposition (computer science)Machine learningPartially observable Markov decision processReinforcement learningArtificial intelligenceKinematics

摘要: Sudden changes in the dynamics of robotic tasks, such as contact with an object or latching a door, are often viewed inconvenient discontinuities that make manipulation difficult. However, when these transitions well-understood, they can be leveraged to reduce uncertainty aid manipulation---for example, wiggling screw determine if it is fully inserted not. Current model-free reinforcement learning approaches require large amounts data learn leverage dynamics, scale poorly problem complexity grows, and do not transfer well significantly different problems. By contrast, hierarchical POMDP planning-based methods via plan decomposition, work on novel problems, directly consider uncertainty, but rely precise hand-specified models task decompositions. To combine advantages opposing paradigms, we propose new method, MICAH, which given unsegmented object's motion under applied actions, (1) detects changepoints model using action-conditional inference, (2) estimates individual local their parameters, (3) converts them into hybrid automaton compatible planning. We show MICAH more accurate robust noise than prior approaches. Further, planner demonstrate learned rich enough used for performing tasks objects ways encountered during training.

参考文章(32)
J. Sturm, C. Stachniss, W. Burgard, A probabilistic framework for learning kinematic models of articulated objects Journal of Artificial Intelligence Research. ,vol. 41, pp. 477- 526 ,(2011) , 10.1613/JAIR.3229
Scott Niekum, Sarah Osentoski, Christopher G. Atkeson, Andrew G. Barto, Online Bayesian changepoint detection for articulated motion models international conference on robotics and automation. pp. 1468- 1475 ,(2015) , 10.1109/ICRA.2015.7139383
Sebastian Thrun, Joelle Pineau, Geoff Gordon, Policy-contingent abstraction for robust robot control uncertainty in artificial intelligence. pp. 477- 484 ,(2002)
Tomás Lozano-Pérez, Leslie Pack Kaelbling, Emma Brunskill, Nicholas Roy, Continuous-State POMDPs with Hybrid Dynamics international symposium on artificial intelligence and mathematics. ,(2008)
Laurent Charlin, Pascal Poupart, Marc Toussaint, Hierarchical POMDP controller optimization by likelihood maximization uncertainty in artificial intelligence. pp. 562- 570 ,(2008)
Paul Fearnhead, Zhen Liu, On-line inference for multiple changepoint problems Journal of The Royal Statistical Society Series B-statistical Methodology. ,vol. 69, pp. 589- 605 ,(2007) , 10.1111/J.1467-9868.2007.00601.X
Jürgen Sturm, Learning kinematic models for articulated objects international joint conference on artificial intelligence. pp. 1851- 1856 ,(2009) , 10.1007/978-3-642-37160-8_4
Christos H. Papadimitriou, John N. Tsitsiklis, The Complexity of Markov Decision Processes Mathematics of Operations Research. ,vol. 12, pp. 441- 450 ,(1987) , 10.1287/MOOR.12.3.441
Patrick R. Barragan, Leslie Pack Kaelbling, Tomas Lozano-Perez, Interactive Bayesian Identification of Kinematic Mechanisms international conference on robotics and automation. pp. 2013- 2020 ,(2014) , 10.1109/ICRA.2014.6907126
P.H.S. Torr, A. Zisserman, MLESAC: A New Robust Estimator with Application to Estimating Image Geometry Computer Vision and Image Understanding. ,vol. 78, pp. 138- 156 ,(2000) , 10.1006/CVIU.1999.0832