Learning Hybrid Object Kinematics for Efficient Hierarchical Planning Under Uncertainty

作者: Scott Niekum , Ajinkya Jain

DOI: 10.1109/IROS45743.2020.9340749

关键词: Reinforcement learningPartially observable Markov decision processMachine learningArtificial intelligenceObject (computer science)Computer scienceTrajectoryData modelingHybrid automatonLearning automataKinematicsInference

摘要: Sudden changes in the dynamics of robotic tasks, such as contact with an object or latching a door, are often viewed inconvenient discontinuities that make manipulation difficult. However, when these transitions well-understood, they can be leveraged to reduce uncertainty aid manipulation—for example, wiggling screw determine if it is fully inserted not. Current model-free reinforcement learning approaches require large amounts data learn leverage dynamics, scale poorly problem complexity grows, and do not transfer well significantly different problems. By contrast, hierarchical POMDP planning-based methods via plan decomposition, work on novel problems, directly consider uncertainty, but rely precise hand-specified models task decompositions. To combine advantages opposing paradigms, we propose new method, MICAH, which given unsegmented object’s motion under applied actions, (1) detects changepoints model using action-conditional inference, (2) estimates individual local their parameters, (3) converts them into hybrid automaton compatible planning. We show MICAH more accurate robust noise than prior approaches. Further, planner demonstrate learned rich enough used for performing tasks objects ways encountered during training.

参考文章(21)
J. Sturm, C. Stachniss, W. Burgard, A probabilistic framework for learning kinematic models of articulated objects Journal of Artificial Intelligence Research. ,vol. 41, pp. 477- 526 ,(2011) , 10.1613/JAIR.3229
Scott Niekum, Sarah Osentoski, Christopher G. Atkeson, Andrew G. Barto, Online Bayesian changepoint detection for articulated motion models international conference on robotics and automation. pp. 1468- 1475 ,(2015) , 10.1109/ICRA.2015.7139383
Sudeep Pillai, Matthew Walter, Seth Teller, Learning articulated motions from visual demonstration robotics science and systems. ,vol. 10, ,(2014) , 10.15607/RSS.2014.X.050
Tomás Lozano-Pérez, Leslie Pack Kaelbling, Emma Brunskill, Nicholas Roy, Continuous-State POMDPs with Hybrid Dynamics international symposium on artificial intelligence and mathematics. ,(2008)
Laurent Charlin, Pascal Poupart, Marc Toussaint, Hierarchical POMDP controller optimization by likelihood maximization uncertainty in artificial intelligence. pp. 562- 570 ,(2008)
Paul Fearnhead, Zhen Liu, On-line inference for multiple changepoint problems Journal of The Royal Statistical Society Series B-statistical Methodology. ,vol. 69, pp. 589- 605 ,(2007) , 10.1111/J.1467-9868.2007.00601.X
Patrick R. Barragan, Leslie Pack Kaelbling, Tomas Lozano-Perez, Interactive Bayesian Identification of Kinematic Mechanisms international conference on robotics and automation. pp. 2013- 2020 ,(2014) , 10.1109/ICRA.2014.6907126
P.H.S. Torr, A. Zisserman, MLESAC: A New Robust Estimator with Application to Estimating Image Geometry Computer Vision and Image Understanding. ,vol. 78, pp. 138- 156 ,(2000) , 10.1006/CVIU.1999.0832
Dov Katz, Oliver Brock, Manipulating articulated objects with interactive perception international conference on robotics and automation. pp. 272- 277 ,(2008) , 10.1109/ROBOT.2008.4543220
Dov Katz, Moslem Kazemi, J. Andrew Bagnell, Anthony Stentz, Interactive segmentation, tracking, and kinematic modeling of unknown 3D articulated objects international conference on robotics and automation. pp. 5003- 5010 ,(2013) , 10.1109/ICRA.2013.6631292