作者: S. Chernova , M. Veloso
DOI: 10.1613/JAIR.2584
关键词:
摘要: We present Confidence-Based Autonomy (CBA), an interactive algorithm for policy learning from demonstration. The CBA consists of two components which take advantage the complimentary abilities humans and computer agents. first component, Confident Execution, enables agent to identify states in demonstration is required, request a human teacher learn based on acquired data. selects demonstrations measure action selection confidence, our results show that using Execution requires fewer than when are selected by teacher. second algorithmic Corrective Demonstration, correct any mistakes made through additional order improve future task performance. its individual compared evaluated complex simulated driving domain. complete best overall performance, successfully reproducing behavior while balancing tradeoff between number incorrect actions during learning.