作者: Masakazu Hirkoawa , Kenji Suzuki
DOI: 10.1007/978-3-642-32177-1_3
关键词:
摘要: This chapter describes a novel methodology for behavior learning of an agent, called Coaching. The proposed method is interactive and iterative which allows human trainer to give subjective evaluation the robotic agent in real time, can update reward function dynamically based on this simultaneously. We demonstrated that capable desired by receiving simple instructions such as positive negative. approach also effective when it difficult determine suitable situation advance. have conducted several experiments with simulated robot arm system, advantage verified throughout those experiments.