Authors: Thomas L. Griffiths, Karthik Narasimhan, Mark K. Ho, Robert X. D. Hawkins, Theodore R. Sumers
DOI:
Keywords:
Abstract: We explore unconstrained natural language feedback as a learning signal for artificial agents. Humans use rich and varied language to teach, yet most prior work on interactive …