Authors: Yisong Yue, Swarat Chaudhuri, Hoang M. Le, Abhinav Verma
DOI:
Keywords:
Abstract: We study the problem of programmatic reinforcement learning, in which policies are represented as short programs in a symbolic language. Programmatic policies can be more …