摘要: We propose the neural programmer-interpreter (NPI): a recurrent and compositional neural network that learns to represent and execute programs. NPI has three learnable …
Tom Schaul, Daniel Horgan, David Silver, Karol Gregor, Universal Value Function Approximatorsinternational conference on machine learning. pp. 1312- 1320 ,(2015)