Trajectory-Based Dynamic Programming

作者： Christopher G. Atkeson , Chenggang Liu

关键词:

摘要: We informally review our approach to using trajectory optimization accelerate dynamic programming. Dynamic programming provides a way design globally optimal control laws for nonlinear systems. However, the curse of dimensionality, exponential dependence memory and computation resources needed on dimensionality state control, limits application in practice. explore trajectory-based programming, which combines many local optimizations global are able solve problems with less than grid-based approaches, we couldn’t before tabular or function approximation approaches.

参考文章(5)

Stefan Schaal, Christopher G. Atkeson, Robot Learning From Demonstration international conference on machine learning. pp. 12- 20 ,(1997)

Jennie Si, Andrew G Barto, Warren B Powell, Don Wunsch, Handbook of Learning and Approximate Dynamic Programming (2004). ,(2004) , 10.1109/9780470544785

Yuval Tassa, Tom Erez, Emanuel Todorov, Synthesis and stabilization of complex behaviors through online trajectory optimization intelligent robots and systems. pp. 4906- 4913 ,(2012) , 10.1109/IROS.2012.6386025

Dimitri P. Bertsekas, Dynamic Programming and Optimal Control Athena Scientific. ,(1995)

A.G. Barto, R.S. Sutton, Reinforcement Learning: An Introduction ,(1988)

Trajectory-Based Dynamic Programming

来源期刊

我的账户

Trajectory-Based Dynamic Programming

来源期刊

相似文章 0

我的账户